Andrew Zisserman
#4,714
Most Influential Person Now
British computer scientist
Andrew Zisserman's AcademicInfluence.com Rankings
Andrew Zissermancomputer-science Degrees
Computer Science
#275
World Rank
#287
Historical Rank
Database
#12
World Rank
#12
Historical Rank
Download Badge
Computer Science
Why Is Andrew Zisserman Influential?
(Suggest an Edit or Addition)According to Wikipedia, Andrew Zisserman is a British computer scientist and a professor at the University of Oxford, and a researcher in computer vision. As of 2014 he is affiliated with DeepMind. Education Zisserman received the Part III of the Mathematical Tripos, and his PhD in theoretical physics from the Sunderland Polytechnic.
Andrew Zisserman's Published Works
Published Works
- Very Deep Convolutional Networks for Large-Scale Image Recognition (2014) (76530)
- Multiple View Geometry in Computer Vision (2001) (17306)
- The Pascal Visual Object Classes (VOC) Challenge (2010) (13911)
- Video Google: a text retrieval approach to object matching in videos (2003) (6999)
- Two-Stream Convolutional Networks for Action Recognition in Videos (2014) (6339)
- Spatial Transformer Networks (2015) (5655)
- Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps (2013) (5397)
- Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset (2017) (5214)
- Deep Face Recognition (2015) (4572)
- The Pascal Visual Object Classes Challenge: A Retrospective (2014) (4541)
- A Comparison of Affine Region Detectors (2005) (3456)
- Return of the Devil in the Details: Delving Deep into Convolutional Nets (2014) (3278)
- Multiple view geometry in computer vision (2. ed.) (2003) (3261)
- Object retrieval with large vocabularies and fast spatial matching (2007) (3052)
- The PASCAL visual object classes challenge 2006 (VOC2006) results (2006) (2553)
- Object class recognition by unsupervised scale-invariant learning (2003) (2524)
- The Kinetics Human Action Video Dataset (2017) (2422)
- Automated Flower Classification over a Large Number of Classes (2008) (2312)
- MLESAC: A New Robust Estimator with Application to Estimating Image Geometry (2000) (2239)
- Convolutional Two-Stream Network Fusion for Video Action Recognition (2016) (2224)
- VGGFace2: A Dataset for Recognising Faces across Pose and Age (2017) (1840)
- Non-local sparse models for image restoration (2009) (1744)
- Representing shape with a spatial pyramid kernel (2007) (1540)
- Lost in quantization: Improving particular object retrieval in large scale image databases (2008) (1540)
- VoxCeleb: A Large-Scale Speaker Identification Dataset (2017) (1506)
- Image Classification using Random Forests and Ferns (2007) (1426)
- Three things everyone should know to improve object retrieval (2012) (1366)
- VoxCeleb2: Deep Speaker Recognition (2018) (1355)
- Speeding up Convolutional Neural Networks with Low Rank Expansions (2014) (1285)
- Supervised Dictionary Learning (2008) (1166)
- Discovering objects and their location in images (2005) (1164)
- Synthetic Data for Text Localisation in Natural Images (2016) (1095)
- Efficient additive kernels via explicit feature maps (2010) (1064)
- Learning To Count Objects in Images (2010) (1041)
- A Statistical Approach to Texture Classification from Single Images (2004) (1035)
- Multiple view geometry in computer visiond (2001) (999)
- Reading Text in the Wild with Convolutional Neural Networks (2014) (986)
- The devil is in the details: an evaluation of recent feature encoding methods (2011) (954)
- Cats and dogs (2012) (944)
- Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval (2007) (921)
- Multiple kernels for object detection (2009) (908)
- Scene Classification Via pLSA (2006) (882)
- Learning object categories from Google's image search (2005) (846)
- A Visual Vocabulary for Flower Classification (2006) (839)
- Discriminative learned dictionaries for local image analysis (2008) (832)
- Single View Metrology (2000) (823)
- Scene Classification Using a Hybrid Generative/Discriminative Approach (2008) (799)
- Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition (2014) (787)
- Human Detection Based on a Probabilistic Assembly of Robust Part Detectors (2004) (785)
- Visual Reconstruction (1987) (775)
- Using Multiple Segmentations to Discover Objects and their Extent in Image Collections (2006) (750)
- Multiple View Geometry (2009) (739)
- Progressive search space reduction for human pose estimation (2008) (723)
- Non-uniform Deblurring for Shaken Images (2010) (718)
- All About VLAD (2013) (718)
- Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" (2002) (711)
- Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video (2006) (665)
- A Statistical Approach to Material Classification Using Image Patch Exemplars (2009) (645)
- Geometric invariance in computer vision (1992) (644)
- Look, Listen and Learn (2017) (643)
- Deep Features for Text Spotting (2014) (607)
- Lip Reading Sentences in the Wild (2016) (579)
- An Affine Invariant Salient Region Detector (2004) (569)
- Discovering object categories in image collections (2005) (557)
- Texture classification: are filter banks necessary? (2003) (541)
- European conference on computer vision (ECCV) (2006) (540)
- Efficient Visual Search of Videos Cast as Text Retrieval (2009) (511)
- Automatic Camera Recovery for Closed or Open Image Sequences (1998) (505)
- Near Duplicate Image Detection: min-Hash and tf-idf Weighting (2008) (504)
- Video Action Transformer Network (2018) (502)
- 3D Model Acquisition from Extended Image Sequences (1996) (494)
- Flowing ConvNets for Human Pose Estimation in Videos (2015) (494)
- Fisher Vector Faces in the Wild (2013) (481)
- Lip Reading in the Wild (2016) (478)
- Invariant Descriptors for 3D Object Recognition and Pose (1991) (460)
- Multi-task Self-Supervised Visual Learning (2017) (459)
- End-to-End Learning of Visual Representations From Uncurated Instructional Videos (2019) (457)
- Learning Visual Attributes (2007) (454)
- The VIA Annotation Software for Images, Audio and Video (2019) (452)
- Tracking People by Learning Their Appearance (2007) (452)
- The 2005 PASCAL Visual Object Classes Challenge (2005) (443)
- Harvesting Image Databases from the Web (2007) (439)
- Detect to Track and Track to Detect (2017) (435)
- Classifying Images of Materials: Achieving Viewpoint and Illumination Independence (2002) (429)
- Metric rectification for perspective images of planes (1998) (426)
- Blocks That Shout: Distinctive Parts for Scene Classification (2013) (424)
- OBJ CUT (2005) (407)
- Objects that Sound (2017) (404)
- A Boundary-Fragment-Model for Object Detection (2006) (402)
- Flamingo: a Visual Language Model for Few-Shot Learning (2022) (401)
- Deep Audio-Visual Speech Recognition (2018) (398)
- Geodesic star convexity for interactive image segmentation (2010) (390)
- A Statistical Approach to Texture Classification from Single Images (2005) (381)
- Out of Time: Automated Lip Sync in the Wild (2016) (375)
- Voxceleb: Large-scale speaker verification in the wild (2020) (374)
- Advances in Neural Information Processing Systems (NIPS) (2007) (369)
- Strike a pose: tracking people by finding stylized poses (2005) (369)
- Tabula rasa: Model transfer for object category detection (2011) (363)
- Wide baseline stereo matching (1998) (361)
- Perceiver: General Perception with Iterative Attention (2021) (359)
- A sparse object category model for efficient learning and exhaustive recognition (2005) (354)
- Microscopy cell counting and detection with fully convolutional regression networks (2018) (351)
- A Visual Category Filter for Google Images (2004) (339)
- Learning Local Feature Descriptors Using Convex Optimisation (2014) (338)
- Feature Based Methods for Structure and Motion Estimation (1999) (335)
- Automatic line matching across views (1997) (332)
- Vision Algorithms: Theory and Practice (2002) (331)
- Creating Architectural Models from Images (1999) (328)
- An Exemplar Model for Learning Object Classes (2007) (327)
- Markerless tracking using planar structures in the scene (2000) (326)
- Robust parameterization and computation of the trifocal tensor (1997) (312)
- The VGG Image Annotator (VIA) (2019) (302)
- A Short Note about Kinetics-600 (2018) (299)
- X2Face: A network for controlling face generation by using images, audio, and pose codes (2018) (290)
- Learning and Using the Arrow of Time (2018) (288)
- Recurrent Human Pose Estimation (2016) (288)
- Computer vision applied to super resolution (2003) (283)
- Automatic 3D Model Construction for Turn-Table Sequences (1998) (282)
- New Techniques for Automated Architectural Reconstruction from Photographs (2002) (273)
- Video Representation Learning by Dense Predictive Coding (2019) (273)
- “Who are you?” - Learning person specific classifiers from video (2009) (269)
- Video Google: Efficient Visual Search of Videos (2006) (267)
- Object Class Segmentation using Random Forests (2008) (264)
- Utterance-level Aggregation for Speaker Recognition in the Wild (2019) (262)
- Self-supervised Co-training for Video Representation Learning (2020) (260)
- Hand detection using multiple proposals (2011) (257)
- A plane measuring device (1999) (255)
- Unsupervised discovery of visual object class hierarchies (2008) (255)
- The Conversation: Deep Audio-Visual Speech Enhancement (2018) (254)
- Toward Category-Level Object Recognition (2006) (254)
- Automated mosaicing with super-resolution zoom (1998) (253)
- Applications of Invariance in Computer Vision (1993) (253)
- Sequential Updating of Projective and Affine Structure from Motion (1997) (251)
- Dataset Issues in Object Recognition (2006) (251)
- AUTOMATIC LINE MATCHING AND 3D RECONSTRUCTION OF BUILDINGS FROM MULTIPLE VIEWS (1999) (250)
- 2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images (2012) (250)
- Learning to Navigate in Cities Without a Map (2018) (249)
- Motion Deblurring and Super-resolution from an Image Sequence (1996) (249)
- Triangulation Embedding and Democratic Aggregation for Image Search (2014) (248)
- A Short Note on the Kinetics-700 Human Action Dataset (2019) (248)
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval (2021) (247)
- Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition (2007) (246)
- Shape recognition with edge-based features (2003) (246)
- Symbiotic Segmentation and Part Localization for Fine-Grained Categorization (2013) (244)
- Person Spotting: Video Shot Retrieval for Face Sets (2005) (243)
- Use What You Have: Video retrieval using representations from collaborative experts (2019) (240)
- Scalable near identical image and shot detection (2007) (237)
- Viewpoint invariant texture matching and wide baseline stereo (2001) (237)
- Super-resolution from multiple views using learnt image models (2001) (235)
- Taking the bite out of automated naming of characters in TV video (2009) (224)
- Automatic reconstruction of piecewise planar models from multiple views (1999) (223)
- Deep Structured Output Learning for Unconstrained Text Recognition (2014) (222)
- Image-Based Rendering Using Image-Based Priors (2003) (221)
- Robust computation and parametrization of multiple view relations (1998) (221)
- Perceiver IO: A General Architecture for Structured Inputs & Outputs (2021) (221)
- Self-Supervised MultiModal Versatile Networks (2020) (221)
- Counting in the Wild (2016) (215)
- Incremental learning of object detectors using a visual shape alphabet (2006) (212)
- Deblurring Shaken and Partially Saturated Images (2011) (211)
- A framework for spatiotemporal control in the tracking of visual contours (1993) (209)
- Learning Layered Motion Segmentations of Video (2005) (209)
- EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition (2019) (206)
- Deep Fisher Networks for Large-Scale Image Classification (2013) (205)
- BiCoS: A Bi-level co-segmentation method for image classification (2011) (202)
- Video data mining using configurations of viewpoint invariant regions (2004) (202)
- Learning to Detect Cells Using Non-overlapping Extremal Regions (2012) (196)
- Vggsound: A Large-Scale Audio-Visual Dataset (2020) (194)
- Emotion Recognition in Speech using Cross-Modal Transfer in the Wild (2018) (191)
- Object Level Grouping for Video Shots (2004) (187)
- The Geometry and Matching of Lines and Curves Over Multiple Views (2000) (186)
- Descriptor Learning for Efficient Retrieval (2010) (185)
- Turning a Blind Eye: Explicit Removal of Biases and Variation from Deep Neural Network Embeddings (2018) (184)
- Self-Calibration from Image Triplets (1996) (183)
- A Linguistic Feature Vector for the Visual Interpretation of Sign Language (2004) (182)
- Estimation of the partial volume effect in MRI (2002) (181)
- With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations (2021) (180)
- Human Focused Action Localization in Video (2010) (178)
- High Five: Recognising human interactions in TV shows (2010) (178)
- Automatic face recognition for film character retrieval in feature-length films (2005) (174)
- You said that? (2017) (173)
- Pose search: Retrieving people using their pose (2009) (173)
- Exploiting Temporal Context for 3D Human Pose Estimation in the Wild (2019) (172)
- The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences (1999) (172)
- Memory-augmented Dense Predictive Coding for Video Representation Learning (2020) (170)
- Learning sign language by watching TV (using weakly aligned subtitles) (2009) (168)
- Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching (2018) (166)
- CrossTransformers: spatially-aware few-shot transfer (2020) (166)
- Structured Learning of Human Interactions in TV Shows (2012) (164)
- Super-resolution enhancement of text image sequences (2000) (164)
- Pylon Model for Semantic Segmentation (2011) (162)
- LRS3-TED: a large-scale dataset for visual speech recognition (2018) (162)
- On Affine Invariant Clustering and Automatic Cast Listing in Movies (2002) (161)
- Combining scene and auto-calibration constraints (1999) (160)
- Interactive Object Counting (2014) (151)
- Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts (2008) (151)
- Robust Detection of Degenerate Configurations while Estimating the Fundamental Matrix (1998) (149)
- Template Adaptation for Face Verification and Identification (2016) (148)
- Extracting projective structure from single perspective views of 3D point sets (1993) (144)
- Deep Convolutional Neural Networks for Efficient Pose Estimation in Gesture Videos (2014) (142)
- Temporal Cycle-Consistency Learning (2019) (142)
- Self-Supervised Learning of Audio-Visual Objects from Video (2020) (141)
- Metric calibration of a stereo rig (1995) (138)
- Unsupervised Learning of Object Keypoints for Perception and Control (2019) (138)
- Robust Object Tracking (2001) (137)
- Planar grouping for automatic detection of vanishing lines and points (2000) (136)
- The truth about cats and dogs (2011) (136)
- A PLANE-SWEEP STRATEGY FOR THE 3D RECONSTRUCTION OF BUILDINGS FROM MULTIPLE IMAGES (2000) (131)
- Learning to Discover Novel Visual Categories via Deep Transfer Clustering (2019) (130)
- Automatic 3D model acquisition and generation of new images from video sequences (1998) (129)
- DisLocation: Scalable Descriptor Distinctiveness for Location Recognition (2014) (128)
- Canonical Frames for Planar Object Recognition (1992) (127)
- Diagnostically relevant facial gestalt information from ordinary photos (2014) (127)
- Geometric Grouping of Repeated Elements within Images (1998) (126)
- Navigation using Affine Structure from Motion (1994) (126)
- A Compact and Discriminative Face Track Descriptor (2014) (125)
- 3D Object Recognition Using Invariance (1995) (125)
- Get Out of my Picture! Internet-based Inpainting (2009) (125)
- Multibody Structure and Motion: 3-D Reconstruction of Independently Moving Objects (2000) (124)
- Maintaining multiple motion model hypotheses over many views to recover matching and structure (1998) (124)
- Detecting People Looking at Each Other in Videos (2014) (122)
- Bayesian Methods for Image Super-Resolution (2009) (122)
- Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection (2008) (119)
- The information available to a moving observer from specularities (1989) (119)
- Joint manifold distance: a new approach to appearance based clustering (2003) (119)
- Multiple queries for large scale specific object retrieval (2012) (118)
- Regression and classification approaches to eye localization in face images (2006) (117)
- Chimpanzee face recognition from videos in the wild using deep learning (2019) (116)
- In Search of Art (2014) (116)
- TriCoS: A Tri-level Class-Discriminative Co-segmentation Method for Image Classification (2012) (112)
- Surface descriptions from stereo and shading (1986) (111)
- Efficient discriminative learning of parts-based models (2009) (108)
- Lip Reading in Profile (2017) (107)
- Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation (2013) (106)
- Planar object recognition using projective shape representation (1995) (106)
- Minimal Training, Large Lexicon, Unconstrained Sign Language Recognition (2004) (105)
- Delving deeper into the whorl of flower segmentation (2010) (105)
- Goal-directed Video Metrology (1996) (104)
- ISSLS PRIZE IN BIOENGINEERING SCIENCE 2017: Automation of reading of radiological features from magnetic resonance images (MRIs) of the lumbar spine without human intervention is comparable with an expert radiologist (2017) (104)
- 3D Motion recovery via affine Epipolar geometry (1995) (103)
- Learnable PINs: Cross-Modal Embeddings for Person Identity (2018) (103)
- Object Mining Using a Matching Graph on Very Large Image Collections (2008) (102)
- Open-Set Recognition: A Good Closed-Set Classifier is All You Need (2021) (101)
- Sim2real transfer learning for 3D human pose estimation: motion to the rescue (2019) (101)
- Self-supervised learning of a facial attribute embedding from video (2018) (100)
- You Said That?: Synthesising Talking Faces from Audio (2019) (100)
- Smooth object retrieval using a bag of boundaries (2011) (100)
- Affine-invariant contour tracking with automatic control of spatiotemporal scale (1993) (99)
- Robust detection of degenerate configurations for the fundamental matrix (1995) (99)
- Quadric reconstruction from dual-space geometry (1998) (97)
- Delving into the Whorl of Flower Segmentation (2007) (97)
- Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval (2020) (97)
- Sparse kernel approximations for efficient classification and detection (2012) (96)
- Efficient Visual Search for Objects in Videos (2008) (95)
- OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues (2010) (93)
- A Sampled Texture Prior for Image Super-Resolution (2003) (92)
- Extending Pictorial Structures for Object Recognition (2004) (92)
- Reflections on Shading (1991) (91)
- Projectively invariant representations using implicit algebraic curves (1990) (91)
- SpineNet: Automated classification and evidence visualization in spinal MRIs (2017) (90)
- Deep Lip Reading: a comparison of models and an online application (2018) (89)
- Kickstarting Deep Reinforcement Learning (2018) (89)
- Learning Object Categories From Internet Image Searches (2010) (88)
- The State of the Art: Object Retrieval in Paintings using Discriminative Regions (2014) (88)
- Direct Estimation of Non-Rigid Registration (2004) (87)
- Seeing the Arrow of Time (2014) (87)
- Smooth Loss Functions for Deep Top-k Classification (2018) (86)
- Generalized RBF feature maps for Efficient Detection (2010) (86)
- New approach to obtain height measurements from video (1999) (85)
- Shape from symmetry: detecting and exploiting symmetry in affine images (1995) (85)
- Automatically Discovering and Learning New Visual Categories with Ranking Statistics (2020) (84)
- Improving Human Action Recognition Using Score Distribution and Ranking (2014) (84)
- Structured output regression for detection with partial truncation (2009) (84)
- Automated location matching in movies (2003) (83)
- Simultaneous Object Detection and Ranking with Weak Supervision (2010) (82)
- Bayesian Image Super-resolution, Continued (2006) (82)
- Personalizing Human Video Pose Estimation (2015) (81)
- Unifying statistical texture classification frameworks (2004) (81)
- Geometric Latent Dirichlet Allocation on a Matching Graph for Large-scale Image Datasets (2011) (81)
- BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues (2020) (81)
- Automated Scene Matching in Movies (2002) (80)
- Spot the conversation: speaker diarisation in the wild (2020) (80)
- The AVA-Kinetics Localized Human Actions Video Dataset (2020) (80)
- Bayesian Estimation of Layers from Multiple Images (2002) (79)
- Descriptor Learning Using Convex Optimisation (2012) (79)
- Resolving ambiguities in auto–calibration (1998) (79)
- SilNet : Single- and Multi-View Reconstruction by Learning from Silhouettes (2017) (78)
- Upper Body Detection and Tracking in Extended Signing Sequences (2011) (78)
- Domain-Adaptive Discriminative One-Shot Learning of Gestures (2014) (77)
- Self-supervised Video Object Segmentation by Motion Grouping (2021) (77)
- Broaden Your Views for Self-Supervised Video Learning (2021) (77)
- ASR is All You Need: Cross-Modal Distillation for Lip Reading (2019) (77)
- Euclidean Structure from Uncalibrated Images (1994) (76)
- Multicolumn Networks for Face Recognition (2018) (75)
- Motion From Point Matches Using Affine Epipolar Geometry (1994) (75)
- Microscopy cell counting with fully convolutional regression networks (2015) (75)
- Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers (2021) (74)
- Segmenting Scenes by Matching Image Composites (2009) (74)
- 2D Human Pose Estimation in TV Shows (2009) (72)
- Comparator Networks (2018) (72)
- A Short Note on the Kinetics-700-2020 Human Action Dataset (2020) (71)
- Active visual navigation using non-metric structure (1995) (71)
- Image-based environment matting (2002) (70)
- Identifying individuals in video by combining 'generative' and discriminative head models (2005) (69)
- Efficient model library access by projectively invariant indexing functions (1992) (69)
- Vertebrae Detection and Labelling in Lumbar MR Images (2014) (68)
- GhostVLAD for set-based face recognition (2018) (67)
- Crystal nucleation in metallic alloys using x-ray radiography and machine learning (2018) (67)
- Duality, Rigidity and Planar Parallax (1998) (67)
- Detecting overlapping instances in microscopy images using extremal region trees (2016) (66)
- Disentangled Speech Embeddings Using Cross-Modal Self-Supervision (2020) (66)
- Learning to lip read words by watching videos (2018) (65)
- Automatic and Efficient Human Pose Estimation for Sign Language Videos (2014) (65)
- VISOR: Towards On-the-Fly Large-Scale Object Category Retrieval (2012) (64)
- Class-Agnostic Counting (2018) (64)
- Localizing Visual Sounds the Hard Way (2021) (64)
- Fully‐automated alignment of 3D fetal brain ultrasound to a canonical reference space using multi‐task learning (2018) (63)
- Efficient retrieval of deformable shape classes using local self-similarities (2009) (63)
- Mutual illumination (1989) (63)
- The Geometry and Matching of Curves in Multiple Views (1998) (62)
- Matching and Reconstruction from Widely Separated Views (1998) (61)
- Large-scale Learning of Sign Language by Watching TV (Using Co-occurrences) (2013) (60)
- Talking Heads: Detecting Humans and Recognizing Their Interactions (2014) (59)
- A Hierarchical Probabilistic U-Net for Modeling Multi-Scale Ambiguities (2019) (59)
- Single Axis Geometry by Fitting Conics (2002) (59)
- Camera Calibration Using Multiple Images (1992) (59)
- SpineNet: Automatically Pinpointing Classification Evidence in Spinal MRIs (2016) (59)
- Self-supervised Learning for Spinal MRIs (2017) (59)
- Single-Histogram Class Models for Image Segmentation (2006) (59)
- Multi-Task Convolutional Neural Network for Patient Detection and Skin Segmentation in Continuous Non-Contact Vital Sign Monitoring (2017) (59)
- Segmentation and measurement of brain structures in MRI including confidence bounds (2000) (58)
- Recognizing general curved objects efficiently (1992) (58)
- Automatic Discovery and Optimization of Parts for Image Classification (2014) (58)
- NightOwls: A Pedestrians at Night Dataset (2018) (58)
- A Six Point Solution for Structure and Motion (2000) (57)
- VHS to VRML: 3D graphical models from video sequences (1999) (57)
- Detecting People Looking at Each Other in Videos (2013) (56)
- Amplifying Key Cues for Human-Object-Interaction Detection (2020) (56)
- TeachText: CrossModal Generalized Distillation for Text-Video Retrieval (2021) (55)
- Direct Estimation of Non-Rigid Registrations (2004) (55)
- A Better Baseline for AVA (2018) (55)
- Class-based grouping in perspective images (1995) (55)
- Surface Reconstruction from Multiple Views Using Apparent Contours and Surface Texture (2000) (55)
- Relative motion and pose from arbitrary plane curves (1992) (55)
- My lips are concealed: Audio-visual speech enhancement through obstructions (2019) (55)
- Counting Out Time: Class Agnostic Video Repetition Counting in the Wild (2020) (55)
- Relaxed Softmax: Efficient Confidence Auto-Calibration for Safe Pedestrian Detection (2018) (54)
- Affine and Projective Structure from Motion (1992) (54)
- Non-contact physiological monitoring of preterm infants in the Neonatal Intensive Care Unit (2019) (53)
- Efficient recognition of rotationally symmetric surfaces and straight homogeneous generalized cylinders (1993) (53)
- Synthetic Humans for Action Recognition from Unseen Viewpoints (2019) (52)
- Learning Class-Specific Edges for Object Detection and Segmentation (2006) (52)
- An Invariant Large Margin Nearest Neighbour Classifier (2007) (51)
- Discriminative Sub-categorization (2013) (51)
- Automated Person Identification in Video (2004) (51)
- Solving Markov Random Fields using Second Order Cone Programming Relaxations (2006) (50)
- Concerning Bayesian Motion Segmentation, Model, Averaging, Matching and the Trifocal Tensor (1998) (50)
- An experimental evaluation of projective invariants (1992) (50)
- Introduction—towards a new framework for vision (1992) (49)
- Visualising Cerebral Asymmetry (1996) (49)
- On-the-fly learning for visual search of large-scale image and video datasets (2015) (49)
- Humanising GrabCut: Learning to segment humans using the Kinect (2011) (48)
- VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge (2020) (47)
- Temporal Query Networks for Fine-grained Video Understanding (2021) (46)
- Shape from shading in the light of mutual illumination (1990) (46)
- Learning to Detect Partially Overlapping Instances (2013) (45)
- What have We Learned from Deep Representations for Action Recognition? (2018) (45)
- Enhancing Exemplar SVMs using Part Level Transfer Regularization (2012) (45)
- Overcoming Registration Uncertainty in Image Super-Resolution: Maximize or Marginalize? (2007) (45)
- Repeated Structures: Image Correspondence Constraints and 3D Structure Recovery (1993) (45)
- A Geometric Approach to Obtain a Bird's Eye View From an Image (2019) (45)
- Shape from Texture: Homogeneity Revisited (2000) (45)
- Appendix—projective geometry for machine vision (1992) (44)
- Parallax geometry of smooth surfaces in multiple views (1999) (44)
- Domain Adaptation for Upper Body Pose Tracking in Signed TV Broadcasts (2013) (44)
- Geometry of Single Axis Motions Using Conic Fitting (2003) (43)
- Learning epipolar geometry from image sequences (2003) (43)
- VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge (2019) (43)
- Toward Category-Level Object Recognition (Lecture Notes in Computer Science) (2007) (43)
- Face Painting: querying art with photos (2015) (42)
- Geometric LDA: A Generative Model for Particular Object Discovery (2008) (42)
- Learning Layered Pictorial Structures from Video (2004) (42)
- From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script (2018) (41)
- Condensed Movies: Story Based Retrieval with Contextual Embeddings (2020) (41)
- Speech2Action: Cross-Modal Supervision for Action Recognition (2020) (41)
- Object Discovery with a Copy-Pasting GAN (2019) (41)
- Efficient object retrieval from videos (2004) (41)
- Has My Algorithm Succeeded? An Evaluator for Human Pose Estimators (2012) (41)
- Vision based Interpretation of Natural Sign Languages (2003) (40)
- Total Cluster: A person agnostic clustering method for broadcast videos (2014) (40)
- Multiple View Geometry in Computer Vision: N-View Geometry (2004) (40)
- Performance characterization of fundamental matrix estimation under image degradation (1997) (40)
- Who Are You? - Real-time Person Identification (2007) (40)
- Layered neural rendering for retiming people in video (2020) (39)
- The StreetLearn Environment and Dataset (2019) (38)
- Visual Grounding in Video for Unsupervised Word Translation (2020) (38)
- Semi-Supervised Learning with Scarce Annotations (2019) (38)
- Using a mixed wave/ diffusion process to elicit the symmetry set (1989) (38)
- On-the-fly specific person retrieval (2012) (38)
- Subtitle-free Movie to Script Alignment (2009) (37)
- Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video (2017) (37)
- Linear auto-calibration for ground plane motion (2003) (37)
- The Visual Centrifuge: Model-Free Layered Video Representations (2018) (37)
- LAEO-Net: Revisiting People Looking at Each Other in Videos (2019) (37)
- Motion Clustering using the Trilinear Constraint over Three Views (1995) (37)
- Planar homologies as a basis for grouping and recognition (1998) (36)
- Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation (2020) (36)
- Generalized Category Discovery (2022) (36)
- AXES at TRECVID 2012: KIS, INS, and MED (2012) (36)
- Deep Frank-Wolfe For Neural Network Optimization (2018) (35)
- Detecting and Tracking Linear Features Efficiently (1996) (35)
- Visual Vocabulary with a Semantic Twist (2014) (35)
- Detection and tracking of independent motion (1995) (35)
- The AXES submissions at TRECVID 2013 (2013) (35)
- A Case Against Epipolar Geometry (1993) (35)
- Optimizing and Learning for Super-resolution (2006) (34)
- Learning equivariant structured output SVM regressors (2011) (34)
- Training Neural Networks for and by Interpolation (2019) (34)
- Using Projective Invariants for Constant Time Library Indexing in Model Based Vision (1991) (33)
- Slow-Fast Auditory Streams for Audio Recognition (2021) (33)
- Recognising rotationally symmetric surfaces from their outlines (1992) (33)
- Massively Parallel Video Networks (2018) (32)
- Invariance-a new framework for vision (1990) (32)
- Estimating illumination direction from textured images (2004) (31)
- Time-lapse imagery and volunteer classifications from the Zooniverse Penguin Watch project (2018) (31)
- AutoNovel: Automatically Discovering and Learning Novel Visual Categories (2021) (31)
- Fusing Shape and Appearance Information for Object Category Detection (2006) (31)
- Viewpoint-invariant representation of generalized cylinders using the symmetry set (1994) (30)
- Interferences in Match Kernels (2016) (30)
- Finding nemo: Deformable object class modelling using curve matching (2010) (29)
- IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1989, 4-8 June, 1989, San Diego, CA, USA (1989) (29)
- An Experimental Comparison of Appearance and Geometric Model Based Recognition (1996) (29)
- Seeing wake words: Audio-visual Keyword Spotting (2020) (29)
- Object Representation in Computer Vision II (1996) (29)
- Stereo Autocalibration from One Plane (2000) (29)
- Quadric Surface Reconstruction from Dual-Space Geometry. (1998) (28)
- Cardio-respiratory signal extraction from video camera data for continuous non-contact vital sign monitoring using deep learning (2019) (28)
- Model Selection for Automated Architectural Reconstruction from Multiple Views (2002) (28)
- PASS: An ImageNet replacement for self-supervised pretraining without humans (2021) (28)
- Finding Point Correspondences in Motion Sequences Preserving Affine Structure (1997) (28)
- Real-time Panoramic Mosaics and Augmented Reality (1998) (28)
- Efficient image retrieval for 3D structures (2010) (28)
- Bringing Pictorial Space to Life: computer techniques for the analysis of paintings (2002) (27)
- Object discovery and representation networks (2022) (26)
- Omnimatte: Associating Objects and Their Effects in Video (2021) (26)
- Automatic and Efficient Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts (2012) (26)
- Efficient On-the-fly Category Retrieval Using ConvNets and GPUs (2014) (26)
- A three dimensional mid sagittal plane for brain asymmetry measurement (1996) (26)
- Texture classification with minimal training images (2008) (25)
- Computer Vision – ECCV 2008 (2008) (25)
- CLAROS - Bringing Classical Art to a Global Public (2009) (25)
- Non-contact physiological monitoring of preterm infants in the Neonatal Intensive Care Unit. (2019) (25)
- Sub-word Level Lip Reading With Visual Attention (2021) (25)
- Fast and Controllable 3D Modelling From Silhouettes (2005) (24)
- Video retrieval by mimicking poses (2012) (24)
- 3D Shape Attributes (2016) (24)
- The Art of Detection (2016) (24)
- A Sparse Object Category Model for Efficient Learning and Complete Recognition (2006) (24)
- Automatic Camera Tracking (2003) (24)
- Semi-supervised Learning of Facial Attributes in Video (2010) (24)
- Visual navigation around curved obstacles (1991) (23)
- Projective Reconstruction of Surfaces of Revolution (2003) (23)
- NeRF in detail: Learning to sample for view synthesis (2021) (23)
- Label, Verify, Correct: A Simple Few Shot Object Detection Method (2021) (23)
- Future Event Prediction: If and When (2019) (22)
- Multiple View Geometry in Computer Vision: Epipolar Geometry and the Fundamental Matrix (2004) (22)
- Grouping and invariants using planar homologies (1995) (22)
- Extracting structure from an affine view of a 3D point set with one or two bilateral symmetries (1994) (22)
- Watch, read and lookup: learning to spot signs from multiple supervisors (2020) (22)
- Co-Attention for Conditioned Image Matching (2021) (22)
- With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition (2021) (22)
- Statistical Approaches to Material Classification (2002) (22)
- From Same Photo: Cheating on Visual Kinship Challenges (2018) (21)
- Self-similar Sketch (2012) (21)
- Deep Insights into Convolutional Networks for Video Recognition (2019) (21)
- Real-time Visual Tracking for Surveillance and Path Planning (1992) (21)
- Tracking People and Recognizing Their Activities (2005) (20)
- 3D Structure from Images — SMILE 2000 (2001) (20)
- Learning to Read by Spelling: Towards Unsupervised Text Recognition (2018) (20)
- Fast recognition using algebraic invariants (1992) (20)
- Multiple View Geometry in Computer Vision: Scene planes and homographies (2004) (20)
- Transformational invariance - a primer (1992) (20)
- Minimal projective reconstruction for combinations of points and lines in three views (2004) (20)
- Automated Radiological Grading of Spinal MRI (2015) (20)
- Sight to Sound: An End-to-End Approach for Visual Piano Transcription (2020) (20)
- VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge (2022) (19)
- Semi-local projective invariants for the recognition of smooth plane curves (1996) (19)
- Model selection for automated reconstruction from multiple views (2002) (19)
- Automated audiovisual behavior recognition in wild primates (2021) (19)
- Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree (2014) (19)
- Of Gods and Goats: Weakly Supervised Learning of Figurative Art (2013) (19)
- Automatic 3D model building from video sequences (1997) (19)
- Vision algorithms : theory and practice : International Workshop on Vision Algorithms, Corfu, Greece, September 21-22, 1999 : proceedings (2000) (19)
- Immediate Structured Visual Search for Medical Images (2011) (18)
- Taxonomic Multi-class Prediction and Person Layout Using Efficient Structured Ranking (2012) (18)
- Visual exploration of free-space (1993) (18)
- Read and Attend: Temporal Localisation in Sign Language Videos (2021) (18)
- Sim2real transfer learning for 3D pose estimation: motion to the rescue (2019) (18)
- Using global consistency to recognise Euclidean objects with an uncalibrated camera (1994) (18)
- On film character retrieval in feature-length films (2006) (18)
- Video Registration (2003) (18)
- Improving Augmented Reality using Image and Scene Constraints (1999) (17)
- Monocular Depth Estimation with Self-supervised Instance Adaptation (2020) (17)
- QUERYD: A Video Dataset with High-Quality Text and Audio Narrations (2020) (17)
- Upper Body Pose Estimation with Temporal Sequential Forests (2014) (16)
- Adaptive Text Recognition through Visual Matching (2020) (16)
- Temporal Alignment Networks for Long-term Video (2022) (16)
- Action Recognition From Weak Alignment of Body Parts (2014) (16)
- Classifying materials from images: to cluster or not to cluster? (2002) (16)
- Faces in Places: compound query retrieval (2016) (16)
- Augmenting images of non-rigid scenes using point and curve correspondences (2004) (15)
- Beyond Metadata: Searching Your Archive Based on its Audio-visual Content (2014) (15)
- A Fourier series approach to magnetostatic field calculations involving magnetic material (1983) (15)
- Weak Continuity Constraints Generate Uniform Scale-Space Descriptions of Plane Curves (1986) (15)
- Localised photoplethysmography imaging for heart rate estimation of pre-term infants in the clinic (2018) (15)
- A CLIP-Hitchhiker's Guide to Long Video Retrieval (2022) (15)
- Discovering Objects and their Localization in Images (2005) (14)
- Signs in time: Encoding human motion as a temporal image (2016) (14)
- Employing signed TV broadcasts for automated learning of British Sign Language (2010) (14)
- Automated architectural acquisition from a camera undergoing planar motion (2001) (14)
- Automatic Modic Changes Classification in Spinal MRI (2015) (14)
- Name that sculpture (2012) (14)
- LSD-C: Linearly Separable Deep Clusters (2020) (14)
- Human pose search using deep poselets (2015) (13)
- Analytic solutions for axisymmetric magnetostatic systems involving iron (1987) (13)
- D2D: Learning to find good correspondences for image matching and manipulation (2020) (13)
- Automated Architecture Reconstruction from Close-range Photogrammetry ∗ (2001) (13)
- Discriminative Semi-Markov Models for automated mitotic phase labelling (2012) (13)
- Efficient Visual Content Retrieval and Mining in Videos (2004) (13)
- Training Data (2017) (13)
- Learning to Count Cells: Applications to lens-free imaging of large fields (2011) (12)
- Thread-Safe: Towards Recognizing Human Actions Across Shot Boundaries (2014) (12)
- Multiple View Geometry in Computer Vision: Estimation – 2D Projective Transformations (2004) (12)
- Face, Body, Voice: Video Person-Clustering with Multiple Modalities (2021) (12)
- Playing a Part: Speaker Verification at the movies (2020) (12)
- AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations (2019) (12)
- Human pose search using deep networks (2017) (12)
- A Convolutional Approach to Vertebrae Detection and Labelling in Whole Spine MRI (2020) (12)
- Trusting SVM for Piecewise Linear CNNs (2016) (11)
- Oxford TRECVid 2007 - Notebook paper (2007) (11)
- Qualitative surface shape from deformation of image curves (1992) (11)
- Aligning Subtitles in Sign Language Videos (2021) (11)
- Compact Deep Aggregation for Set Retrieval (2018) (11)
- Geometry-Aware Video Object Detection for Static Cameras (2019) (11)
- Eliciting qualitative structure from image curve deformations (1993) (10)
- The AXES PRO video search system (2013) (10)
- Automated Video Face Labelling for Films and TV Material (2020) (10)
- Audio-Visual Synchronisation in the wild (2021) (10)
- Multiple View Geometry in Computer Vision: Iterative Estimation Methods (2004) (10)
- RareAct: A video dataset of unusual interactions (2020) (10)
- Digital Art History: A subject in transition (2005) (10)
- Oxford TRECVID 2006 - Notebook paper (2006) (10)
- Measurement of Brain Structures Based on Statistical and Geometrical 3D Segmentation (1998) (10)
- Automatic Intervertebral Discs Localization and Segmentation: A Vertebral Approach (2015) (10)
- BBC-Oxford British Sign Language Dataset (2021) (10)
- Content-based image recognition on printed broadside ballads: The Bodleian Libraries' ImageMatch Tool (2013) (10)
- Cooperating Motion Processes (1991) (10)
- Input-level Inductive Biases for 3D Reconstruction (2021) (10)
- Segmenting Invisible Moving Objects (2021) (10)
- Construction and Exploitation of Sign Language Corpora. 3rd Workshop on the Representation and Processing of Sign Languages (2008) (10)
- Multiple View Geometry in Computer Vision: Preface (2004) (10)
- Immediate, Scalable Object Category Detection (2014) (10)
- Localising discontinuities using weak continuity constraints (1987) (10)
- Part level transfer regularization for enhancing exemplar SVMs (2015) (9)
- Automated visual identification of characters in situation comedies (2004) (9)
- Inducing Predictive Uncertainty Estimation for Face Verification (2020) (9)
- Program F. Matinv for the solution of the sparse linear system Ax=b (1984) (9)
- Automated multisensor polyhedral model acquisition (2003) (9)
- Extracting Projective Information from Single Views of 3D Point Sets (1993) (9)
- Hierarchical Perceiver (2022) (9)
- Rectification of elemental image set and extraction of lens lattice by projective image transformation in integral imaging (2010) (9)
- AXES at TRECVID 2011 (2011) (9)
- Latent SVMs for Human Detection with a Locally Affine Deformation Field (2012) (9)
- Learning to Predict 3D Surfaces of Sculptures from Single and Multiple Views (2018) (8)
- Oxford/IIIT TRECVID 2008 - Notebook paper (2008) (8)
- Relative motion and pose from invariants (1990) (8)
- CLAROS - Collaborating on Delivering the Future of the Past (2011) (8)
- Estimation with Bilinear Constraints in Computer Vision (1998) (8)
- Automatic retrieval of visual continuity errors in movies (2009) (8)
- Immediate ROI Search for 3-D Medical Images (2012) (8)
- From Images to 3D Shape Attributes (2016) (8)
- Dynamic Time Warping for Automated Cell Cycle Labelling (2011) (8)
- Self-Supervised Multi-Modal Alignment for Whole Body Medical Imaging (2021) (7)
- Proceedings of the Second Joint European - US Workshop on Applications of Invariance in Computer Vision (1993) (7)
- Magnetostatic field calculations in the presence of iron using a Green’s function approach (1983) (7)
- Multiple View Geometry in Computer Vision: Camera Models (2004) (7)
- Two-View Geometry (2004) (7)
- ExTOL: Automatic recognition of British Sign Language using the BSL Corpus (2019) (7)
- From Images to Virtual and Augmented Reality (2000) (7)
- Segmenting Moving Objects via an Object-Centric Layered Representation (2022) (7)
- Multiple View Geometry in Computer Vision: Camera Geometry and Single View Geometry (2004) (7)
- Constrained Video Face Clustering using1NN Relations (2020) (7)
- Self-Supervised Learning of Class Embeddings from Video (2019) (7)
- Combined statistical and geometrical 3D segmentation and measurement of brain structures (1998) (7)
- Projective Geometry and Transformations of 2D (2004) (7)
- Processing citizen science- and machine-annotated time-lapse imagery for biologically meaningful metrics (2020) (7)
- The AXES research video search system (2014) (7)
- Visual Keyword Spotting with Attention (2021) (6)
- It's About Time: Analog Clock Reading in the Wild (2021) (6)
- Identifying Scoliosis in Population-Based Cohorts: Automation of a Validated Method Based on Total Body Dual Energy X-ray Absorptiometry Scans (2020) (6)
- Automated detection and identification of persons in video using a coarse 3-D head model and multiple texture maps (2005) (6)
- Grouping and Structure Recovery for Images of Objects with Finite Rotational Symmetry (2001) (6)
- Radiological Grading of Spinal MRI (2014) (6)
- The calculation of magnetostatic fields from axisymmetric conductors (1996) (6)
- Computer vision -- ECCV 2008 : 10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008 : proceedings (2008) (6)
- Integrating Geometric and Photometric Information for Image Retrieval (1999) (6)
- Multiple View Geometry in Computer Vision: Algorithm Evaluation and Error Analysis (2004) (6)
- Automated Video Labelling: Identifying Faces by Corroborative Evidence (2021) (6)
- Predicting Scoliosis in DXA Scans Using Intermediate Representations (2018) (6)
- The analysis of 3-D shape: psychophysical principles and neural mechanisms. (1992) (6)
- A GFUN approach to include the effects of iron on coil systems with cylindrical symmetry (1984) (6)
- Identification of events from 3D volumes of seismic data (1994) (5)
- Transfer learning for object category detection (2014) (5)
- Efficient Visual Search for Objects in Videos Visual search using text-retrieval methods can rapidly and accurately locate objects in videos despite changes in camera viewpoint, lighting, and partial occlusions. (2008) (5)
- Controllable Attention for Structured Layered Video Decomposition (2019) (5)
- Assessing the significance of performance differences on the PASCAL VOC challenges via bootstrapping (2013) (5)
- The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020) (2020) (5)
- Multiple View Geometry in Computer Vision: Introduction – a Tour of Multiple View Geometry (2004) (5)
- Re-presentations of Art Collections (2014) (5)
- Multiple View Geometry in Computer Vision: Computation of the Fundamental Matrix F (2004) (5)
- Accurate Rendering of Curved Shadows and Interreflections (1993) (5)
- Knowledge source for describing stereoscopically viewed textured surfaces (1987) (5)
- SDL: Supervised Dictionary Learning (2008) (5)
- Understanding Higher-Order Shape via 3D Shape Attributes (2016) (5)
- Temporal models for mitotic phase labelling (2014) (4)
- Computation of the Fundamental Matrix F (2004) (4)
- The effect of iron of constant permeability on the magnetostatic field of axisymmetric conductors (1983) (4)
- The Use and Reuse of Printed Illustrations in 15th-Century Venetian Editions (2020) (4)
- Count, Crop and Recognise: Fine-Grained Recognition in the Wild (2019) (4)
- Proceedings of the International Workshop on Vision Algorithms: Theory and Practice (1999) (4)
- An Automated System For Picking Seismic Events (1993) (4)
- Estimating the Affine Transformation between Textures (2005) (4)
- Magnetostatic field calculations involving iron using an eigenfunction expansion (1983) (4)
- Invariant Scene Retrieval using Textured Regions (2004) (4)
- Applications of Invariance in Computer Vision: Second Joint European - US Workshop, Ponta Delgada, Azores, Portugal, October 9 - 14, 1993. Proceedings (1994) (4)
- An Object Category Specific mrffor Segmentation (2006) (4)
- A legendre polynomial BEM for axisymmetric coil systems including Iron (1985) (4)
- Automated reconstruction from multiple photographs (2002) (4)
- Discussion for direct versus features session (2000) (4)
- Automated method for the removal of unwanted nonperiodic patterns from forensic images (1999) (3)
- 3D Reconstruction of Cameras and Structure (2004) (3)
- Compressed Vision for Efficient Video Understanding (2022) (3)
- Inductive Visual Localisation: Factorised Training for Superior Generalisation (2018) (3)
- Report on the 1996 International Workshop on Object Representation in Computer Vision (1996) (3)
- Visual reconstruction and the GNC algorithm (1988) (3)
- Oxford TRECVid 2007 \u2013 Notebook paper (2007) (3)
- Multiple View Geometry in Computer Vision: More Single View Geometry (2004) (3)
- SpineNetV2: Automated Detection, Labelling and Radiological Grading Of Clinical MR Scans (2022) (3)
- Context-Aware Transformers For Spinal Cancer Detection and Radiological Grading (2022) (3)
- Discovery of Rare Phenotypes in Cellular Images Using Weakly Supervised Deep Learning (2017) (3)
- Multiple View Geometry in Computer Vision: Tensor Notation (2004) (3)
- Scaling Up Sign Spotting Through Sign Language Dictionaries (2022) (3)
- Layer Recurrent Neural Networks (2017) (3)
- Mitotic phase based detection of chromosome segregation errors in embryonic stem cells (2013) (3)
- Sampling Methods for Unsupervised Learning (2004) (3)
- Is an Object-Centric Video Representation Beneficial for Transfer? (2022) (3)
- Equivalence of different solutions of the vector potential for a thick solenoid (1983) (3)
- Zorro: the masked multimodal transformer (2023) (3)
- SHAPE-FROM-SHADING WITH MUTUAL ILLUMINATION (1991) (3)
- Automatic dense annotation of large-vocabulary sign language videos (2022) (3)
- Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation (2022) (2)
- Extraction of events from 3D volumes of seismic data (1994) (2)
- Oxford TRECVID 2008 - Notebook paper (2008) (2)
- IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1992, Proceedings, 15-18 June, 1992, Champaign, Illinois, USA (2009) (2)
- Trainable Visual Models for Object Class Recognition (2004) (2)
- N-View Computational Methods (2004) (2)
- CounTR: Transformer-based Generalised Visual Counting (2022) (2)
- Tracking moving heads (1995) (2)
- Now You're Speaking My Language: Visual Language Identification (2020) (2)
- Book Review : Epipolar Geometry in Stereo, Motion and Object Recognition—A Unified Approach By Gang Xu and Zhengyou Zhang Published by Kluwer Academic Publishers Group; 1996; 313 pages; US$ 160 (1998) (2)
- Multiple View Geometry in Computer Vision: Three-View Geometry (2004) (2)
- MORSE: An Architecture for 3D Object Recognition Based on Invariants (1995) (2)
- Local Statistical Operators for Texture Classification (2008) (2)
- On two-dimesional quadrature for potential problems (1983) (2)
- Author Correction: Time-lapse imagery and volunteer classifications from the Zooniverse Penguin Watch project (2019) (2)
- Distinctive Representations for the Recognition of Curved Surfaces Using Outlines and Markings (1994) (2)
- Augmented Reality using uncalibrated video sequences. Discussion (2001) (2)
- Object representation in computer vision II : ECCV '96 International Workshop, Cambridge, U.K., April 13-14, 1996 : proceedings (1996) (2)
- Comment on Stochastic Polyak Step-Size: Performance of ALI-G (2021) (2)
- IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1991, 3-6 June, 1991, Lahaina, Maui, Hawaii, USA (1991) (2)
- The Background: Projective Geometry, Transformations and Estimation (2004) (2)
- Towards qualitative vision: motion parallax (1990) (2)
- Perception Test : A Diagnostic Benchmark for Multimodal Models (2022) (2)
- Time-lapse imagery and volunteer classifications from the Zooniverse Penguin Watch project (2018) (2)
- In Memoriam: Mark Everingham (2012) (2)
- Computer Vision - ECCV 2008, 10th European Conference on Computer Vision, Marseille, France, October 12-18, 2008, Proceedings, Part IV (2008) (2)
- Predicting Spine Geometry and Scoliosis from DXA Scans (2019) (2)
- Projective Invariants for Geometric Calibration in Flat-Panel Computed Tomography (2018) (2)
- Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors (2022) (2)
- Seismic Time Section Analysis Using Machine Vision (1993) (2)
- Tutorial I: The Algebraic Approach to Invariance (1994) (1)
- LAEO-Net++: Revisiting People Looking at Each Other in Videos (2020) (1)
- Proceedings of the International Workshop on Object Representation in Computer Vision II (1996) (1)
- Photometric Invariants Related to Solid Shape 301 (2019) (1)
- WhisperX: Time-Accurate Speech Transcription of Long-Form Audio (2023) (1)
- Mining Faces from Biomedical Literature using Deep Learning (2017) (1)
- The Discrete Problem (2003) (1)
- Prolate harmonics and axisymmetric conductor systems (1988) (1)
- 3D Surface Reconstruction by Pointillism (2018) (1)
- SeeHear: Signer Diarisation and a New Dataset (2021) (1)
- Towards on-the-fly Large Scale Video Search (2013) (1)
- Multiple View Geometry in Computer Vision: Gaussian (Normal) and χ2 Distributions (2004) (1)
- HTX: a tool for the exploration and visualization of high-throughput image assays (2017) (1)
- 3d Model Acquisition from Extended Image Sequences 3d Model Acquisition from Extended Image Sequences (1995) (1)
- Multiple View Geometry in Computer Vision: Some Special Plane Projective Transformations (2004) (1)
- TAP-Vid: A Benchmark for Tracking Any Point in a Video (2022) (1)
- Semi-supervised learning and recognition of object classes (2004) (1)
- Vision-Language Modelling For Radiological Imaging and Reports In The Low Data Regime (2023) (1)
- Projective Geometry and Transformations of 3D (2004) (1)
- Program F. MATINV for the solution of the sparse linear system Ax = b including normalisation (1984) (1)
- A Light Touch Approach to Teaching Transformers Multi-view Geometry (2022) (1)
- AXES at TRECVid 2013 (2013) (1)
- A surface integral approach to determine the effects of iron on axisymmetric coil systems (1984) (1)
- Technical Report: Articulated Part-based Model for Joint Object Detection and Pose Estimation (2011) (1)
- Age and disc degeneration in low back pain: automated analysis enables a magnetic resonance imaging comparison of large cross-sectional cohorts of symptomatic and asymptomatic subjects. (2021) (1)
- Where Is the Archive? (1975) (1)
- Visual guidance for robot motion (1991) (1)
- The Trifocal Tensor (2004) (1)
- Visual Analysis of Chapbooks Printed in Scotland (2021) (1)
- Multiple View Geometry in Computer Vision: Least-squares Minimization (2004) (1)
- INVARIANT DESCRIPTORS FOR IMAGE MATCHING (2001) (1)
- Uncalibrated X-Ray Stereo Reconstruction (1995) (1)
- The Graduated Non-Convexity Algorithm (2003) (1)
- SIGNER DIARISATION IN THE WILD (2021) (1)
- Glossary of notation (2003) (1)
- British Machine Vision Conference, 2014 (2014) (1)
- Author response: Diagnostically relevant facial gestalt information from ordinary photos (2014) (1)
- A Tri-Layer Plugin to Improve Occluded Detection (2022) (1)
- Efficient SVM based Object Classification and Detection (2010) (1)
- Processing citizen science- and machine-annotated time-lapse imagery for biologically meaningful metrics (2020) (1)
- Proceedings of the 10th European Conference on Computer Vision: Part III (2008) (1)
- A progressive scheme for stereo matching. Discussion (2001) (1)
- Efficient, blind, spatially-variant deblurring for shaken images (2014) (1)
- Combining geometric and photometric information (1998) (1)
- Affine Epipolar Geometry (2004) (1)
- Yuning Chai Recognition between a Large Number of Flower Species Master Thesis (2011) (1)
- Personalised CLIP or: how to find your vacation videos (2022) (1)
- Turbo Training with Token Dropout (2022) (1)
- Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences: 356 (1740) (1998) (1)
- Multiframe Super-Resolution from a Bayesian Perspective (2017) (1)
- N-Linearities and Multiple View Tensors (2004) (1)
- The segmentation of sparse MR images (1998) (1)
- The effects of iron on the fields of axisymmetric conductors using a Legendre polynomial approach (1987) (1)
- University of Oxford video retrieval system (2008) (1)
- Improving Salient Object Subitizing (2015) (0)
- Edinburgh Research Explorer The PASCAL Visual Object Classes (VOC) Challenge (2009) (0)
- Advancing large scale object retrieval (2013) (0)
- Multiple View Geometry in Computer Vision: Structure Computation (2004) (0)
- Applications of Piecewise Continuous Reconstruction (2003) (0)
- Human layout estimation using structured output learning (2012) (0)
- A robust and flexible deep-learning workflow for animal tracking (2023) (0)
- Advances in fine-grained visual categorization (2015) (0)
- BASED ON ITS AUDIO-VISUAL CONTENT (2014) (0)
- Supplementary Material: Aligning Subtitles in Sign Language Videos (2021) (0)
- Energy Calculations for the String and Membrane (2003) (0)
- New Results - Image restoration, manipulation and enhancement (2011) (0)
- Automated Detection and Identification of Persons in Video (2004) (0)
- Preface to New geometric techniques in computer vision. A Discussion Meeting held at the Royal Society of London. (1998) (0)
- Image-based Environment Matting ( Online ID 213 ) (2002) (0)
- Illuminance Flow Estimation by Regression (2010) (0)
- Raw images - SPIG (2018) (0)
- Automatic and Efficient Human Pose Estimation for Sign Language Videos (2013) (0)
- Other Grants and Activities - Agence Nationale de la Recherche: MGA (INRIA/ENPC) (2010) (0)
- Detecting Handwritten Text from Forms using Deep Learning (2020) (0)
- Software - Software for computing local invariant features (2004) (0)
- Computer vision for the analysis of cellular activity (2014) (0)
- The Semantic Shift Benchmark (2022) (0)
- Learning Using Convex Optimisation (2012) (0)
- Optimisation of the Magnetic Screening of Electromagnetic Coils (1983) (0)
- Reading Text in the Wild with Convolutional Neural Networks (2015) (0)
- New Results - Image description (2004) (0)
- Persistent Animal Identification Leveraging Non-Visual Markers (2021) (0)
- ICVGIP 2002, Proceedings of the Third Indian Conference on Computer Vision, Graphics & Image Processing, Ahmadabad, India, December 16-18, 2002 (2002) (0)
- A UDIO -V ISUAL S YNCHRONISATION IN THE WILD (2021) (0)
- Introduction and Chapter Summary (1993) (0)
- Visual vocabulary with a semantic twist : Supplementary material (2014) (0)
- Co-Segmentation for Fine Grained Visual Categorization (2013) (0)
- Team SPEEDY Multi Moments in Time Challenge 2019 Technical Report (2019) (0)
- Guest Editorial: Best of CVPR 2015 (2017) (0)
- New Results - Recognition (2004) (0)
- V ERY D EEP C ONVOLUTIONAL N ETWORKS FOR L ARGE -S CALE I MAGE R ECOGNITION (2015) (0)
- Learning Object Categories From Internet Image Searches This paper shows how the results returned by an image search engine can be used to construct models from Internet images and use them for object recognition. (2010) (0)
- Verbs in Action: Improving verb understanding in video-language models (2023) (0)
- Bounding an archiving: assessing the relative completeness of the Jacques Toussele archive using pattern-matching and face-recognition (2021) (0)
- Guest editorial (2004) (0)
- In Proceedings British Machine Vision Conference 2013 (2013) (0)
- The Change You Want to See (2022) (0)
- Properties of the Weak String and Membrane (2003) (0)
- 2D Articulated Human Pose Estimation and Retrieval in (Almost) Unconstrained Still Images (2012) (0)
- ISSLS PRIZE in Clinical Science 2023: comparison of degenerative MRI features of the intervertebral disc between those with and without chronic low back pain. An exploratory study of two large female populations using automated annotation. (2023) (0)
- UNCONSTRAINED TEXT RECOGNITION (2015) (0)
- On-the-fly learning for visual search of large-scale image and video datasets (2015) (0)
- Computing the af ne fundamental matrix from two ellipses (2010) (0)
- Automated detection and identification of persons in video using a coarse 3-D head model and multiple texture maps : Recent advances in image and video retrieval (2005) (0)
- Revised Papers from Second European Workshop on 3D Structure from Multiple Images of Large-Scale Environments (2000) (0)
- Three ways to improve feature alignment for open vocabulary detection (2023) (0)
- Deep Convolutional Neural Networks for Text Spotting in Natural Images (2015) (0)
- Contracts and Grants with Industry - EADS (ENS) (2007) (0)
- Correction to: Identifying Scoliosis in Population‑Based Cohorts: Automation of a Validated Method Based on Total Body Dual Energy X‑ray Absorptiometry Scans (2020) (0)
- Modélisation, localisation, identification et reconnaissance pour la vision par ordinateur (2002) (0)
- Supplementary Material for Speech2Action: Cross-modal Supervision for Action Recognition (2020) (0)
- Additional Reviewers (2003) (0)
- Anatomy-Driven Medical Image Search (2010) (0)
- SLRTP 2020: The Sign Language Recognition, Translation & Production Workshop (2020) (0)
- Training Neural Networks for and by Interpolation: Supplementary (2020) (0)
- Editorial (2005) (0)
- Correction to: Identifying Scoliosis in Population‑Based Cohorts: Automation of a Validated Method Based on Total Body Dual Energy X‑ray Absorptiometry Scans (2020) (0)
- Neural Networks for 2 D to 3 D Human Pose Estimation (2017) (0)
- Tracking and Long-Term Identification Using Non-Visual Markers (2021) (0)
- Content-Based Retrieval, 4.-9. January 2004 (2006) (0)
- Sentences in the Wild (0)
- Magnetostatic Field Calculations Associated with Thick Solenoids with Iron Present (1983) (0)
- Automated Cell Volume Estimation in Time-Lapse Microscopy Images (2014) (0)
- Automated measurement of size of spinal curve in population-based cohorts: Validation of a method based on total body dual energy X-ray absorptiometry scans. (2023) (0)
- Modelling Piecewise Continuity (2003) (0)
- Other Grants and Activities - Agence Nationale de la Recherche: HFIMBR (INRIA) (2009) (0)
- Computation of the Trifocal Tensor T (2004) (0)
- Event Schema Induction with a Probabilistic Entity-Driven Model . EMNLP 2013 : Kaa : policy-based explorations of a richer model for adjustable autonomy (2015) (0)
- Is Epipolar Geometry Necessary ? (2001) (0)
- Multiple View Geometry in Computer Vision: Bibliography (2004) (0)
- Counting Out Time: Class Agnostic Video Periodicity in the Wild (2020) (0)
- Explorer " Here ' s looking at you , kid " (2011) (0)
- Weakly-supervised Fingerspelling Recognition in British Sign Language Videos (2022) (0)
- Deep Insights into Convolutional Networks for Video Recognition (2019) (0)
- Properties of the Weak Rod and Plate (2003) (0)
- Identifying Scoliosis in Population-Based Cohorts: Automation of a Validated Method Based on Total Body Dual Energy X-ray Absorptiometry Scans (2020) (0)
- Multiple View Geometry in Computer Vision: Matrix Properties and Decompositions (2004) (0)
- Automatic learning of British Sign Language from signed TV broadcasts (2010) (0)
- Time-lapse imagery is cheap and timely in the fight against colonial species' decline (2021) (0)
- New geometric techniques in computer vision - Preface (1998) (0)
- 06171 Abstracts Collection Content-Based Retrieval Dagstuhl Seminar (2006) (0)
- Combining principal component techniques and psychological spaces to find perceptually similar faces (2005) (0)
- HiP: Hierarchical Perceiver (2022) (0)
- VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge (2023) (0)
- Multi-Task Multi-Sample Learning (2014) (0)
- Age and Disc Degeneration in Low Back Pain: Automated Analysis Enables a Magnetic Resonance Imaging Comparison of Large Cross-Sectional Groups of Symptomatic and Asymptomatic Subjects (2021) (0)
- Learning to Predict 3D Surfaces of Sculptures from Single and Multiple Views (2018) (0)
- Models of Visual Object Recognition and Scene Understanding (2007) (0)
- Video Action Transformer Network : Appendix (2018) (0)
- Noise Performance of the Weak Elastic String (2003) (0)
- Analysis of the GNC Algorithm (2003) (0)
- Energy Calculations for the Rod and Plate (2003) (0)
- Editorial: IJCV special issue: Vision and modelling of dynamic scenes (2006) (0)
- P ERSISTENT A NIMAL I DENTIFICATION L EVERAGING N ON -V ISUAL M ARKERS (2022) (0)
- Utilisation de rétroaction de pertinence dans une reconnaissance de visage (2007) (0)
- Anomalies of anatomical asymmetry detected in first episode cases of schizophrenic illness by a new method of MRI reconstruction and analysis (1998) (0)
- AutoAD: Movie Description in Context (2023) (0)
- Invariant Texture Matching and Wide Baseline Stereo (0)
- You Said That?: Synthesising Talking Faces from Audio (2019) (0)
- Other Grants and Activities - European Projects (2004) (0)
- Adaptive Text Recognition through Visual Matching Supplementary Material (2020) (0)
- Narrowing Confidence Bounds Using Estimates of Partial Volume Effects (1999) (0)
- Introduction to Weak Continuity Constraints (2003) (0)
- Organizing Committee and Area Chairs (2018) (0)
- End-to-end Tracking with a Multi-query Transformer (2022) (0)
- Multiple View Geometry in Computer Vision: Degenerate Configurations (2004) (0)
- Tion Clustering Using the Trilinear Constraint over Three Views. in Workshop on Geometrical Modeling And (1996) (0)
- 04021 Abstracts Collection - Content-Based Retrieval (2004) (0)
- Advancing human pose and gesture recognition (2015) (0)
- OP0060 MACHINE LEARNING BASED BERLIN SCORING OF MAGNETIC RESONANCE IMAGES OF THE SPINE IN PATIENTS WITH ANKYLOSING SPONDYLITIS FROM THE MEASURE 1 STUDY (2020) (0)
- IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1988, 5-9 June, 1988, Ann Arbor, Michigan, USA (1988) (0)
- Using Combinations of Lines and Points for Minimal Projective Reconstruction (2002) (0)
- New Results - Human action recognition (2010) (0)
- Automatic Creation of Virtual Artefacts from Video Sequences (2004) (0)
- Epic-Sounds: A Large-scale Dataset of Actions That Sound (2023) (0)
- Segmentation and Measurement of Brain Structures from MRI (1998) (0)
- ROB volume 6 issue 1 Cover and Back matter (1988) (0)
- Associating Objects and Their Effects in Video through Coordination Games (2022) (0)
This paper list is powered by the following services:
Other Resources About Andrew Zisserman
What Schools Are Affiliated With Andrew Zisserman?
Andrew Zisserman is affiliated with the following schools: