Trevor Darrell
#25,995
Most Influential Person Now
American computer scientist
Trevor Darrell's AcademicInfluence.com Rankings
Trevor Darrellcomputer-science Degrees
Computer Science
#1045
World Rank
#1083
Historical Rank
#553
USA Rank
Algorithms
#19
World Rank
#19
Historical Rank
#9
USA Rank
Machine Learning
#46
World Rank
#46
Historical Rank
#17
USA Rank
Database
#70
World Rank
#72
Historical Rank
#40
USA Rank

Download Badge
Computer Science
Trevor Darrell's Degrees
- Bachelors Computer Science University of California, Berkeley
Similar Degrees You Can Earn
Why Is Trevor Darrell Influential?
(Suggest an Edit or Addition)According to Wikipedia, Trevor Jackson Darrell is an American computer scientist and professor at the University of California, Berkeley. He is known for his research on computer vision and machine learning and is one of the leading experts on topics such as deep learning and explainable AI.
Trevor Darrell's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Fully convolutional networks for semantic segmentation (2014) (29791)
- Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation (2013) (20872)
- Caffe: Convolutional Architecture for Fast Feature Embedding (2014) (14406)
- Long-term recurrent convolutional networks for visual recognition and description (2014) (5396)
- Pfinder: real-time tracking of the human body (1996) (5005)
- DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition (2013) (4623)
- Context Encoders: Feature Learning by Inpainting (2016) (4207)
- Adversarial Discriminative Domain Adaptation (2017) (3486)
- End-to-End Training of Deep Visuomotor Policies (2015) (2811)
- Adapting Visual Category Models to New Domains (2010) (2304)
- CyCADA: Cycle-Consistent Adversarial Domain Adaptation (2017) (2243)
- Deep Domain Confusion: Maximizing for Domain Invariance (2014) (1906)
- Region-Based Convolutional Networks for Accurate Object Detection and Segmentation (2016) (1845)
- Curiosity-Driven Exploration by Self-Supervised Prediction (2017) (1722)
- The pyramid match kernel: discriminative classification with sets of image features (2005) (1654)
- Adversarial Feature Learning (2016) (1565)
- Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding (2016) (1262)
- Sequence to Sequence -- Video to Text (2015) (1220)
- Simultaneous Deep Transfer Across Domains and Tasks (2015) (1191)
- Toward Multimodal Image-to-Image Translation (2017) (1080)
- Part-Based R-CNNs for Fine-Grained Category Detection (2014) (1063)
- Rethinking the Value of Network Pruning (2018) (1004)
- A ConvNet for the 2020s (2022) (925)
- Fast pose estimation with parameter-sensitive hashing (2003) (916)
- Deep Layer Aggregation (2017) (904)
- Neural Module Networks (2015) (860)
- BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning (2018) (859)
- Learning to Hash with Binary Reconstructive Embeddings (2009) (849)
- BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling (2018) (725)
- What you saw is not what you get: Domain adaptation using asymmetric kernel transforms (2011) (711)
- End-to-End Learning of Driving Models from Large-Scale Video Datasets (2016) (674)
- Compact Bilinear Pooling (2015) (672)
- FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation (2016) (633)
- DenseNet: Implementing Efficient ConvNet Descriptor Pyramids (2014) (618)
- Nearest-Neighbor Methods in Learning and Vision: Theory and Practice (Neural Information Processing) (2006) (578)
- Hidden Conditional Random Fields (2007) (572)
- Integrated Person Tracking Using Stereo, Color, and Pattern Detection (1998) (558)
- Hidden Conditional Random Fields for Gesture Recognition (2006) (553)
- Constrained Convolutional Neural Networks for Weakly Supervised Segmentation (2015) (540)
- Large-Scale Study of Curiosity-Driven Learning (2018) (529)
- Localizing Moments in Video with Natural Language (2017) (521)
- Generating Visual Explanations (2016) (518)
- Learning to Compose Neural Networks for Question Answering (2016) (517)
- PANDA: Pose Aligned Networks for Deep Attribute Modeling (2013) (490)
- Learning to Reason: End-to-End Module Networks for Visual Question Answering (2017) (486)
- Natural Language Object Retrieval (2015) (483)
- Deep spatial autoencoders for visuomotor learning (2015) (470)
- Learning Features by Watching Objects Move (2016) (467)
- Space-time gestures (1993) (452)
- A category-level 3-D object dataset: Putting the Kinect to work (2011) (449)
- YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-Shot Recognition (2013) (447)
- Conditional Random Fields for Object Recognition (2004) (430)
- Grounding of Textual Phrases in Images by Reconstruction (2015) (423)
- Recasting Gradient-Based Meta-Learning as Hierarchical Bayes (2018) (419)
- Latent-Dynamic Discriminative Models for Continuous Gesture Recognition (2007) (417)
- Deformable part models are convolutional neural networks (2014) (409)
- Few-Shot Object Detection via Feature Reweighting (2018) (405)
- Recognizing Image Style (2013) (404)
- The Pyramid Match Kernel: Efficient Learning with Sets of Features (2007) (394)
- Generalized Zero- and Few-Shot Learning via Aligned Variational Autoencoders (2018) (392)
- Semi-Supervised Domain Adaptation via Minimax Entropy (2019) (374)
- Active Learning with Gaussian Processes for Object Categorization (2007) (372)
- Women also Snowboard: Overcoming Bias in Captioning Models (2018) (345)
- Face recognition with image sets using manifold density divergence (2005) (337)
- LSDA: Large Scale Detection through Adaptation (2014) (327)
- Variational Adversarial Active Learning (2019) (326)
- Face Recognition from Long-Term Observations (2002) (321)
- Speaker-Follower Models for Vision-and-Language Navigation (2018) (320)
- Early Convolutions Help Transformers See Better (2021) (319)
- Modeling Relationships in Referential Expressions with Compositional Modular Networks (2016) (318)
- Learning modular neural network policies for multi-task and multi-robot transfer (2016) (307)
- Beyond spatial pyramids: Receptive field learning for pooled image features (2012) (302)
- Do Convnets Learn Correspondence? (2014) (299)
- Multimodal Explanations: Justifying Decisions and Pointing to the Evidence (2018) (298)
- Nearest-Neighbor Methods in Learning and Vision (2008) (294)
- Frustratingly Simple Few-Shot Object Detection (2020) (288)
- Tent: Fully Test-Time Adaptation by Entropy Minimization (2021) (287)
- Fully Convolutional Multi-Class Multiple Instance Learning (2014) (281)
- Efficient Learning of Domain-invariant Image Representations (2013) (280)
- Transfer learning for image classification with sparse prototype representations (2008) (270)
- Learning to Segment Every Thing (2017) (261)
- Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data (2015) (259)
- Fast contour matching using approximate earth mover's distance (2004) (258)
- Integrated face and gait recognition from multiple views (2001) (257)
- On learning to localize objects with minimal supervision (2014) (254)
- Privacy in Context (2001) (252)
- DeepSentiBank: Visual Sentiment Concept Classification with Deep Convolutional Neural Networks (2014) (249)
- Zero-Shot Visual Imitation (2018) (248)
- A geometric approach to robotic laundry folding (2012) (246)
- Simultaneous calibration and tracking with a network of non-overlapping sensors (2004) (245)
- Sparse probabilistic regression for activity-independent human pose inference (2008) (242)
- Multi-content GAN for Few-Shot Font Style Transfer (2017) (239)
- Inferring 3D structure with a statistical image-based shape model (2003) (239)
- Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction (2013) (239)
- Efficient image matching with distributions of local invariant features (2005) (236)
- Segmentation from Natural Language Expressions (2016) (226)
- Birdlets: Subordinate categorization using volumetric primitives and pose-normalized appearance (2011) (220)
- Autotagging Facebook: Social network context improves photo annotation (2008) (216)
- The ALIVE system: wireless, full-body interaction with autonomous agents (1997) (213)
- The ALIVE system: full-body interaction with autonomous agents (1995) (211)
- Deep learning for tactile understanding from visual and haptic data (2015) (208)
- Gaussian Processes for Object Categorization (2010) (206)
- Learning Joint Statistical Models for Audio-Visual Fusion and Segregation (2000) (201)
- Learning cross-modality similarity for multinomial data (2011) (195)
- Unsupervised Learning of Categories from Sets of Partially Matching Image Features (2006) (195)
- Background estimation and removal based on range and color (1999) (194)
- Factorized Latent Spaces with Structured Sparsity (2010) (192)
- Data-dependent Initializations of Convolutional Neural Networks (2015) (186)
- Learning with Side Information through Modality Hallucination (2016) (185)
- Discovering Latent Domains for Multisource Domain Adaptation (2012) (182)
- Robust estimation of a multi-layered motion representation (1991) (181)
- Pyramid Match Kernels: Discriminative Classification with Sets of Image Features (version 2) (2006) (181)
- Unsupervised Domain Adaptation through Self-Supervision (2019) (179)
- Modeling, tracking and interactive animation of faces and heads//using input from video (1996) (175)
- Reinforcement Learning from Imperfect Demonstrations (2018) (174)
- Semi-supervised Domain Adaptation with Instance Constraints (2013) (173)
- What Should Not Be Contrastive in Contrastive Learning (2020) (171)
- Hierarchical Discrete Distribution Decomposition for Match Density Estimation (2018) (170)
- Clockwork Convnets for Video Semantic Segmentation (2016) (169)
- Grounding Visual Explanations (2018) (169)
- Cooperative Robust Estimation Using Layers of Support (1995) (163)
- A New Meta-Baseline for Few-Shot Learning (2020) (163)
- Weakly-supervised Discovery of Visual Pattern Configurations (2014) (160)
- Textual Explanations for Self-Driving Vehicles (2018) (160)
- Algorithmic Framework for Model-based Reinforcement Learning with Theoretical Guarantees (2018) (160)
- Conditional Networks for Few-Shot Semantic Segmentation (2018) (159)
- Deep Compositional Question Answering with Neural Module Networks (2015) (159)
- Explainable Neural Computation via Stack Neural Module Networks (2018) (158)
- MULTIMODAL INTERFACES THAT Flex, Adapt, and Persist (2004) (158)
- Captioning Images with Diverse Objects (2016) (155)
- Loss is its own Reward: Self-Supervision for Reinforcement Learning (2016) (155)
- Discriminative Gaussian process latent variable model for classification (2007) (148)
- Joint Monocular 3D Vehicle Detection and Tracking (2018) (146)
- Pose pooling kernels for sub-category recognition (2012) (146)
- Contextual recognition of head gestures (2005) (144)
- Pyramid based depth from focus (1988) (144)
- Topologically-constrained latent variable models (2008) (143)
- Active face tracking and pose estimation in an interactive room (1996) (143)
- Searching the Web with mobile images for location recognition (2004) (142)
- Continuous Manifold Based Adaptation for Evolving Visual Domains (2014) (142)
- Speaker association with signal-level audiovisual fusion (2004) (135)
- Multi-View Learning in the Presence of View Disagreement (2008) (135)
- Task-Specific Gesture Analysis in Real-Time Using Interpolated Views (1996) (135)
- Object Hallucination in Image Captioning (2018) (135)
- Quasi-Dense Similarity Learning for Multiple Object Tracking (2020) (133)
- Grounding spatial relations for human-robot interaction (2013) (131)
- Adaptive view-based appearance models (2003) (130)
- The NBNN kernel (2011) (129)
- Best Practices for Fine-Tuning Visual Classifiers to New Domains (2016) (126)
- Plan-view trajectory estimation with dense stereo background models (2001) (125)
- Approximate Correspondences in High Dimensions (2006) (124)
- An efficient projection for l1, ∞ regularization (2009) (123)
- Language-Conditioned Graph Networks for Relational Reasoning (2019) (121)
- Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA (2019) (121)
- Adapting Deep Visuomotor Representations with Weak Pairwise Constraints (2015) (121)
- A novel environment for situated vision and behavior (1994) (116)
- A simple, real-time range camera (1989) (114)
- Uncertainty-guided Continual Learning with Bayesian Neural Networks (2019) (114)
- Learning Visual Representations using Images with Captions (2007) (114)
- Activity Zones for Context-Aware Computing (2003) (110)
- Parametrized shape models for clothing (2011) (110)
- Using robotic exploratory procedures to learn the meaning of haptic adjectives (2013) (109)
- On probabilistic combination of face and gait cues for identification (2002) (107)
- Tracking facial motion (1994) (106)
- Factorized Orthogonal Latent Spaces (2010) (106)
- Evolving Visual Routines (1994) (104)
- Localizing Moments in Video with Temporal Language (2018) (104)
- Robotic learning of haptic adjectives through physical interaction (2015) (104)
- Discriminator Rejection Sampling (2018) (104)
- Head gestures for perceptual interfaces: The role of context in improving recognition (2007) (103)
- Something-Else: Compositional Action Recognition With Spatial-Temporal Interaction Networks (2019) (99)
- Simple range cameras based on focal error (1994) (98)
- Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning (2020) (98)
- Few-Shot Segmentation Propagation with Guided Networks (2018) (96)
- Toward Large-Scale Face Recognition Using Social Network Context (2010) (95)
- Towards Adapting Deep Visuomotor Representations from Simulated to Real Environments (2015) (93)
- Sparselet Models for Efficient Multiclass Object Detection (2012) (92)
- Multiple person and speaker activity tracking with a particle filter (2004) (92)
- A Bayesian approach to image-based visual hull reconstruction (2003) (91)
- Adversarial Continual Learning (2020) (91)
- 3D pose tracking with linear depth and brightness constraints (1999) (89)
- Visual speech recognition with loosely synchronized feature streams (2005) (89)
- Learning appearance manifolds from video (2005) (89)
- TAFE-Net: Task-Aware Feature Embeddings for Low Shot Learning (2019) (85)
- Head gesture recognition in intelligent interfaces: the role of context in improving recognition (2006) (84)
- Pose estimation using 3D view-based eigenspaces (2003) (81)
- Fast stereo-based head tracking for interactive environments (2002) (80)
- Constraining human body tracking (2003) (80)
- Auxiliary Image Regularization for Deep CNNs with Noisy Labels (2015) (79)
- Detector discovery in the wild: Joint multiple instance and representation learning (2014) (79)
- Fast concurrent object localization and recognition (2009) (78)
- Generating Counterfactual Explanations with Natural Language (2018) (77)
- Open-vocabulary Object Retrieval (2014) (77)
- Cross-modal adaptation for RGB-D detection (2016) (76)
- One-Shot Adaptation of Supervised Deep Convolutional Models (2013) (76)
- Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity (2019) (76)
- Towards Practical Multi-Object Manipulation using Relational Reinforcement Learning (2019) (76)
- More Control for Free! Image Synthesis with Semantic Diffusion Guidance (2021) (75)
- Conditional Random People: Tracking Humans with CRFs and Grid Filters (2006) (75)
- Perceptive Spaces for Performance and Entertainment Untethered Interaction Using Computer Vision and Audition (1997) (75)
- Segmentation by minimal description (1990) (74)
- Locality-Sensitive Hashing Using Stable Distributions (2006) (74)
- Attentive Explanations: Justifying Decisions and Pointing to the Evidence (2016) (73)
- Motion estimation from disparity images (2001) (73)
- Pyramid Match Hashing: Sub-Linear Time Indexing Over Partial Correspondences (2007) (73)
- Deep Object-Centric Representations for Generalizable Robot Learning (2017) (72)
- An Additive Latent Feature Model for Transparent Object Recognition (2009) (72)
- Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation (2019) (72)
- A picture is worth a thousand keywords: image-based object search on a mobile platform (2005) (69)
- Learning with Recursive Perceptual Representations (2012) (69)
- Deep Object-Centric Policies for Autonomous Driving (2018) (69)
- Evaluating look-to-talk: a gaze-aware interface in a collaborative environment (2002) (68)
- Masked Visual Pre-training for Motor Control (2022) (68)
- Learning Visual Feature Spaces for Robotic Manipulation with Deep Spatial Autoencoders (2015) (68)
- Hidden-state Conditional Random Fields (2006) (68)
- 3-D articulated pose tracking for untethered diectic reference (2002) (67)
- Learning the Structure of Deep Convolutional Networks (2015) (67)
- Region Similarity Representation Learning (2021) (66)
- Photo-based question answering (2008) (65)
- Adaptive Vocabulary Forests br Dynamic Indexing and Category Learning (2007) (64)
- Robust Change Captioning (2019) (64)
- Fine-grained pose prediction, normalization, and recognition (2015) (64)
- Active gesture recognition using partially observable Markov decision processes (1996) (64)
- TSC-DL: Unsupervised trajectory segmentation of multi-modal surgical demonstrations with Deep Learning (2016) (63)
- Visual Discovery at Pinterest (2017) (63)
- Disentangling Propagation and Generation for Video Prediction (2018) (62)
- Nearest-Neighbor Searching and Metric Space Dimensions (2006) (62)
- Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning (2020) (61)
- Deep Mixture of Experts via Shallow Embedding (2018) (61)
- Unsupervised Learning of Visual Sense Models for Polysemous Words (2008) (60)
- Timely Object Recognition (2012) (59)
- Learning Canonical Representations for Scene Graph to Image Generation (2019) (59)
- Computer Vision and Applications (2000) (58)
- Adapting to Continuously Shifting Domains (2018) (58)
- Recognizing gaze aversion gestures in embodied conversational discourse (2006) (58)
- Avoiding the "streetlight effect": tracking by exploring likelihood modes (2005) (58)
- Compositional GAN: Learning Image-Conditional Binary Composition (2018) (57)
- Anytime Recognition of Objects and Scenes (2014) (57)
- Spatio-Temporal Action Graph Networks (2018) (56)
- Monocular Plan View Networks for Autonomous Driving (2019) (56)
- An efficient projection for l 1 , infinity regularization. (2009) (55)
- Stereo tracking using ICP and normal flow constraint (2002) (55)
- Adversarial Inference for Multi-Sentence Video Description (2018) (53)
- Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation (2021) (52)
- Reducing drift in parametric motion tracking (2001) (52)
- Un-mix: Rethinking Image Mixtures for Unsupervised Visual Representation Learning (2020) (52)
- A virtual mirror interface using real-time robust face tracking (1998) (52)
- Visual Concept Learning: Combining Machine Vision and Bayesian Generalization on Concept Hierarchies (2013) (51)
- Self-Supervised Pretraining Improves Self-Supervised Pretraining (2021) (51)
- Fooling Vision and Language Models Despite Localization and Attention Mechanism (2017) (51)
- Unsupervised feature selection via distributed coding for multi-view object recognition (2008) (49)
- Depth from focus using a pyramid architecture (1990) (49)
- Scene Intrinsics and Depth from a Single Image (2015) (49)
- Multimodal Image-to-Image Translation by Enforcing Bi-Cycle Consistency (2017) (48)
- Articulatory features for robust visual speech recognition (2004) (47)
- Learning to Detect Visual Grasp Affordance (2016) (46)
- Learning Invariant Representations and Risks for Semi-supervised Domain Adaptation (2020) (45)
- A Probabilistic Framework for Multi-modal Multi-Person Tracking (2003) (44)
- Contrastive Test-Time Adaptation (2022) (44)
- Rank priors for continuous non-linear dimensionality reduction (2009) (44)
- Heavy-tailed Distances for Gradient Based Image Descriptors (2011) (43)
- Multimodal location estimation (2010) (43)
- On modelling nonlinear shape-and-texture appearance manifolds (2005) (43)
- Generalized Orderless Pooling Performs Implicit Salient Matching (2017) (43)
- Multiple-view object recognition in band-limited distributed camera networks (2009) (42)
- Ausio-visual Segmentation and "The Cocktail Party Effect" (2000) (42)
- Factorized Multi-Modal Topic Model (2012) (42)
- Learning a Precedence Effect-Like Weighting Function for the Generalized Cross-Correlation Framework (2006) (42)
- ALIVE: Artificial Life Interactive Video Environment (1994) (41)
- Modular Architecture for StarCraft II with Deep Reinforcement Learning (2018) (41)
- Gesture + play: full-body interaction for virtual environments (2003) (41)
- Perception for the manipulation of socks (2011) (41)
- Recognition of Space-Time Gestures using a Distributed Representation (1993) (41)
- Modeling Radiometric Uncertainty for Vision with Tone-Mapped Color Images (2013) (40)
- Bayesian Localized Multiple Kernel Learning (2009) (40)
- Dynamic visual category learning (2008) (40)
- DETReg: Unsupervised Pretraining with Region Priors for Object Detection (2021) (40)
- A multi-modal approach for determining speaker location and focus (2003) (40)
- Learning Instance Segmentation by Interaction (2018) (40)
- Predicting with Confidence on Unseen Distributions (2021) (39)
- Co-training with noisy perceptual observations (2009) (38)
- Co-Adaptation of audio-visual speech and gesture classifiers (2006) (38)
- Benchmark for Compositional Text-to-Image Synthesis (2021) (37)
- Monocular Quasi-Dense 3D Object Tracking (2021) (37)
- From pixels to physics: Probabilistic color de-rendering (2012) (37)
- Can you fool AI with adversarial examples on a visual Turing test? (2017) (37)
- Real-World Robot Learning with Masked Visual Pre-training (2022) (37)
- SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting (2012) (36)
- Auto-Tuned Sim-to-Real Transfer (2021) (36)
- A radial cumulative similarity transform for robust image correspondence (1998) (36)
- ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation (2020) (35)
- Attention-driven Expression and Gesture Analysis in an Interactive Environment (1995) (35)
- IDeixis: image-based Deixis for finding location-based information (2004) (34)
- Blurring the Line Between Structure and Learning to Optimize and Adapt Receptive Fields (2019) (34)
- Activity maps for location-aware computing (2002) (34)
- CLIP-It! Language-Guided Video Summarization (2021) (33)
- Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation (2020) (33)
- Discriminatively Activated Sparselets (2013) (32)
- Fully Test-time Adaptation by Entropy Minimization (2020) (32)
- Untethered gesture acquisition and recognition for virtual world manipulation (2005) (32)
- Detection bank: an object detection based video representation for multimedia event recognition (2012) (31)
- 'Nulling' filters and the separation of transparent motions (1993) (31)
- SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning (2020) (30)
- Separation of transparent motion into layers using velocity-tuned mechanisms (1994) (30)
- Multistream Articulatory Feature-Based Models for Visual Speech Recognition (2009) (30)
- Learning to Recognize Objects from Unseen Modalities (2010) (29)
- Regularization Matters in Policy Optimization (2019) (27)
- Reinforcement Learning of Active Recognition Behaviors (1997) (27)
- An Efficient Projection for l 1 , ∞ Regularization (2009) (27)
- Object-Region Video Transformers (2021) (26)
- Articulated-pose estimation using brightness- and depth-constancy constraints (2000) (26)
- Using Multiple-Hypothesis Disparity Maps and Image Velocity for 3-D Motion Estimation (2001) (26)
- Classifying Hand Gestures with a View-Based Distributed Representation (1993) (26)
- Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics (2020) (26)
- Compositional GAN: Learning Conditional Image Composition (2018) (26)
- Production domain modeling of pronunciation for visual speech recognition (2005) (26)
- Dynamic Feature Selection for Classification on a Budget (2013) (26)
- Unsupervised Domain Adaptation (2019) (25)
- Practical 3-D Object detection using category and instance-level appearance models (2011) (25)
- Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting (2020) (25)
- Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control (2020) (25)
- Gradient-free Policy Architecture Search and Adaptation (2017) (25)
- Proton: A visuo-haptic data acquisition system for robotic learning of surface properties (2016) (25)
- Face-Responsive Interfaces: From Direct Manipulation to Perceptive Presence (2002) (24)
- Correlation and Interpolation Networks for Real-time Expression Analysis/Synthesis (1994) (24)
- IDeixis - Searching the Web with Mobile Images for Location-Based Information (2004) (23)
- Compositional Video Synthesis with Action Graphs (2020) (23)
- K-LITE: Learning Transferable Visual Models with External Knowledge (2022) (23)
- Incorporating Object Tracking Feedback into Background Maintenance Framework (2005) (23)
- Who is “You”? Combining Linguistic and Gaze Features to Resolve Second-Person References in Dialogue (2009) (23)
- NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media (2021) (22)
- Robust, real-time people tracking in open environments using integrated stereo, color, and face detection (1998) (22)
- Spatial Semantic Regularisation for Large Scale Object Detection (2015) (22)
- Nodding in conversations with a robot (2004) (22)
- The Whole Is More Than Its Parts? From Explicit to Implicit Pose Normalization (2020) (22)
- Generalized Sparselet Models for Real-Time Multiclass Object Recognition (2015) (22)
- Audio-video array source separation for perceptual user interfaces (2001) (21)
- Learning object color models from multi-view constraints (2011) (21)
- Rethinking Image Mixture for Unsupervised Visual Representation Learning (2020) (21)
- Semantic Bottleneck Scene Generation (2019) (21)
- Learning Detection with Diverse Proposals (2017) (20)
- Learning Saliency Propagation for Semi-Supervised Instance Segmentation (2020) (20)
- A novel image-based tool to reunite children with their families after disasters. (2012) (19)
- ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension (2022) (19)
- Classifying Collisions with Spatio-Temporal Action Graph Networks (2018) (19)
- Visual Prompting via Image Inpainting (2022) (19)
- Fighting Copycat Agents in Behavioral Cloning from Observation Histories (2020) (19)
- Hierarchical Style-based Networks for Motion Synthesis (2020) (19)
- Large Scale Visual Recognition through Adaptation using Joint Representation and Multiple Instance Learning (2016) (19)
- Advisable Learning for Self-Driving Vehicles by Internalizing Observation-to-Action Rules (2020) (19)
- Active Gesture Recognition using Learned Visual Attention (1995) (19)
- Visual grasp affordances from appearance-based cues (2011) (19)
- Learning to Detect Every Thing in an Open World (2021) (19)
- Fast 3D model acquisition from stereo images (2002) (18)
- Contrastive Examples for Addressing the Tyranny of the Majority (2020) (18)
- Auxiliary Task Reweighting for Minimum-data Learning (2020) (18)
- Size Matters: Metric Visual Search Constraints from Monocular Metadata (2010) (18)
- Learning Scalable Discriminative Dictionary with Sample Relatedness (2014) (18)
- Interactive adaptation of real-time object detectors (2014) (18)
- Tracking People with a Sparse Network of Bearing Sensors (2004) (18)
- Recovering Articulated Model Topology from Observed Rigid Motion (2002) (17)
- Supervised hierarchical Pitman-Yor process for natural scene segmentation (2011) (17)
- Signal level fusion for multimodal perceptual user interface (2001) (17)
- Probabalistic Models and Informative Subspaces for Audiovisual Correspondence (2002) (17)
- Tune it the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density (2021) (17)
- Reducing drift in differential tracking (2008) (17)
- Perceptive Presence (2003) (17)
- From conversational tooltips to grounded discourse: head poseTracking in interactive dialog systems (2004) (17)
- Minimax Active Learning (2020) (16)
- Understanding object descriptions in robotics by open-vocabulary object retrieval and detection (2016) (16)
- SPLAT: Semantic Pixel-Level Adaptation Transforms for Detection (2018) (15)
- Quantification in-the-wild: data-sets and baselines (2015) (15)
- Improving audio source localization by learning the precedence effect (2005) (15)
- Video Prediction via Example Guidance (2020) (15)
- Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions (2016) (15)
- Robust Object Detection via Instance-Level Temporal Cycle Confusion (2021) (14)
- Learning to Listen: Modeling Non-Deterministic Dyadic Facial Motion (2022) (14)
- Mapping Images to Sentiment Adjective Noun Pairs with Factorized Neural Nets (2015) (14)
- Light Field Appearance Manifolds (2004) (14)
- Guest Editor’s Introduction to the Special Issue on Domain Adaptation for Vision Applications (2014) (14)
- Towards Adapting ImageNet to Reality: Scalable Domain Adaptation with Implicit Low-rank Transformations (2013) (13)
- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers (2007) (13)
- Navigating in virtual environments using a vision-based interface (2004) (13)
- Exemplar-Specific Patch Features for Fine-Grained Recognition (2014) (13)
- Bayesian Articulated Tracking Using Single Frame Pose Sampling (2003) (13)
- Exploring Vision-Based Interfaces: How to Use Your Head in Dual Pointing Tasks (2002) (13)
- Fighting Gradients with Gradients: Dynamic Defenses against Adversarial Attacks (2021) (13)
- Untethered gesture acquisition and recognition for a multimodal conversational system (2003) (12)
- Filtering Abstract Senses From Image Search Results (2009) (12)
- Adaptive Mean Shift Based Clustering in High Dimensions (2006) (12)
- Second-Order Method for Occlusion Relationships in Motion Layers (1995) (12)
- Task-Aware Feature Generation for Zero-Shot Compositional Learning. (2019) (12)
- Generalized Zero-Shot Learning via Aligned Variational Autoencoders (2019) (12)
- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion (2007) (12)
- Latent Task Adaptation with Large-Scale Hierarchies (2013) (11)
- Pooling-Invariant Image Feature Learning (2013) (11)
- Modeling and Interactive Animation of Facial Expression using Vision (1994) (11)
- Accurate Visual Localization for Automotive Applications (2019) (11)
- Similarity R-C3D for Few-shot Temporal Activity Detection (2018) (11)
- Combining object and feature dynamics in probabilistic tracking (2005) (11)
- Perceptually-driven Avatars and Interfaces: active methods for direct control (1997) (11)
- Adversarial Discriminative Domain Adaptation (workshop extended abstract) (2017) (11)
- The BDD-Nexar Collective : A Large-Scale , Crowsourced , Dataset of Driving Scenes Vashisht Madhavan (2017) (11)
- Machine Learning with Interdependent and Non-identically Distributed Data (Dagstuhl Seminar 15152) (2015) (11)
- Compositional Plan Vectors (2019) (10)
- Constrained Structured Regression with Convolutional Neural Networks (2015) (10)
- Modeling Interactive Agents in ALIVE (1995) (10)
- Team SRI-Sarnoff's AURORA System @ TRECVID 2011 (2012) (10)
- Recovering Articulated Model Topology from Observed Motion (2002) (10)
- Unsupervised Distributed Feature Selection for Multi-view Object Recognition (2008) (10)
- Transferring Nonlinear Representations using Gaussian Processes with a Shared Latent Space (2008) (10)
- Multiple-View Object Recognition in Smart Camera Networks (2011) (10)
- Zero-Shot Reward Specification via Grounded Natural Language (2022) (10)
- Multimodal question answering for mobile devices (2008) (10)
- LabelAR: A Spatial Guidance Interface for Fast Computer Vision Image Collection (2019) (10)
- Virtual Visual Hulls: Example-Based 3D Shape Inference from Silhouettes (2004) (9)
- Magic morphin mirror: face-sensitive distortion and exaggeration (1997) (9)
- Gesture + Play Exploring Full-Body Navigation for Virtual Environments (2003) (9)
- A probabilistic model for recursive factorized image features (2011) (9)
- Visually guided animation (1994) (9)
- Audiovisual arrays for untethered spoken interfaces (2002) (9)
- Transferring Visual Category Models to New Domains (2010) (9)
- Temporal Action Detection with Multi-level Supervision (2020) (9)
- Range Segmentation Using Visibility Constraints (2001) (9)
- Tracking people with integrated stereo, color, and face detection. (1998) (9)
- Corrigendum to "Robotic learning of haptic adjectives through physical interaction" [Robot. Auton. Syst. 63 (P3) (2015) 279-292] (2016) (9)
- PSFIG - A DITROFF Preprocessor for Postscript Figures (1987) (9)
- Transferable Recognition-Aware Image Processing (2019) (9)
- Quasi-Dense Instance Similarity Learning (2020) (8)
- Evaluating Self-Supervised Pretraining Without Using Labels (2020) (8)
- Towards Context-Based Visual Feedback Recognition for Embodied Agents (2005) (8)
- Informative subspaces for audio-visual processing: High-level function from low-level fusion (2002) (8)
- Bayesian network for online global pose estimation (2002) (8)
- Doubleshot: an interactive user-aided segmentation tool (2005) (8)
- Shape-Guided Diffusion with Inside-Outside Attention (2022) (8)
- Correspondence with Cumulative Similiarity Transforms (2001) (8)
- Strumming to the Beat: Audio-Conditioned Contrastive Video Textures (2021) (8)
- Object Recognition using Locality Sensitive Hashing of Shape Contexts (2006) (7)
- Dynamic occluding contours: a new external-energy term for snakes (1999) (7)
- Revisiting Few-shot Activity Detection with Class Similarity Control (2020) (7)
- The Role of Context in Head Gesture Recognition (2006) (7)
- Back to the Source: Diffusion-Driven Test-Time Adaptation (2022) (7)
- On Compact Codes for Spatially Pooled Features (2013) (7)
- On the use of "nulling" filters to separate transparent motions (1993) (7)
- Geometric and Statistical Approaches to Audiovisual Segmentation (2005) (7)
- The Berkeley 3D Object Dataset (2012) (7)
- Identity-Aware Multi-Sentence Video Description (2020) (7)
- Communal Cuts : sharing cuts across images (2014) (6)
- Rethinking preventing class-collapsing in metric learning with margin-based losses (2020) (6)
- On Guiding Visual Attention with Language Specification (2022) (6)
- sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data (2021) (6)
- Dynamic Scale Inference by Entropy Minimization (2019) (6)
- Mid-level Features Improve Recognition of Interactive Activities (2012) (6)
- Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation (2021) (6)
- Plan Arithmetic: Compositional Plan Vectors for Multi-Task Control (2019) (6)
- Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly (2022) (6)
- ParkPredict: Motion and Intent Prediction of Vehicles in Parking Lots (2020) (6)
- sing Interpolated Views (1996) (5)
- Modularity Improves Out-of-Domain Instruction Following (2020) (5)
- An efficient projection for {\it l}$_{\mbox{1}}$,$_{\mbox{infinity}}$ regularization (2009) (5)
- Example-Based Image Synthesis of Articulated Figures (1998) (5)
- Discovering Non-monotonic Autoregressive Orderings with Variational Inference (2021) (5)
- Visual perception of human bodies and faces for multi-modal interfaces (1994) (5)
- Anytime Dense Prediction with Confidence Adaptivity (2021) (5)
- Reducing Class Collapse in Metric Learning with Easy Positive Sampling (2020) (5)
- Towards adaptive object recognition for situated human-computer interaction (2007) (5)
- Tracking with Non-Linear Dynamic Models (2002) (5)
- Evaluating look-to-talk (2002) (5)
- Viewpoint Invariant Change Captioning (2019) (5)
- Scalable classifiers for Internet vision tasks (2008) (5)
- Probabilistic Kernel Combination for Hierarchical Object Categorization (2009) (4)
- Scale-MAE: A Scale-Aware Masked Autoencoder for Multiscale Geospatial Representation Learning (2022) (4)
- TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency (2022) (4)
- Spatio-Temporal Action Detection with Multi-Object Interaction (2020) (4)
- Generating Post-Hoc Rationales of Deep Visual Classification Decisions (2018) (4)
- Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion (2021) (4)
- DenseNet : Implementing Efficient ConvNet Descriptor Pyramids Technical Report (2014) (4)
- Location Estimation with a Differential Update Network (2002) (4)
- On-target Adaptation (2021) (4)
- New Algorithms for Efficient High-Dimensional Nonparametric Classification (2006) (4)
- Incorporating Semantic Constraints into a Discriminative Categorization and Labelling Model. (2005) (3)
- Multimodal communication error detection for driver-car interaction (2007) (3)
- Meta-Learning to Guide Segmentation (2018) (3)
- Instance-Aware Predictive Navigation in Multi-Agent Environments (2021) (3)
- From Large-Scale Object Classifiers to Large-Scale Object Detectors: An Adaptation Approach (2014) (3)
- QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking (2022) (3)
- Gesture + Play (2002) (3)
- Guest Editors' Introduction to the Special Section on Learning with Shared Information for Computer Vision and Multimedia Analysis (2018) (3)
- Visual Attention Emerges from Recurrent Sparse Reconstruction (2022) (3)
- Refine and Represent: Region-to-Object Representation Learning (2022) (3)
- Non-parametric and light-field deformable models (2006) (3)
- Magic Morphin' Mirror: Person Detection and Tracking (1998) (3)
- Efficient Receptive Field Learning by Dynamic Gaussian Structure (2019) (3)
- On the representation of occluded shapes (1991) (3)
- Combining Simple Models to Approximate Complex Dynamics (2004) (3)
- Learning Compact Convolutional Neural Networks with Nested Dropout (2014) (3)
- Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022 (2022) (3)
- Exposing the Limits of Video-Text Models through Contrast Sets (2022) (3)
- Multitask Vision-Language Prompt Tuning (2022) (3)
- Finding lost children (2011) (3)
- LabelAR (2019) (2)
- Studying Bias in GANs through the Lens of Race (2022) (2)
- End to End Learning in Autonomous Driving Systems (2020) (2)
- Learning Rich Image Representation with Deep Layer Aggregation (2018) (2)
- Differentiable Gradient Sampling for Learning Implicit 3D Scene Reconstructions from a Single Image (2022) (2)
- Grounding Visual Explanations (Extended Abstract) (2017) (2)
- Guiding Pretraining in Reinforcement Learning with Large Language Models (2023) (2)
- Design and Implementation of a Visuo-Haptic Data Acquisition System for Robotic Learning of Surface Properties (2016) (2)
- Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens (2022) (2)
- High-Level Analysis of Human Pose (2002) (2)
- Learning cross-modal appearance models with application to tracking (2003) (2)
- Hierarchical Deep Reinforcement Learning Agent with Counter Self-play on Competitive Games (2018) (2)
- Exploring Simple and Transferable Recognition-Aware Image Processing (2019) (2)
- Uncertainty-guided Lifelong Learning in Bayesian Networks (2018) (2)
- Approximate Nearest Neighbor Regression in Very High Dimensions (2006) (2)
- Virtual Visual Hulls: Example-Based 3D Shape Estimation from a Single Silhouette (2004) (2)
- Modular Networks for Compositional Instruction Following (2021) (2)
- Cross-Linked Variational Autoencoders for Generalized Zero-Shot Learning (2019) (2)
- Voxel-informed Language Grounding (2022) (2)
- A Projected Subgradient Method for Scalable Multi-Task Learning (2008) (2)
- Visual Domain Adaptation Using Regularized Cross- Domain Transforms (2010) (2)
- The Ratio Method for Multi-view Color Constancy (2011) (1)
- FOLS : Factorized Orthogonal Latent Spaces (2010) (1)
- Visually guided interaction and animation (1994) (1)
- Calculating Depth From Focus Using A Pyramid Architecture (1988) (1)
- Why Size Matters: Feature Coding as Nystrom Sampling (2013) (1)
- Chapter 5.2 – A Geometric Approach to Robotic Laundry Folding1 (2015) (1)
- Task-Aware Deep Sampling for Feature Generation (2019) (1)
- Explaining Reinforcement Learning Policies through Counterfactual Trajectories (2022) (1)
- Scoring-Aggregating-Planning: Learning task-agnostic priors from interactions and sparse rewards for zero-shot generalization (2019) (1)
- Confidence Adaptive Anytime Pixel-Level Recognition (2021) (1)
- for Recent Trends in 3 D Computer Vision Learning with Side Information through Modality Hallucination (2016) (1)
- G^3: Geolocation via Guidebook Grounding (2022) (1)
- Teachable Reinforcement Learning via Advice Distillation (2022) (1)
- Composable Semi-parametric Modelling for Long-range Motion Generation (2019) (1)
- Weakly-Supervised Trajectory Segmentation for Learning Reusable Skills (2019) (1)
- PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data (2022) (1)
- Interacting with computers using images for search and automation (2009) (1)
- Disentangled Action Recognition with Knowledge Bases (2022) (1)
- Using Language to Extend to Unseen Domains (2022) (1)
- Compositional GAN (Extended Abstract): Learning Image-Conditional Binary Composition (2019) (1)
- Generating Visual Explanations with Natural Language (2021) (1)
- Introduction to the CVIU special issue on "Parts and Attributes: Mid-level representation for object recognition, scene classification and object detection" (2015) (1)
- Artificial Intelligence and Statistics Conference (2010) (1)
- Robust Visual Person Tracking for Interactive Displays (1998) (1)
- TAFE-Net: Task-Aware Feature Embeddings for Efficient Learning and Inference. (2018) (1)
- Nonlinear Latent Variable Models for Video Sequences (2005) (1)
- Region-level Active Learning for Cluttered Scenes (2021) (1)
- Detecting communication errors from visual cues during the system's conversational turn (2007) (1)
- Real-time audio-visual tracking for meeting analysis (2004) (1)
- Mass hallucination (1998) (1)
- Supplementary Material to : Adversarial Inference for Multi-Sentence Video Description (2019) (1)
- Fast concurrent object classification and localization (2008) (1)
- Dropout Reduces Underfitting (2023) (1)
- Visually-Grounded Bayesian Word Learning (2012) (1)
- PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models (2023) (1)
- Region-level Active Detector Learning (2021) (1)
- Explaining robot policies (2021) (1)
- Don ' t Look Back : Post-hoc Category Detection via Sparse Reconstruction (2012) (1)
- Parameter-Sensitive Hashing for Fast Pose Estimation (2006) (1)
- Scaling Vision-Language Models with Sparse Mixture of Experts (2023) (0)
- Integrated descriptions for vision (1990) (0)
- Semantic Segmentation and Image Processing with Convnets (2017) (0)
- Show, Attend, Control, and Justify: Interpretable Learning for Self-Driving Cars (2017) (0)
- Prior Knowledge-Guided Attention in Self-Supervised Vision Transformers (2022) (0)
- Captioning Images with Diverse Objects Supplementary Material (2017) (0)
- Vision-Aided Acoustic Array Processing for Perceptive Environments (2001) (0)
- for QVH IGHLIGHTS : Detecting Moments and Highlights in Videos via Natural Language Queries (2021) (0)
- Learning Embeddings for Fast Approximate Nearest Neighbor Retrieval (2006) (0)
- Proceedings of the 5th International Conference on Multimodal Interfaces, ICMI 2003, Vancouver, British Columbia, Canada, November 5-7, 2003 (2003) (0)
- Ego 4 D PNR Temporal Localization Challenge 2022 (2022) (0)
- Learning and Verification of Task Structure in Instructional Videos (2023) (0)
- Adversarial Continual Learning ( Supplementary Materials ) (2020) (0)
- Extended Abstract: Concept Acquisition Through Meta-Learning (2017) (0)
- Language Embedding Space aug θ Training Domain Unseen Domain ≈ Training Domain “ Snowy ” similarity “ Sunny ” similarity (2023) (0)
- General Examination on Technical Area : Recognizing human activity in audio / visual scenes (0)
- Towards Practical Robot Manipulation using Relational Reinforcement Learning (2019) (0)
- Against Edges: Function Approximation with Multiple Support Maps (1991) (0)
- T-2 T-1 T Frame Input Image Region Proposals Monocular 3 D Estimation Deep Association Multi-frame Refinement (2018) (0)
- Recovery of integrated image descriptions (1990) (0)
- Statement in Computer Vision (2015) (0)
- New Results - Visual recognition in images (2013) (0)
- Variational Adversarial Active Learning-Extended Abstract (2020) (0)
- Perceptive agents with attentive interfaces : learning and vision for man-machine systems (1996) (0)
- Hybrid rigid and non-rigid image-based modeling of articulated figures (1998) (0)
- REMEMBERING FOR THE RIGHT REASONS: EXPLANATIONS REDUCE (2021) (0)
- Decentralized Vehicle Coordination: The Berkeley DeepDrive Drone Dataset (2022) (0)
- Dialogue Context for Visual Feedback Recognition (2007) (0)
- Semantic & Panoptic Segmentation and Image Processing with Convnets (2022) (0)
- irror: Person etection and Trac (1998) (0)
- Pssg/t E X 1.10 Users Guide (2008) (0)
- Multi-Modal Recognition Using Multiple Views (2001) (0)
- ICMI'03: Fifth International Conference on Multimodal Interfaces: Preface (2003) (0)
- Proceedings of the 6th International Conference on Multimodal Interfaces, ICMI 2004, State College, PA, USA, October 13-15, 2004 (2004) (0)
- Object Recognition with Latent Conditional Random Fields by Ariadna Quattoni (2005) (0)
- Modeling the Uncertainty in Inverse Radiometric Calibration (2011) (0)
- Program Synthesis for Autonomous Driving Decisions (2020) (0)
- Supplementary Material for Automatic Augmentation Policies for Self-Supervised Learning (2021) (0)
- IDEIXIS --- IMAGE-BASED DEIXIS FOR RECOGNIZING (2014) (0)
- Invited talk: image recognition for intelligent interfaces (2009) (0)
- Supplementary Material : Natural Language Object Retrieval (2016) (0)
- UvA-DARE ( Digital Academic Repository ) Textual Explanations for Self-Driving Vehicles (2018) (0)
- E Perceptive Presence Systems Automatically Convey User States to a Remote Location or Application without User Input. Our Component-based Architecture Creates Presence Applications Using Perceptual User Interface Widgets. Perceptive Presence 26 (0)
- U SING A SEMANTIC BOTTLENECK (2021) (0)
- Towards Explainable and Advisable Model for Self‐driving Cars (2021) (0)
- Appendix: A ConvNet for the 2020s (2022) (0)
- Information-theoretic fusion for multimodal interfaces (2002) (0)
- Blurring Structure and Learning to Optimize and Adapt Receptive Fields (2019) (0)
- Perceptive presence - Computer Graphics and Applications, IEEE (2001) (0)
- Rendering articulated figures from examples (1999) (0)
- Computer Science and Artificial Intelligence Laboratory Nonlinear Latent Variable Models for Video Sequences (2005) (0)
- 1 Dialogue Context for Visual Feedback Recognition (2007) (0)
- Special Editors' Introduction to the Special Issue on Award-Winning Papers from the IEEE Conference on Computer Vision and Pattern Recognition 2010 (CVPR 2010) (2012) (0)
- The Ratio Method for Multiview Color Constancy (2011) (0)
- View-Invariant Change Captioning (2019) (0)
- Compositional GAN: Learning Image-Conditional Binary Composition (2020) (0)
- CONTINUOUS CONTROL (2021) (0)
- Multilayer robust estimation for motion segmentation (1992) (0)
- Person Tracking with Stereo Range Sensors (2000) (0)
- Scalable Transform-based Domain Adaptation (2013) (0)
- Localizing a Network of Non-Overlapping Cameras (2005) (0)
- Audio-video array source localization for intelligent environments (2002) (0)
- New in-situ training image and one-shot detection model without adaptation GENETIM detection model for synset bottle (2014) (0)
- Multi-Person Tracking with Stereo Range Sensors (2001) (0)
- Fast Pedestrian Detection from a Moving Vehicle by Shuang You (2007) (0)
- Light Field Morphable Models (2003) (0)
- Learning Detection with Diverse Proposals Supplementary Material (2017) (0)
- Face-R esponsive Interfaces : From Direct Manipulation to Perceptive P resence (0)
- Uncertainty-Guided Continual Learning in Bayesian Neural Networks - Extended Abstract (2019) (0)
- What Problem is being solved ? Ø Problem Domain : Visual Question Answering (0)
- Does unsupervised grammar induction need pixels? (2022) (0)
- Deformable Part Models are Convolutional Neural Networks Tech report (2014) (0)
- Supplementary Material: Quasi-Dense Similarity Learning for Multiple Object Tracking (2021) (0)
- Exploiting and Introducing Parallelism for Efficient Object Detection (2013) (0)
- Fighting Copycat Agents in Behavioral Cloning from Multiple Observations (2020) (0)
- Step 1 : Update Estimated Prototypes Step 2 : Update Feature Extractor Estimated Prototypes Labeled Source Labeled Target Unlabeled Target Class (2019) (0)
- Knowledge-Guided Self-Supervised Vision Transformers for Medical Imaging (2022) (0)
- Published at ICLR 2021 Workshop on Security and Safety in Machine Learning Systems (2021) (0)
- Nonparametric Representations for Integrated Inference, Control, and Sensing (2015) (0)
- Top-Down Visual Attention from Analysis by Synthesis (2023) (0)
- Tracking Articualted Figures with Cylindrical Limb Constraints (2000) (0)
This paper list is powered by the following services:
Other Resources About Trevor Darrell
What Schools Are Affiliated With Trevor Darrell?
Trevor Darrell is affiliated with the following schools: