Antonio B. Torralba

Antonio B. Torralba's AcademicInfluence.com Rankings

Computer Science: World Rank #2396, Historical Rank #2501
Artificial Intelligence: World Rank #262, Historical Rank #267
Database: World Rank #277, Historical Rank #288

Why Is Antonio B. Torralba Influential?

Antonio B. Torralba's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope (2001) (6844)
Learning Deep Features for Discriminative Localization (2015) (6566)
LabelMe: A Database and Web-Based Tool for Image Annotation (2008) (3406)
Learning Deep Features for Scene Recognition using Places Database (2014) (2788)
SUN database: Large-scale scene recognition from abbey to zoo (2010) (2703)
Places: A 10 Million Image Database for Scene Recognition (2018) (2698)
Spectral Hashing (2008) (2562)
Skip-Thought Vectors (2015) (2100)
Unbiased look at dataset bias (2011) (2051)
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition (2008) (2007)
Learning to predict where humans look (2009) (1959)
Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books (2015) (1817)
Scene Parsing through ADE20K Dataset (2017) (1730)
Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. (2006) (1633)
SIFT Flow: Dense Correspondence across Scenes and Its Applications (2011) (1620)
Building the gist of a scene: the role of global image features in recognition. (2006) (1465)
Recognizing indoor scenes (2009) (1459)
Generating Videos with Scene Dynamics (2016) (1253)
Object Detectors Emerge in Deep Scene CNNs (2014) (1148)
Network Dissection: Quantifying Interpretability of Deep Visual Representations (2017) (1102)
Context-based vision system for place and object recognition (2003) (1001)
Semantic Understanding of Scenes Through the ADE20K Dataset (2016) (958)
The role of context in object recognition (2007) (932)
Contextual Priming for Object Detection (2003) (925)
Statistics of natural image categories (2003) (870)
SoundNet: Learning Sound Representations from Unlabeled Video (2016) (864)
Small codes and large image databases for recognition (2008) (807)
Sharing Visual Features for Multiclass and Multiview Object Detection (2007) (802)
Temporal Relational Reasoning in Videos (2017) (791)
Sharing features: efficient boosting procedures for multiclass object detection (2004) (739)
A large-scale benchmark dataset for event recognition in surveillance video (2011) (700)
SIFT Flow: Dense Correspondence across Different Scenes (2008) (692)
SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels (2013) (631)
What Do Different Evaluation Metrics Tell Us About Saliency Models? (2016) (613)
Eye Tracking for Everyone (2016) (611)
MovieQA: Understanding Stories in Movies through Question-Answering (2015) (574)
A Benchmark of Computational Models of Saliency to Predict Human Fixations (2012) (551)
Top-down control of visual attention in object detection (2003) (535)
Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence (2016) (509)
Learning the signatures of the human grasp using a scalable tactile glove (2019) (489)
Undoing the Damage of Dataset Bias (2012) (480)
Anticipating Visual Representations from Unlabeled Video (2015) (442)
Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes (2003) (421)
Contextual Models for Object Detection Using Boosted Random Fields (2004) (419)
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding (2018) (412)
Nonparametric Scene Parsing via Label Transfer (2011) (400)
What makes an image memorable? (2011) (384)
The Sound of Pixels (2018) (380)
Through-Wall Human Pose Estimation Using Radio Signals (2018) (374)
Places: An Image Database for Deep Scene Understanding (2016) (373)
Learning hierarchical models of scenes, objects, and parts (2005) (372)
A Compositional Object-Based Approach to Learning Physical Dynamics (2016) (370)
Nonparametric scene parsing: Label transfer via dense scene alignment (2009) (367)
Ambient Sound Provides Supervision for Visual Learning (2016) (360)
Exploiting hierarchical context on a large database of object categories (2010) (359)
Depth Estimation from Image Structure (2002) (356)
Learning to share visual appearance for multiclass object detection (2011) (339)
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks (2018) (324)
Modelling search for people in 900 scenes: A combined source model of eye guidance (2009) (317)
Specular reflections and the perception of shape. (2004) (316)
Visually Indicated Sounds (2015) (304)
HOGgles: Visualizing Object Detection Features (2013) (302)
SUN Database: Exploring a Large Collection of Scene Categories (2014) (301)
Motion magnification (2005) (299)
Learning Cross-Modal Embeddings for Cooking Recipes and Food Images (2017) (286)
Modeling global scene factors in attention. (2003) (282)
Semi-Supervised Learning in Gigantic Image Collections (2009) (279)
Single Image 3D Interpreter Network (2016) (278)
Parsing IKEA Objects: Fine Pose Estimation (2013) (273)
Debiased Contrastive Learning (2020) (273)
Semantic photo manipulation with a generative image prior (2019) (270)
Recognizing scene viewpoint using panoramic place representation (2012) (263)
Understanding and Predicting Image Memorability at a Large Scale (2015) (257)
Dataset Issues in Object Recognition (2006) (251)
CLEVRER: CoLlision Events for Video REpresentation and Reasoning (2019) (248)
Interpreting Deep Visual Representations via Network Dissection (2017) (231)
LabelMe video: Building a video database with human annotations (2009) (228)
Learning with Hierarchical-Deep Models (2013) (219)
Unsupervised Learning of Spoken Language with Visual Context (2016) (219)
RF-based 3D skeletons (2018) (218)
Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids (2018) (216)
What Makes a Photograph Memorable? (2014) (214)
Interpretable Basis Decomposition for Visual Explanation (2018) (212)
BARF: Bundle-Adjusting Neural Radiance Fields (2021) (211)
Understanding the role of individual units in a deep neural network (2020) (211)
VirtualHome: Simulating Household Activities Via Programs (2018) (207)
Statistical context priming for object detection (2001) (206)
Describing Visual Scenes Using Transformed Objects and Parts (2008) (206)
Seeing What a GAN Cannot Generate (2019) (201)
Visual Object Networks: Image Generation with Disentangled 3D Representations (2018) (198)
Ego4D: Around the World in 3,000 Hours of Egocentric Video (2021) (193)
Object Detection and Localization Using Local and Global Features (2006) (182)
The Sound of Motions (2019) (178)
Understanding the Intrinsic Memorability of Images (2011) (177)
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images (2018) (174)
Meta-Sim: Learning to Generate Synthetic Datasets (2019) (173)
LabelMe: Online Image Annotation and Applications (2010) (172)
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization (2017) (170)
Generating the Future with Adversarial Transformers (2017) (168)
Multidimensional Spectral Hashing (2012) (168)
One-Shot Learning with a Hierarchical Nonparametric Bayesian Model (2011) (166)
What are the shapes of response time distributions in visual search? (2011) (164)
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input (2018) (163)
Intrinsic and extrinsic effects on image memorability (2015) (160)
Describing Visual Scenes using Transformed Dirichlet Processes (2005) (159)
A Tree-Based Context Model for Object Recognition (2012) (155)
Assessing the Quality of Actions (2014) (155)
Where are they looking? (2015) (155)
Using the forest to see the trees: exploiting context for visual object detection and localization (2010) (149)
Learning Aligned Cross-Modal Representations from Weakly Aligned Data (2016) (148)
Gaze360: Physically Unconstrained Gaze Estimation in the Wild (2019) (145)
Semantic Label Sharing for Learning with Many Categories (2010) (145)
Anticipating the future by watching unlabeled video (2015) (143)
Where Should Saliency Models Look Next? (2016) (143)
Transfer Learning by Borrowing Examples for Multiclass Object Detection (2011) (141)
Music Gesture for Visual Sound Separation (2020) (137)
Recognizing City Identity via Attribute Analysis of Geo-tagged Images (2014) (135)
Dataset Distillation (2018) (134)
How many pixels make an image? (2009) (134)
DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort (2021) (133)
Object Recognition by Scene Alignment (2007) (132)
Hybrid images (2006) (128)
Context models and out-of-context objects (2012) (122)
SegICP: Integrated deep semantic segmentation and pose estimation (2017) (120)
Memorability of Image Regions (2012) (117)
Unsupervised Detection of Regions of Interest Using Iterative Link Analysis (2009) (114)
Self-Supervised Moving Vehicle Tracking With Stereo Sound (2019) (110)
Localizing 3D cuboids in single-view images (2012) (110)
Modifying the Memorability of Face Photographs (2013) (107)
Turning Corners into Cameras: Principles and Methods (2017) (106)
Random Lens Imaging (2006) (105)
Cross-Modal Scene Networks (2016) (104)
See, Hear, and Read: Deep Aligned Representations (2017) (102)
Building a database of 3D scenes from user annotations (2009) (99)
A Data-Driven Approach for Event Prediction (2010) (98)
Learning human–environment interactions using conformal tactile textiles (2021) (98)
Global semantic classification of scenes using power spectrum templates (1999) (95)
Fixations on low-resolution images. (2010) (94)
Propagation Networks for Model-Based Control Under Partial Observation (2018) (92)
FPM: Fine Pose Parts-Based Model with 3D CAD Models (2014) (92)
Part and appearance sharing: Recursive Compositional Models for multi-view (2010) (90)
Revisiting the Importance of Individual Units in CNNs via Ablation (2018) (90)
Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering (2020) (90)
Evaluation of image features using a photorealistic virtual world (2011) (89)
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos (2020) (88)
Rewriting a Deep Generative Model (2020) (87)
EditGAN: High-Precision Semantic Image Editing (2021) (87)
Semantic organization of scenes using discriminant structural templates (1999) (85)
Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization (2021) (80)
Learning Compositional Koopman Operators for Model-Based Control (2019) (80)
Self-supervised Audio-visual Co-segmentation (2019) (80)
The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement (2020) (80)
Scene-Centered Description from Spatial Envelope Properties (2002) (80)
Understanding Intra-Class Knowledge Inside CNN (2015) (78)
Depth from Familiar Objects: A Hierarchical Model for 3D Scenes (2006) (77)
3D-Aware Scene Manipulation via Inverse Graphics (2018) (73)
Learning to Zoom: a Saliency-Based Sampling Layer for Neural Networks (2018) (72)
Foley Music: Learning to Generate Music from Videos (2020) (71)
Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition (2016) (70)
Connecting Touch and Vision via Cross-Modal Prediction (2019) (70)
Open Vocabulary Scene Parsing (2017) (68)
Looking Beyond the Visible Scene (2014) (68)
Using AI and Social Media Multimodal Content for Disaster Response and Management: Opportunities, Challenges, and Future Directions (2020) (67)
Creating and exploring a large photorealistic virtual space (2010) (66)
Diverse Image Generation via Self-Conditioned GANs (2020) (66)
Are all training examples equally valuable? (2013) (64)
Is Saki #delicious?: The Food Perception Gap on Instagram and Its Relation to Health (2017) (64)
Causal Discovery in Physical Systems from Videos (2020) (63)
Learning to Simulate Dynamic Environments With GameGAN (2020) (61)
Compositional Visual Generation with Composable Diffusion Models (2022) (59)
Face-to-BMI: Using Computer Vision to Infer Body Mass Index on Social Media (2017) (59)
Visualizing Object Detection Features (2015) (59)
Learning to Act Properly: Predicting and Explaining Affordances from Images (2017) (56)
Exploiting Occlusion in Non-Line-of-Sight Active Imaging (2017) (56)
Through-Wall Human Mesh Recovery Using Radio Signals (2019) (56)
Dataset Distillation by Matching Training Trajectories (2022) (55)
Revealing hidden scenes by photon-efficient occlusion-based opportunistic active imaging. (2018) (55)
Single Image Intrinsic Decomposition Without a Single Intrinsic Image (2018) (54)
3D Neural Scene Representations for Visuomotor Control (2021) (54)
Pre-Trained Language Models for Interactive Decision-Making (2022) (53)
Visualizing and Understanding Generative Adversarial Networks (Extended Abstract) (2019) (52)
Following Gaze in Video (2017) (52)
Accidental pinhole and pinspeck cameras: Revealing the scene outside the picture (2012) (51)
Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning (2017) (49)
An efficient neuromorphic analog network for motion estimation (1999) (47)
Properties and applications of shape recipes (2003) (47)
Inferring Light Fields from Shadows (2018) (44)
Neural Turtle Graphics for Modeling City Road Layouts (2019) (43)
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration (2020) (42)
Visual Grounding of Learned Physical Models (2020) (41)
Contextual Influences on Saliency (2004) (40)
Image memorability and visual inception (2012) (39)
Predicting Motivations of Actions by Leveraging Text (2014) (39)
Infinite Images: Creating and Exploring a Large Photorealistic Virtual Space (2008) (38)
Skill Induction and Planning with Latent Language (2021) (38)
Inferring the Why in Images (2014) (36)
DriveGAN: Towards a Controllable High-Quality Neural Simulation (2021) (36)
Learning to Learn with Compound HD Models (2011) (35)
Estimating scene typicality from human ratings and image features (2011) (34)
Notes on image annotation (2012) (34)
Graphical Model For Recognizing Scenes and Objects. (2003) (33)
Improving Inversion and Generation Diversity in StyleGAN using a Gaussianized Latent Space (2020) (33)
Estimating Generalization under Distribution Shifts via Domain-Invariant Representations (2020) (33)
SLAC: A Sparsely Labeled Dataset for Action Classification and Localization (2017) (32)
Human Learning of Contextual Priors for Object Search: Where does the time go? (2005) (32)
Paint by Word (2021) (32)
Editing a classifier by rewriting its prediction rules (2021) (31)
The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark Towards Physically Realistic Embodied AI (2021) (30)
Detecting faces in impoverished images (2010) (30)
Learning to See by Looking at Noise (2021) (30)
Detecting natural disasters, damage, and incidents in the wild (2020) (30)
Contextual Modulation of Target Saliency (2001) (29)
GAN-Supervised Dense Visual Alignment (2021) (26)
How to Make a Pizza: Learning a Compositional Layer-Based GAN Model (2019) (26)
Accidental Pinhole and Pinspeck Cameras (2014) (25)
Learning to Compose Visual Relations (2021) (24)
Shared Features for Multiclass Object Detection (2006) (24)
Next-generation deep learning based on simulators and synthetic data (2021) (24)
Learning visual biases from human imagination (2014) (24)
3D Interpreter Networks for Viewer-Centered Wireframe Modeling (2018) (24)
Real-Time Object Pose Estimation with Pose Interpreter Networks (2018) (23)
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning (2021) (23)
Modeling and Analysis of Dynamic Behaviors of Web Image Collections (2010) (23)
Deep Audio Priors Emerge From Harmonic Convolutional Networks (2020) (21)
Synthesizing Environment-Aware Activities via Activity Sketches (2019) (21)
Basic level scene understanding: categories, attributes and structures (2013) (21)
Energy-Based Models for Continual Learning (2020) (21)
Grounding Spoken Words in Unlabeled Video (2019) (20)
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations (2022) (20)
Natural Language Descriptions of Deep Visual Features (2022) (20)
Correcting Robot Plans with Natural Language Feedback (2022) (19)
Shape Recipes: Scene Representations that Refer to the Image (2010) (19)
Inverting and Visualizing Features for Object Detection (2012) (19)
What Makes a Photograph Memorable? (2014) (18)
Learning Neural Acoustic Fields (2022) (18)
Robust Contrastive Learning against Noisy Views (2022) (18)
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos (2022) (18)
Shape Anchors for Data-Driven Multi-view Reconstruction (2013) (17)
Computer Vision in the Operating Room: Opportunities and Caveats (2021) (17)
Guest Editorial: Big Data (2016) (14)
Simultaneous detection and segmentation for generic objects (2011) (14)
Shape from Sheen (2009) (14)
Intelligent Carpet: Inferring 3D Human Pose from Tactile Signals (2021) (13)
Object and scene recognition in tiny images (2010) (13)
Denoised MDPs: Learning World Models Better Than the World Itself (2022) (13)
Using Computer Vision to Study the Effects of BMI on Online Popularity and Weight-Based Homophily (2018) (12)
From retinal circuits to motion processing: a neuromorphic approach to velocity estimation (1997) (12)
Comparing the Interpretability of Deep Networks via Network Dissection (2019) (11)
Disentangling visual and written concepts in CLIP (2022) (11)
Basic level scene understanding: from labels to structure and beyond (2012) (11)
ConceptFusion: Open-set Multimodal 3D Mapping (2023) (11)
Deep Feedback Inverse Problem Solver (2021) (10)
Virtual Correspondence: Humans as a Cue for Extreme-View Geometry (2022) (9)
Learning Words by Drawing Images (2019) (9)
Who is Mistaken? (2016) (9)
Matching and Predicting Street Level Images (2010) (9)
Measuring Generalization with Optimal Transport (2021) (8)
Finding Fallen Objects Via Asynchronous Audio-Visual Integration (2022) (8)
Learning Program Representations for Food Images and Cooking Recipes (2022) (8)
Mapping human visual representations in space and time by neural networks. (2015) (8)
How Little Do We Need for 3-D Shape Perception? (2011) (8)
ShadowCam: Real-Time Detection of Moving Obstacles Behind A Corner For Autonomous Vehicles (2018) (8)
Search for arbitrary objects in natural scenes is remarkably efficient (2010) (7)
Composing Ensembles of Pre-trained Models via Iterative Consensus (2022) (7)
A taxonomy of visual scenes: Typicality ratings and hierarchical classification (2010) (7)
What You Can Learn by Staring at a Blank Wall (2021) (6)
Toward a Visual Concept Vocabulary for GAN Latent Space (2021) (6)
Saliency, objects and scenes: global scene factors in attention and object detection (2004) (6)
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions (2021) (6)
Interpreting Visual Representations of Neural Networks via Network Dissection (2018) (6)
Dynamic Modeling of Hand-Object Interactions via Tactile Sensing (2021) (6)
LID 2020: The Learning from Imperfect Data Challenge Results (2020) (5)
Following Gaze Across Views (2016) (5)
Acquiring Visual Classifiers from Human Imagination (2014) (5)
Wearable ImageNet: Synthesizing Tileable Textures via Dataset Distillation (2022) (5)
Global Depth Perception from Familiar Scene Structure (2001) (5)
A boosting approach for the simultaneous detection and segmentation of generic objects (2013) (5)
MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning (2022) (5)
Modeling visual search in a thousand scenes: The roles of saliency, target features, and scene context (2010) (4)
A 2d + 3d rich data approach to scene understanding (2013) (4)
Top-down control of visual attention in real world scenes (2010) (4)
An ensemble prior of image structure for cross-modal inference (2005) (4)
Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales (2021) (4)
Debiasing Vision-Language Models via Biased Prompts (2023) (3)
A systolic array with applications to image processing and wire-routing in VLSI circuits (1991) (3)
LabelMe: Online Image Annotation and Applications (2010) (3)
A Framework for Encoding Object-level Image Priors (2011) (3)
Remodeling visual search: How gamma distributions can bring those boring old RTs to life (2010) (3)
Counterfactual Image Networks (2018) (3)
Height and Uprightness Invariance for 3D Prediction From a Single View (2020) (3)
Scaling up instance annotation via label propagation (2021) (3)
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events (2020) (2)
Trees and beyond: exploiting and improving tree-structured graphical models (2011) (2)
Global Semantic Classification of Scenes using Power Spectrum Templates (1999) (2)
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation (2023) (2)
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction (2022) (2)
Incidents1M: A Large-Scale Dataset of Images With Natural Disasters, Damage, and Incidents (2022) (2)
Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence (2020) (2)
Visualizing and Understanding GANs (2019) (2)
LEARNING PHYSICAL DYNAMICS (2017) (2)
Asymmetrical filters for vision chips: a basis for the design of large sets of spatial and spatiotemporal filters (1999) (2)
Unsupervised Non-parametric Geospatial Modeling from Ground Imagery (2014) (2)
OPEn: An Open-ended Physics Environment for Learning Without a Task (2021) (2)
Nonparametric Bayesian Texture Learning and Synthesis (2009) (2)
Self-Supervised Segmentation and Source Separation on Videos (2019) (2)
Local Relighting of Real Scenes (2022) (1)
Totems: Physical Objects for Verifying Visual Integrity (2022) (1)
Guest Editorial: Generative Adversarial Networks for Computer Vision (2020) (1)
Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps (2022) (1)
ActionSense: A Multimodal Dataset and Recording Framework for Human Activities Using Wearable Sensors in a Kitchen Environment (2022) (1)
Exemplar Network: A Generalized Mixture Model (2014) (1)
Intrinsic and Extrinsic Effects on Image Memorability Supplemental Material (2015) (1)
BT2: Backward-compatible Training with Basis Transformation (2022) (1)
What's behind the box? Measuring scene context effects with Shannon's guessing game on indoor scenes (2010) (1)
Reframing Explanation as an Interactive Medium: The EQUAS (Explainable QUestion Answering System) Project (2021) (1)
How image statistics drive shape-from-texture and shape-from-specularity (2010) (1)
Towards cognitive saliency: narrowing the gap to human performance (2017) (1)
HARMONIC CONVOLUTIONAL NETWORKS (2020) (1)
Contextual Influences on Saliency (2004) (1)
Publisher Correction: Learning human–environment interactions using conformal tactile textiles (2021) (1)
Matrix? Why does Cypher betray Morpheus? How does the movie end? (2016) (0)
Detecting Everything in the Open World: Towards Universal Object Detection (2023) (0)
Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration (2021) (0)
Accidental Cameras. Revealing the scene outside the picture. (2013) (0)
Accidental Pinhole and Pinspeck Cameras (2014) (0)
A Latent Variable Ranking Model for Content-Based Retrieval (2012) (0)
Supplementary Material: EditGAN: High-Precision Semantic Image Editing (2021) (0)
Global statistical features and early scene interpretation (2010) (0)
Supplemental Material: Where Should Saliency Models Look Next? (2016) (0)
Deep Neural Networks explain spatio-temporal dynamics of visual scene and object processing (2016) (0)
A data-driven approach for event prediction (2010) (0)
Depth perception from familiar scene structure (2010) (0)
Aliasing is a Driver of Adversarial Attacks (2022) (0)
SUN Database: Exploring a Large Collection of Scene Categories (2014) (0)
Benchmarking Convolutional Neural Networks for Object Segmentation and Pose Estimation (2017) (0)
Minimal complexity velocity-tuned filters with analogue neuromorphic networks: A theoretical approach for efficient design (1998) (0)
Shared ' Cross + Modal ' Representation religious , * church , * plants (2016) (0)
Modifying a face to make it more memorable or forgettable (2014) (0)
Object Detection and Contextual Priming (2002) (0)
Open-vocabulary Panoptic Segmentation with Embedding Modulation (2023) (0)
Section 16.2 Classifying Images of Single Objects (0)
The Role of Embedding Complexity in Domain-invariant Representations (2019) (0)
Reframing Explanation as an Interactive Medium: The EQUAS (Explainable QUestion Answering System) Project (2021) (0)
Learning the signatures of the human grasp using a scalable tactile glove (2019) (0)
On the Units of GANs (2020) (0)
Predicting the future (2011) (0)
Preprint prepared for ArXiv submission (2018) (0)
Object Detectors Emerge in Deep Scene CNNs (2015) (0)
Inferring Regional and Temporal Eating Habits from Social Media Images (2016) (0)
Quantifying Interpretations of Deep Visual Representations (2017) (0)
Accidental Pinhole and Pinspeck Cameras (2022) (0)
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning (2023) (0)
Role of Low-level Mechanisms in Brightness Perception (2010) (0)
ON ( plate , table ) : 2 ON ( glass , table ) : 1 ON ( fork , table ) : 1 Ground-truth Goal VirtualHome-Social Task Demonstration (2021) (0)
Vocabulary Scene Parsing (0)
Predicting object and scene descriptions with an information-theoretic model of pragmatics (2010) (0)
Supplementary Material: Deep Feedback Inverse Problem Solver (2020) (0)
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos (2023) (0)
SCENE RECOGNITION WITH BAG OF WORDS (2016) (0)
Modeling Context Effects on Image Memorability (2015) (0)
Remembering thousands of natural images with high fidelity (2008) (0)
Scene categorization and detection: the power of global features (2010) (0)
Procedural Image Programs for Representation Learning (2022) (0)
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes (2023) (0)
Goals Inferring the Why in Images (2015) (0)
Interaction of contour, shading and texture in natural images (2010) (0)
Transfer Matrix : A x ( b ) Sample Estimation Gain Image Direction of Light Integration W al l 1-D H idden Scene : Positive Negative ✓ = 0 ( a ) Constructing Transfer Matrix (2017) (0)
Generalizing Dataset Distillation via Deep Generative Prior (2023) (0)
Quantifying Context Effects on Image Memorability. (2015) (0)
Spatio-chromatic processing in the human retina: towards an optimal trade-off between spatial resolution of luminance and range colour perception (1998) (0)
NOPA: Neurally-guided Online Probabilistic Assistance (2022) (0)
Not all scene categories are created equal: The role of object and layout diagnosticity in scene gist understanding (2010) (0)
Visualizing Object Detection Features (2016) (0)
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants (2023) (0)
Recognition with purely 3D information (2010) (0)
Comparing the Interpretability of the Deep Visual Representations via Network Dissection (2017) (0)
Asymmetrical Filters for Vision Chips: a Basis for the Design of Large Sets of Spatial and Spatiotemporal (1999) (0)
NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models (2023) (0)
Object Detectors Emerge from Training CNNs for Scene Recognition (2015) (0)
Shape from sheen (2009) (0)
Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning (2018) (0)
The Power of Averaging (2006) (0)
Modeling contextual influences on object recognition (2010) (0)
Understanding and Estimating the Adaptability of Domain-Invariant Representations (2020) (0)
Kitchen Units for Food Units for Table . . . Past Future Predictor (2015) (0)
Dataset Distillation by Matching Training Trajectories (2022) (0)
3D Interpreter Networks for Viewer-Centered Wireframe Modeling (2018) (0)

What Schools Are Affiliated With Antonio B. Torralba?

Antonio B. Torralba is affiliated with the following schools:

Massachusetts Institute of Technology
