Antonio B. Torralba
#72,749
Most Influential Person Now
Antonio B. Torralba's AcademicInfluence.com Rankings
Antonio B. Torralbacomputer-science Degrees
Computer Science
#2396
World Rank
#2501
Historical Rank
Artificial Intelligence
#262
World Rank
#267
Historical Rank
Database
#277
World Rank
#288
Historical Rank
Download Badge
Computer Science
Why Is Antonio B. Torralba Influential?
(Suggest an Edit or Addition)Antonio B. Torralba's Published Works
Published Works
- Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope (2001) (6844)
- Learning Deep Features for Discriminative Localization (2015) (6566)
- LabelMe: A Database and Web-Based Tool for Image Annotation (2008) (3406)
- Learning Deep Features for Scene Recognition using Places Database (2014) (2788)
- SUN database: Large-scale scene recognition from abbey to zoo (2010) (2703)
- Places: A 10 Million Image Database for Scene Recognition (2018) (2698)
- Spectral Hashing (2008) (2562)
- Skip-Thought Vectors (2015) (2100)
- Unbiased look at dataset bias (2011) (2051)
- 80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition (2008) (2007)
- Learning to predict where humans look (2009) (1959)
- Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books (2015) (1817)
- Scene Parsing through ADE20K Dataset (2017) (1730)
- Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. (2006) (1633)
- SIFT Flow: Dense Correspondence across Scenes and Its Applications (2011) (1620)
- Building the gist of a scene: the role of global image features in recognition. (2006) (1465)
- Recognizing indoor scenes (2009) (1459)
- Generating Videos with Scene Dynamics (2016) (1253)
- Object Detectors Emerge in Deep Scene CNNs (2014) (1148)
- Network Dissection: Quantifying Interpretability of Deep Visual Representations (2017) (1102)
- Context-based vision system for place and object recognition (2003) (1001)
- Semantic Understanding of Scenes Through the ADE20K Dataset (2016) (958)
- The role of context in object recognition (2007) (932)
- Contextual Priming for Object Detection (2003) (925)
- Statistics of natural image categories (2003) (870)
- SoundNet: Learning Sound Representations from Unlabeled Video (2016) (864)
- Small codes and large image databases for recognition (2008) (807)
- Sharing Visual Features for Multiclass and Multiview Object Detection (2007) (802)
- Temporal Relational Reasoning in Videos (2017) (791)
- Sharing features: efficient boosting procedures for multiclass object detection (2004) (739)
- A large-scale benchmark dataset for event recognition in surveillance video (2011) (700)
- SIFT Flow: Dense Correspondence across Different Scenes (2008) (692)
- SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels (2013) (631)
- What Do Different Evaluation Metrics Tell Us About Saliency Models? (2016) (613)
- Eye Tracking for Everyone (2016) (611)
- MovieQA: Understanding Stories in Movies through Question-Answering (2015) (574)
- A Benchmark of Computational Models of Saliency to Predict Human Fixations (2012) (551)
- Top-down control of visual attention in object detection (2003) (535)
- Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence (2016) (509)
- Learning the signatures of the human grasp using a scalable tactile glove (2019) (489)
- Undoing the Damage of Dataset Bias (2012) (480)
- Anticipating Visual Representations from Unlabeled Video (2015) (442)
- Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes (2003) (421)
- Contextual Models for Object Detection Using Boosted Random Fields (2004) (419)
- Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding (2018) (412)
- Nonparametric Scene Parsing via Label Transfer (2011) (400)
- What makes an image memorable? (2011) (384)
- The Sound of Pixels (2018) (380)
- Through-Wall Human Pose Estimation Using Radio Signals (2018) (374)
- Places: An Image Database for Deep Scene Understanding (2016) (373)
- Learning hierarchical models of scenes, objects, and parts (2005) (372)
- A Compositional Object-Based Approach to Learning Physical Dynamics (2016) (370)
- Nonparametric scene parsing: Label transfer via dense scene alignment (2009) (367)
- Ambient Sound Provides Supervision for Visual Learning (2016) (360)
- Exploiting hierarchical context on a large database of object categories (2010) (359)
- Depth Estimation from Image Structure (2002) (356)
- Learning to share visual appearance for multiclass object detection (2011) (339)
- GAN Dissection: Visualizing and Understanding Generative Adversarial Networks (2018) (324)
- Modelling search for people in 900 scenes: A combined source model of eye guidance (2009) (317)
- Specular reflections and the perception of shape. (2004) (316)
- Visually Indicated Sounds (2015) (304)
- HOGgles: Visualizing Object Detection Features (2013) (302)
- SUN Database: Exploring a Large Collection of Scene Categories (2014) (301)
- Motion magnification (2005) (299)
- Learning Cross-Modal Embeddings for Cooking Recipes and Food Images (2017) (286)
- Modeling global scene factors in attention. (2003) (282)
- Semi-Supervised Learning in Gigantic Image Collections (2009) (279)
- Single Image 3D Interpreter Network (2016) (278)
- Parsing IKEA Objects: Fine Pose Estimation (2013) (273)
- Debiased Contrastive Learning (2020) (273)
- Semantic photo manipulation with a generative image prior (2019) (270)
- Recognizing scene viewpoint using panoramic place representation (2012) (263)
- Understanding and Predicting Image Memorability at a Large Scale (2015) (257)
- Dataset Issues in Object Recognition (2006) (251)
- CLEVRER: CoLlision Events for Video REpresentation and Reasoning (2019) (248)
- Interpreting Deep Visual Representations via Network Dissection (2017) (231)
- LabelMe video: Building a video database with human annotations (2009) (228)
- Learning with Hierarchical-Deep Models (2013) (219)
- Unsupervised Learning of Spoken Language with Visual Context (2016) (219)
- RF-based 3D skeletons (2018) (218)
- Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids (2018) (216)
- What Makes a Photograph Memorable? (2014) (214)
- Interpretable Basis Decomposition for Visual Explanation (2018) (212)
- BARF: Bundle-Adjusting Neural Radiance Fields (2021) (211)
- Understanding the role of individual units in a deep neural network (2020) (211)
- VirtualHome: Simulating Household Activities Via Programs (2018) (207)
- Statistical context priming for object detection (2001) (206)
- Describing Visual Scenes Using Transformed Objects and Parts (2008) (206)
- Seeing What a GAN Cannot Generate (2019) (201)
- Visual Object Networks: Image Generation with Disentangled 3D Representations (2018) (198)
- Ego4D: Around the World in 3,000 Hours of Egocentric Video (2021) (193)
- Object Detection and Localization Using Local and Global Features (2006) (182)
- The Sound of Motions (2019) (178)
- Understanding the Intrinsic Memorability of Images (2011) (177)
- Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images (2018) (174)
- Meta-Sim: Learning to Generate Synthetic Datasets (2019) (173)
- LabelMe: Online Image Annotation and Applications (2010) (172)
- HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization (2017) (170)
- Generating the Future with Adversarial Transformers (2017) (168)
- Multidimensional Spectral Hashing (2012) (168)
- One-Shot Learning with a Hierarchical Nonparametric Bayesian Model (2011) (166)
- What are the shapes of response time distributions in visual search? (2011) (164)
- Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input (2018) (163)
- Intrinsic and extrinsic effects on image memorability (2015) (160)
- Describing Visual Scenes using Transformed Dirichlet Processes (2005) (159)
- A Tree-Based Context Model for Object Recognition (2012) (155)
- Assessing the Quality of Actions (2014) (155)
- Where are they looking? (2015) (155)
- Using the forest to see the trees: exploiting context for visual object detection and localization (2010) (149)
- Learning Aligned Cross-Modal Representations from Weakly Aligned Data (2016) (148)
- Gaze360: Physically Unconstrained Gaze Estimation in the Wild (2019) (145)
- Semantic Label Sharing for Learning with Many Categories (2010) (145)
- Anticipating the future by watching unlabeled video (2015) (143)
- Where Should Saliency Models Look Next? (2016) (143)
- Transfer Learning by Borrowing Examples for Multiclass Object Detection (2011) (141)
- Music Gesture for Visual Sound Separation (2020) (137)
- Recognizing City Identity via Attribute Analysis of Geo-tagged Images (2014) (135)
- Dataset Distillation (2018) (134)
- How many pixels make an image? (2009) (134)
- DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort (2021) (133)
- Object Recognition by Scene Alignment (2007) (132)
- Hybrid images (2006) (128)
- Context models and out-of-context objects (2012) (122)
- SegICP: Integrated deep semantic segmentation and pose estimation (2017) (120)
- Memorability of Image Regions (2012) (117)
- Unsupervised Detection of Regions of Interest Using Iterative Link Analysis (2009) (114)
- Self-Supervised Moving Vehicle Tracking With Stereo Sound (2019) (110)
- Localizing 3D cuboids in single-view images (2012) (110)
- Modifying the Memorability of Face Photographs (2013) (107)
- Turning Corners into Cameras: Principles and Methods (2017) (106)
- Random Lens Imaging (2006) (105)
- Cross-Modal Scene Networks (2016) (104)
- See, Hear, and Read: Deep Aligned Representations (2017) (102)
- Building a database of 3D scenes from user annotations (2009) (99)
- A Data-Driven Approach for Event Prediction (2010) (98)
- Learning human–environment interactions using conformal tactile textiles (2021) (98)
- Global semantic classification of scenes using power spectrum templates (1999) (95)
- Fixations on low-resolution images. (2010) (94)
- Propagation Networks for Model-Based Control Under Partial Observation (2018) (92)
- FPM: Fine Pose Parts-Based Model with 3D CAD Models (2014) (92)
- Part and appearance sharing: Recursive Compositional Models for multi-view (2010) (90)
- Revisiting the Importance of Individual Units in CNNs via Ablation (2018) (90)
- Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering (2020) (90)
- Evaluation of image features using a photorealistic virtual world (2011) (89)
- AVLnet: Learning Audio-Visual Language Representations from Instructional Videos (2020) (88)
- Rewriting a Deep Generative Model (2020) (87)
- EditGAN: High-Precision Semantic Image Editing (2021) (87)
- Semantic organization of scenes using discriminant structural templates (1999) (85)
- Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization (2021) (80)
- Learning Compositional Koopman Operators for Model-Based Control (2019) (80)
- Self-supervised Audio-visual Co-segmentation (2019) (80)
- The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement (2020) (80)
- Scene-Centered Description from Spatial Envelope Properties (2002) (80)
- Understanding Intra-Class Knowledge Inside CNN (2015) (78)
- Depth from Familiar Objects: A Hierarchical Model for 3D Scenes (2006) (77)
- 3D-Aware Scene Manipulation via Inverse Graphics (2018) (73)
- Learning to Zoom: a Saliency-Based Sampling Layer for Neural Networks (2018) (72)
- Foley Music: Learning to Generate Music from Videos (2020) (71)
- Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition (2016) (70)
- Connecting Touch and Vision via Cross-Modal Prediction (2019) (70)
- Open Vocabulary Scene Parsing (2017) (68)
- Looking Beyond the Visible Scene (2014) (68)
- Using AI and Social Media Multimodal Content for Disaster Response and Management: Opportunities, Challenges, and Future Directions (2020) (67)
- Creating and exploring a large photorealistic virtual space (2010) (66)
- Diverse Image Generation via Self-Conditioned GANs (2020) (66)
- Are all training examples equally valuable? (2013) (64)
- Is Saki #delicious?: The Food Perception Gap on Instagram and Its Relation to Health (2017) (64)
- Causal Discovery in Physical Systems from Videos (2020) (63)
- Learning to Simulate Dynamic Environments With GameGAN (2020) (61)
- Compositional Visual Generation with Composable Diffusion Models (2022) (59)
- Face-to-BMI: Using Computer Vision to Infer Body Mass Index on Social Media (2017) (59)
- Visualizing Object Detection Features (2015) (59)
- Learning to Act Properly: Predicting and Explaining Affordances from Images (2017) (56)
- Exploiting Occlusion in Non-Line-of-Sight Active Imaging (2017) (56)
- Through-Wall Human Mesh Recovery Using Radio Signals (2019) (56)
- Dataset Distillation by Matching Training Trajectories (2022) (55)
- Revealing hidden scenes by photon-efficient occlusion-based opportunistic active imaging. (2018) (55)
- Single Image Intrinsic Decomposition Without a Single Intrinsic Image (2018) (54)
- 3D Neural Scene Representations for Visuomotor Control (2021) (54)
- Pre-Trained Language Models for Interactive Decision-Making (2022) (53)
- Visualizing and Understanding Generative Adversarial Networks (Extended Abstract) (2019) (52)
- Following Gaze in Video (2017) (52)
- Accidental pinhole and pinspeck cameras: Revealing the scene outside the picture (2012) (51)
- Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning (2017) (49)
- An efficient neuromorphic analog network for motion estimation (1999) (47)
- Properties and applications of shape recipes (2003) (47)
- Inferring Light Fields from Shadows (2018) (44)
- Neural Turtle Graphics for Modeling City Road Layouts (2019) (43)
- Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration (2020) (42)
- Visual Grounding of Learned Physical Models (2020) (41)
- Contextual Influences on Saliency (2004) (40)
- Image memorability and visual inception (2012) (39)
- Predicting Motivations of Actions by Leveraging Text (2014) (39)
- Infinite Images: Creating and Exploring a Large Photorealistic Virtual Space (2008) (38)
- Skill Induction and Planning with Latent Language (2021) (38)
- Inferring the Why in Images (2014) (36)
- DriveGAN: Towards a Controllable High-Quality Neural Simulation (2021) (36)
- Learning to Learn with Compound HD Models (2011) (35)
- Estimating scene typicality from human ratings and image features (2011) (34)
- Notes on image annotation (2012) (34)
- Graphical Model For Recognizing Scenes and Objects. (2003) (33)
- Improving Inversion and Generation Diversity in StyleGAN using a Gaussianized Latent Space (2020) (33)
- Estimating Generalization under Distribution Shifts via Domain-Invariant Representations (2020) (33)
- SLAC: A Sparsely Labeled Dataset for Action Classification and Localization (2017) (32)
- Human Learning of Contextual Priors for Object Search: Where does the time go? (2005) (32)
- Paint by Word (2021) (32)
- Editing a classifier by rewriting its prediction rules (2021) (31)
- The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark Towards Physically Realistic Embodied AI (2021) (30)
- Detecting faces in impoverished images (2010) (30)
- Learning to See by Looking at Noise (2021) (30)
- Detecting natural disasters, damage, and incidents in the wild (2020) (30)
- Contextual Modulation of Target Saliency (2001) (29)
- GAN-Supervised Dense Visual Alignment (2021) (26)
- How to Make a Pizza: Learning a Compositional Layer-Based GAN Model (2019) (26)
- Accidental Pinhole and Pinspeck Cameras (2014) (25)
- Learning to Compose Visual Relations (2021) (24)
- Shared Features for Multiclass Object Detection (2006) (24)
- Next-generation deep learning based on simulators and synthetic data (2021) (24)
- Learning visual biases from human imagination (2014) (24)
- 3D Interpreter Networks for Viewer-Centered Wireframe Modeling (2018) (24)
- Real-Time Object Pose Estimation with Pose Interpreter Networks (2018) (23)
- PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning (2021) (23)
- Modeling and Analysis of Dynamic Behaviors of Web Image Collections (2010) (23)
- Deep Audio Priors Emerge From Harmonic Convolutional Networks (2020) (21)
- Synthesizing Environment-Aware Activities via Activity Sketches (2019) (21)
- Basic level scene understanding: categories, attributes and structures (2013) (21)
- Energy-Based Models for Continual Learning (2020) (21)
- Grounding Spoken Words in Unlabeled Video (2019) (20)
- BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations (2022) (20)
- Natural Language Descriptions of Deep Visual Features (2022) (20)
- Correcting Robot Plans with Natural Language Feedback (2022) (19)
- Shape Recipes: Scene Representations that Refer to the Image (2010) (19)
- Inverting and Visualizing Features for Object Detection (2012) (19)
- What Makes a Photograph Memorable? (2014) (18)
- Learning Neural Acoustic Fields (2022) (18)
- Robust Contrastive Learning against Noisy Views (2022) (18)
- ComPhy: Compositional Physical Reasoning of Objects and Events from Videos (2022) (18)
- Shape Anchors for Data-Driven Multi-view Reconstruction (2013) (17)
- Computer Vision in the Operating Room: Opportunities and Caveats (2021) (17)
- Guest Editorial: Big Data (2016) (14)
- Simultaneous detection and segmentation for generic objects (2011) (14)
- Shape from Sheen (2009) (14)
- Intelligent Carpet: Inferring 3D Human Pose from Tactile Signals (2021) (13)
- Object and scene recognition in tiny images (2010) (13)
- Denoised MDPs: Learning World Models Better Than the World Itself (2022) (13)
- Using Computer Vision to Study the Effects of BMI on Online Popularity and Weight-Based Homophily (2018) (12)
- From retinal circuits to motion processing: a neuromorphic approach to velocity estimation (1997) (12)
- Comparing the Interpretability of Deep Networks via Network Dissection (2019) (11)
- Disentangling visual and written concepts in CLIP (2022) (11)
- Basic level scene understanding: from labels to structure and beyond (2012) (11)
- ConceptFusion: Open-set Multimodal 3D Mapping (2023) (11)
- Deep Feedback Inverse Problem Solver (2021) (10)
- Virtual Correspondence: Humans as a Cue for Extreme-View Geometry (2022) (9)
- Learning Words by Drawing Images (2019) (9)
- Who is Mistaken? (2016) (9)
- Matching and Predicting Street Level Images (2010) (9)
- Measuring Generalization with Optimal Transport (2021) (8)
- Finding Fallen Objects Via Asynchronous Audio-Visual Integration (2022) (8)
- Learning Program Representations for Food Images and Cooking Recipes (2022) (8)
- Mapping human visual representations in space and time by neural networks. (2015) (8)
- How Little Do We Need for 3-D Shape Perception? (2011) (8)
- ShadowCam: Real-Time Detection of Moving Obstacles Behind A Corner For Autonomous Vehicles (2018) (8)
- Search for arbitrary objects in natural scenes is remarkably efficient (2010) (7)
- Composing Ensembles of Pre-trained Models via Iterative Consensus (2022) (7)
- A taxonomy of visual scenes: Typicality ratings and hierarchical classification (2010) (7)
- What You Can Learn by Staring at a Blank Wall (2021) (6)
- Toward a Visual Concept Vocabulary for GAN Latent Space (2021) (6)
- Saliency, objects and scenes: global scene factors in attention and object detection (2004) (6)
- Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions (2021) (6)
- Interpreting Visual Representations of Neural Networks via Network Dissection (2018) (6)
- Dynamic Modeling of Hand-Object Interactions via Tactile Sensing (2021) (6)
- LID 2020: The Learning from Imperfect Data Challenge Results (2020) (5)
- Following Gaze Across Views (2016) (5)
- Acquiring Visual Classifiers from Human Imagination (2014) (5)
- Wearable ImageNet: Synthesizing Tileable Textures via Dataset Distillation (2022) (5)
- Global Depth Perception from Familiar Scene Structure (2001) (5)
- A boosting approach for the simultaneous detection and segmentation of generic objects (2013) (5)
- MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning (2022) (5)
- Modeling visual search in a thousand scenes: The roles of saliency, target features, and scene context (2010) (4)
- A 2d + 3d rich data approach to scene understanding (2013) (4)
- Top-down control of visual attention in real world scenes (2010) (4)
- An ensemble prior of image structure for cross-modal inference (2005) (4)
- Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales (2021) (4)
- Debiasing Vision-Language Models via Biased Prompts (2023) (3)
- A systolic array with applications to image processing and wire-routing in VLSI circuits (1991) (3)
- LabelMe: Online Image Annotation and Applications By developing a publicly available tool that allows users to use the Internet to quickly and easily annotate images, the authors were able to collect many detailed image descriptions. (2010) (3)
- A Framework for Encoding Object-level Image Priors (2011) (3)
- Remodeling visual search: How gamma distributions can bring those boring old RTs to life (2010) (3)
- Counterfactual Image Networks (2018) (3)
- Height and Uprightness Invariance for 3D Prediction From a Single View (2020) (3)
- Scaling up instance annotation via label propagation (2021) (3)
- Noisy Agents: Self-supervised Exploration by Predicting Auditory Events (2020) (2)
- Trees and beyond: exploiting and improving tree-structured graphical models (2011) (2)
- Global Semantic Classi cation of Scenes using Power Spectrum Templates 3 Power Spectrum Families (1999) (2)
- FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation (2023) (2)
- Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction (2022) (2)
- Incidents1M: A Large-Scale Dataset of Images With Natural Disasters, Damage, and Incidents (2022) (2)
- Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence (2020) (2)
- Visualizing and Understanding GANs (2019) (2)
- LEARNING PHYSICAL DYNAMICS (2017) (2)
- Asymmetrical filters for vision chips: a basis for the design of large sets of spatial and spatiotemporal filters (1999) (2)
- Unsupervised Non-parametric Geospatial Modeling from Ground Imagery (2014) (2)
- OPEn: An Open-ended Physics Environment for Learning Without a Task (2021) (2)
- Nonparametric Bayesian Texture Learning and Synthesis (2009) (2)
- Self-Supervised Segmentation and Source Separation on Videos (2019) (2)
- Local Relighting of Real Scenes (2022) (1)
- Totems: Physical Objects for Verifying Visual Integrity (2022) (1)
- Guest Editorial: Generative Adversarial Networks for Computer Vision (2020) (1)
- Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps (2022) (1)
- ActionSense: A Multimodal Dataset and Recording Framework for Human Activities Using Wearable Sensors in a Kitchen Environment (2022) (1)
- Exemplar Network: A Generalized Mixture Model (2014) (1)
- Intrinsic and Extrinsic Effects on Image Memorability Supplemental Material (2015) (1)
- BT2: Backward-compatible Training with Basis Transformation (2022) (1)
- What's behind the box? Measuring scene context effects with Shannon's guessing game on indoor scenes (2010) (1)
- Reframing Explanation as an Interactive Medium: The EQUAS (Explainable QUestion Answering System) Project (2021) (1)
- How image statistics drive shape-from-texture and shape-from-specularity (2010) (1)
- Towards cognitive saliency: narrowing the gap to human performance (2017) (1)
- HARMONIC CONVOLUTIONAL NETWORKS (2020) (1)
- CHAPTER 96 Contextual Influences on Saliency (2004) (1)
- Publisher Correction: Learning human–environment interactions using conformal tactile textiles (2021) (1)
- Matrix ? Why does Cypher betray Morpheus ? How does the movie end ? (2016) (0)
- Detecting Everything in the Open World: Towards Universal Object Detection (2023) (0)
- W ATCH -A ND -H ELP : A C HALLENGE FOR S OCIAL P ER CEPTION AND H UMAN -AI C OLLABORATION (2021) (0)
- Accidental Cameras. Revealing the scene outside the picture. (2013) (0)
- Accidental Pinhole and Pinspeck Cameras (2014) (0)
- A Latent Variable Ranking Model for Content-Based Retrieval (2012) (0)
- Supplementary Material: EditGAN: High-Precision Semantic Image Editing (2021) (0)
- Global statistical features and early scene interpretation (2010) (0)
- SUPPLEMENTAL MATERIAL : Where should saliency models look next ? (2016) (0)
- Deep Neural Networks explain spatio-temporal dynamics of visual scene and object processing (2016) (0)
- A data-driven approach for even prediction (2010) (0)
- Depth perception from familiar scene structure (2010) (0)
- Aliasing is a Driver of Adversarial Attacks (2022) (0)
- SUN Database: Exploring a Large Collection of Scene Categories (2014) (0)
- Benchmarking Convolutional Neural Networks for Object Segmentation and Pose Estimation (2017) (0)
- Minimal complexity velocity-tuned filters with analogue neuromorphic networks: A theoretical approach for efficient design (1998) (0)
- Shared ' Cross + Modal ' Representation religious , * church , * plants (2016) (0)
- Modifying a face to make it more memorable or forgettable (2014) (0)
- Object Detection and Contextual Priming (2002) (0)
- Open-vocabulary Panoptic Segmentation with Embedding Modulation (2023) (0)
- Section 16.2 Classifying Images of Single Objects 504 16.2 Classifying Images of Single Objects (0)
- The Role of Embedding Complexity in Domain-invariant Representations (2019) (0)
- Reframing Explanation as an Interactive Medium: The EQUAS (Explainable QUestion Answering System) Project (2021) (0)
- Learning the signatures of the human grasp using a scalable tactile glove (2019) (0)
- Benchmarking Convolutional Neural Networks for Object Segmentation and Pose Estimation (2017) (0)
- On the Units of GANs (2020) (0)
- Predicting the future (2011) (0)
- Preprint prepared for ArXiv submission (2018) (0)
- O BJECT DETECTORS EMERGE IN D EEP S CENE CNN S (2015) (0)
- Inferring Regional and Temporal Eating Habits from Social Media Images (2016) (0)
- Quantifying Interpretations of Deep Visual Representations (2017) (0)
- MIT Open Access Articles Accidental Pinhole and Pinspeck Cameras (2022) (0)
- Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning (2023) (0)
- Role of Low-level Mechanisms in Brightness Perception (2010) (0)
- ON ( plate , table ) : 2 ON ( glass , table ) : 1 ON ( fork , table ) : 1 Ground-truth Goal VirtualHome-Social Task Demonstration (2021) (0)
- Vocabulary Scene Parsing (0)
- Predicting object and scene descriptions with an information-theoretic model of pragmatics (2010) (0)
- Supplementary Material: Deep Feedback Inverse Problem Solver (2020) (0)
- Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos (2023) (0)
- SCENE RECOGNITION WITH BAG OF WORDS (2016) (0)
- Modeling Context Effects on Image Memorability (2015) (0)
- 2 SUNS ’ 08 Remembering thousands of natural images with high fidelity (2008) (0)
- Scene categorization and detection: the power of global features (2010) (0)
- Procedural Image Programs for Representation Learning (2022) (0)
- 3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes (2023) (0)
- Goals Inferring the Why in Images (2015) (0)
- Interaction of contour, shading and texture in natural images (2010) (0)
- Transfer Matrix : A x ( b ) Sample Estimation Gain Image Direction of Light Integration W al l 1-D H idden Scene : Positive Negative ✓ = 0 ( a ) Constructing Transfer Matrix (2017) (0)
- Generalizing Dataset Distillation via Deep Generative Prior (2023) (0)
- Quantifying Context Effects on Image Memorability. (2015) (0)
- Spatio-chromatic processing in the human retina: towards an optimal trade-off between spatial resolution of luminance and range colour perception (1998) (0)
- NOPA: Neurally-guided Online Probabilistic Assistance (2022) (0)
- Not all scene categories are created equal: The role of object and layout diagnosticity in scene gist understanding (2010) (0)
- Visualizing Object Detection Features (2016) (0)
- NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants (2023) (0)
- Recognition with purely 3D information (2010) (0)
- Comparing the Interpretability of the Deep Visual Representations via Network Dissection (2017) (0)
- Asymmetrical Filters for Vision Chips : a Basis for the Design ofLarge Sets of Spatial and Spatiotemporal (1999) (0)
- NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models (2023) (0)
- Object Detectors Emerge from Training CNNs for Scene Recognition (2015) (0)
- Shape from sheen 1 Shape from sheen (2009) (0)
- Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning (2018) (0)
- The Power of Averaging (2006) (0)
- Modeling contextual influences on object recognition (2010) (0)
- Understanding and Estimating the Adaptability of Domain-Invariant Representations (2020) (0)
- Kitchen Units for Food Units for Table . . . Past Future Predictor (2015) (0)
- Dataset Distillation by Matching Training Trajectories (2022) (0)
- 3D Interpreter Networks for Viewer-Centered Wireframe Modeling (2018) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Antonio B. Torralba?
Antonio B. Torralba is affiliated with the following schools: