Silvio Savarese
#103,346
Most Influential Person Now
Academic
Silvio Savarese's AcademicInfluence.com Rankings
Silvio Savareseengineering Degrees
Engineering
#2753
World Rank
#3727
Historical Rank
Robotics
#65
World Rank
#67
Historical Rank
Electrical Engineering
#541
World Rank
#596
Historical Rank
Silvio Savaresecomputer-science Degrees
Computer Science
#3726
World Rank
#3916
Historical Rank
Database
#1014
World Rank
#1067
Historical Rank
Download Badge
Engineering Computer Science
Silvio Savarese's Degrees
- Bachelors Electrical Engineering University of Padua
Why Is Silvio Savarese Influential?
(Suggest an Edit or Addition)Silvio Savarese's Published Works
Published Works
- ShapeNet: An Information-Rich 3D Model Repository (2015) (3399)
- Social LSTM: Human Trajectory Prediction in Crowded Spaces (2016) (1989)
- Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression (2019) (1860)
- 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction (2016) (1336)
- Deep Metric Learning via Lifted Structured Feature Embedding (2015) (1210)
- Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks (2018) (1182)
- Learning to Track at 100 FPS with Deep Regression Networks (2016) (1109)
- 3D Semantic Parsing of Large-Scale Indoor Spaces (2016) (1041)
- Active Learning for Convolutional Neural Networks: A Core-Set Approach (2017) (1034)
- Structural-RNN: Deep Learning on Spatio-Temporal Graphs (2015) (882)
- Taskonomy: Disentangling Task Transfer Learning (2018) (850)
- 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks (2019) (819)
- Beyond PASCAL: A benchmark for 3D object detection in the wild (2014) (689)
- Joint 2D-3D-Semantic Data for Indoor Scene Understanding (2017) (612)
- Learning to Track: Online Multi-object Tracking by Decision Making (2015) (596)
- Recognizing human actions by attributes (2011) (583)
- DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion (2019) (583)
- SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints (2018) (574)
- Learning Social Etiquette: Human Trajectory Understanding In Crowded Scenes (2016) (547)
- SEGCloud: Semantic Segmentation of 3D Point Clouds (2017) (542)
- Gibson Env: Real-World Perception for Embodied Agents (2018) (536)
- Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies (2017) (484)
- Generalizing to Unseen Domains via Adversarial Data Augmentation (2018) (468)
- 3D generic object categorization, localization and pose estimation (2007) (450)
- Evaluation of image-based modeling and laser scanning accuracy for emerging automated performance monitoring techniques (2011) (337)
- A Unified Framework for Multi-target Tracking and Collective Activity Recognition (2012) (333)
- Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks (2019) (320)
- Universal Correspondence Network (2016) (311)
- A Hierarchical Representation for Future Action Prediction (2014) (310)
- Cross-view action recognition via view knowledge transfer (2011) (282)
- Data-driven 3D Voxel Patterns for object category recognition (2015) (280)
- Application of D4AR - A 4-Dimensional augmented reality model for automating construction progress monitoring data collection, processing and communication (2009) (274)
- Automated Progress Monitoring Using Unordered Daily Construction Photographs and IFC-Based Building Information Models (2015) (266)
- Subcategory-Aware Convolutional Neural Networks for Object Proposals and Detection (2016) (265)
- What are they doing? : Collective activity classification using spatio-temporal relationship among people (2009) (263)
- Which Tasks Should Be Learned Together in Multi-task Learning? (2019) (255)
- Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks (2018) (255)
- Learning Transferrable Representations for Unsupervised Domain Adaptation (2016) (245)
- Discriminative Object Class Models of Appearance and Shape by Correlatons (2006) (244)
- Learning context for collective activity recognition (2011) (243)
- ObjectNet3D: A Large Scale Database for 3D Object Recognition (2016) (234)
- Learning an Image-Based Motion Context for Multiple People Tracking (2014) (230)
- TopNet: Structural Point Cloud Decoder (2019) (220)
- Spatial-Temporal correlatons for unsupervised action classification (2008) (212)
- Articulated part-based model for joint object detection and pose estimation (2011) (205)
- A General Framework for Tracking Multiple People from a Moving Camera (2013) (199)
- Recurrent Autoregressive Networks for Online Multi-object Tracking (2017) (196)
- Adversarial Feature Augmentation for Unsupervised Domain Adaptation (2017) (193)
- Toward automated generation of parametric BIMs based on hybrid video and laser scanning data (2010) (186)
- Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories (2009) (184)
- Understanding Indoor Scenes Using 3D Geometric Phrases (2013) (181)
- Automatic Targetless Extrinsic Calibration of a 3D Lidar and Camera by Maximizing Mutual Information (2012) (180)
- Feedback Networks (2016) (176)
- Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery (2010) (170)
- Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition (2016) (168)
- Integrated Sequential As-Built and As-Planned Representation with D4AR Tools in Support of Decision-Making Tasks in the AEC/FM Industry (2011) (168)
- Neural Task Programming: Learning to Generalize Across Hierarchical Tasks (2017) (157)
- Semantic structure from motion (2011) (155)
- 3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera (2019) (154)
- Multiple Target Tracking in World Coordinate with Single, Minimally Calibrated Camera (2010) (148)
- Automatic Extrinsic Calibration of Vision and Lidar by Maximizing Mutual Information (2015) (146)
- Learning task-oriented grasping for tool manipulation from simulated self-supervision (2018) (144)
- ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation (2018) (144)
- Detecting and tracking people using an RGB-D camera via multiple detector fusion (2011) (139)
- Comparing image classification methods: K-nearest-neighbor and support-vector-machines (2012) (138)
- A multi-view probabilistic model for 3D object classes (2009) (137)
- Lattice Long Short-Term Memory for Human Action Recognition (2017) (136)
- Understanding Collective Activitiesof People from Videos (2014) (135)
- 3D Scene Understanding by Voxel-CRF (2013) (132)
- SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark (2018) (125)
- Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations (2017) (124)
- Interactive Gibson Benchmark: A Benchmark for Interactive Navigation in Cluttered Environments (2019) (123)
- Watch-n-patch: Unsupervised understanding of actions and relations (2015) (123)
- CAR-Net: Clairvoyant Attentive Recurrent Network (2017) (123)
- Estimating the aspect layout of object categories (2012) (123)
- Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks (2019) (122)
- Extrinsic Calibration of a 3D Laser Scanner and an Omnidirectional Camera (2010) (119)
- Dense Object Reconstruction with Semantic Priors (2013) (115)
- Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings (2018) (114)
- Semantic structure from motion with points, regions, and objects (2012) (113)
- iGibson 1.0: A Simulation Environment for Interactive Tasks in Large Realistic Scenes (2020) (111)
- Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration (2018) (110)
- DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes (2016) (107)
- Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks (2019) (105)
- Research in Visualization Techniques for Field Construction (2011) (105)
- Weakly Supervised 3D Reconstruction with Adversarial Constraint (2017) (102)
- Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks (2019) (99)
- Knowledge Transfer for Scene-Specific Motion Prediction (2016) (98)
- Toward coherent object detection and scene layout understanding (2010) (97)
- What Matters in Learning from Offline Human Demonstrations for Robot Manipulation (2021) (95)
- DeformNet: Free-Form Deformation Network for 3D Shape Reconstruction from a Single Image (2017) (92)
- 6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints (2019) (90)
- Deformable part models revisited: A performance evaluation for object category pose estimation (2011) (89)
- Local analysis for 3D reconstruction of specular surfaces (2001) (89)
- Unsupervised Semantic Parsing of Video Collections (2015) (86)
- Local Shape from Mirror Reflections (2005) (84)
- Generic 3D Representation via Pose Estimation and Matching (2016) (82)
- A Conversational Paradigm for Program Synthesis (2022) (77)
- Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by Clutter (2019) (77)
- A Behavioral Approach to Visual Navigation with Graph Localization Networks (2019) (77)
- A coarse-to-fine model for 3D pose estimation and sub-category recognition (2015) (76)
- Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses (2013) (73)
- Action Recognition by Hierarchical Mid-Level Action Elements (2015) (73)
- Structured Recurrent Temporal Restricted Boltzmann Machines (2014) (72)
- MEVBench: A mobile computer vision benchmarking suite (2011) (71)
- 3D Reconstruction by Shadow Carving: Theory and Practical Evaluation (2007) (70)
- Demo2Vec: Reasoning Object Affordances from Online Videos (2018) (69)
- Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View (2017) (68)
- iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks (2021) (68)
- Discovering Groups of People in Images (2014) (67)
- View Synthesis for Recognizing Unseen Poses of Object Classes (2008) (66)
- Combining 3D Shape, Color, and Motion for Robust Anytime Tracking (2014) (65)
- Deep Learning Under Privileged Information Using Heteroscedastic Dropout (2018) (65)
- Embodied intelligence via learning and evolution (2021) (65)
- ReLMoGen: Integrating Motion Generation in Reinforcement Learning for Mobile Manipulation (2020) (63)
- Deep Visual MPC-Policy Learning for Navigation (2019) (61)
- KETO: Learning Keypoint Representations for Tool Manipulation (2019) (60)
- Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55 (2017) (58)
- Monitoring changes of 3D building elements from unordered photo collections (2011) (57)
- BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models (2023) (56)
- Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies (2018) (56)
- Multimodal Video Indexing and Retrieval Using Directed Information (2012) (56)
- HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators (2019) (55)
- Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations (2020) (55)
- Semantic Cross-View Matching (2015) (53)
- Topological Planning with Transformers for Vision-and-Language Navigation (2020) (51)
- Learning to Navigate Using Mid-Level Visual Priors (2019) (51)
- JRMOT: A Real-Time 3D Multi-Object Tracker and a New Large-Scale Dataset (2020) (51)
- GONet: A Semi-Supervised Deep Learning Approach For Traversability Estimation (2018) (50)
- Causal Induction from Visual Observations for Goal Directed Tasks (2019) (50)
- Deep Local Trajectory Replanning and Control for Robot Navigation (2019) (49)
- Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation (2021) (49)
- Learning to Predict Human Behavior in Crowded Scenes (2017) (48)
- BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments (2021) (47)
- Generative Sparse Detection Networks for 3D Single-shot Object Detection (2020) (46)
- Visually bootstrapped generalized ICP (2011) (45)
- A Probabilistic Framework for Real-time 3D Segmentation using Spatial, Temporal, and Semantic Cues (2016) (45)
- An efficient branch-and-bound algorithm for optimal human pose estimation (2012) (45)
- Situational Fusion of Visual Representation for Visual Navigation (2019) (44)
- What do reflections tell us about the shape of a mirror? (2004) (44)
- Machine Vision for Natural Gas Methane Emissions Detection Using an Infrared Camera (2019) (44)
- Representations and Techniques for 3D Object Recognition and Scene Interpretation (2011) (43)
- Shadow carving (2001) (43)
- Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation (2019) (40)
- Semantic Parsing of Large-Scale Indoor Spaces (2016) (40)
- CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis (2022) (40)
- Enriching object detection with 2D-3D registration and continuous viewpoint estimation (2015) (40)
- Cracking open the DNN black-box: Video Analytics with DNNs across the Camera-Cloud Boundary (2019) (39)
- Robust real-time tracking combining 3D shape, color, and motion (2016) (39)
- Interactive Visual Construction Progress Monitoring with D 4 AR — 4D Augmented Reality — Models (2009) (39)
- Goal-Aware Prediction: Learning to Model What Matters (2020) (39)
- JRDB: A Dataset and Benchmark of Egocentric Robot Visual Perception of Humans in Built Environments. (2021) (38)
- Free your Camera: 3D Indoor Scene Understanding from Arbitrary Camera Motion (2013) (37)
- Deep Affordance Foresight: Planning Through What Can Be Done in the Future (2020) (37)
- Indoor Scene Understanding with Geometric and Semantic Contexts (2015) (37)
- Monocular Multiview Object Tracking with 3D Aspect Parts (2014) (36)
- Object Co-detection (2012) (36)
- A Geometric Approach to Active Learning for Convolutional Neural Networks (2017) (35)
- EFFEX: An embedded processor for computer vision based feature extraction (2011) (35)
- Long-term path prediction in urban scenarios using circular distributions (2018) (34)
- Understanding the 3D layout of a cluttered room from multiple images (2014) (34)
- Robust single-view instance recognition (2016) (34)
- Automated Model-Based Recognition of Progress Using Daily Construction Photographs and IFC-Based 4D Models (2010) (34)
- Human-in-the-Loop Imitation Learning using Remote Teleoperation (2020) (33)
- Layout Estimation of Highly Cluttered Indoor Scenes Using Geometric and Semantic Cues (2013) (32)
- Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning (2019) (31)
- Robust object pose estimation via statistical manifold modeling (2011) (31)
- Weakly Supervised Generative Adversarial Networks for 3D Reconstruction (2017) (30)
- Deep Learning for Single-View Instance Recognition (2015) (30)
- Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation (2020) (30)
- Weakly Supervised Learning of Mid-Level Features with Beta-Bernoulli Process Restricted Boltzmann Machines (2013) (30)
- Robot Navigation in Constrained Pedestrian Environments using Reinforcement Learning (2020) (29)
- Monitoring of Construction Performance Using Daily Progress Photograph Logs and 4D As-Planned Models (2009) (28)
- Object Detection by 3D Aspectlets and Occlusion Reasoning (2013) (28)
- Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter (2020) (27)
- Find the Best Path: An Efficient and Accurate Classifier for Image Hierarchies (2013) (27)
- Multimodal Sensor Fusion with Differentiable Filters (2020) (27)
- Regression Planning Networks (2019) (27)
- Detecting Specular Surfaces on Natural Images (2007) (27)
- AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers (2019) (27)
- Watch-n-Patch: Unsupervised Learning of Actions and Relations (2016) (26)
- CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (2022) (25)
- Breaking the Chain: Liberation from the Temporal Markov Assumption for Tracking Human Poses (2013) (25)
- Pose Estimation Errors, the Ultimate Diagnosis (2016) (25)
- Multi-view Object Categorization and Pose Estimation (2010) (25)
- Improving Social Awareness Through DANTE: Deep Affinity Network for Clustering Conversational Interactants (2019) (24)
- Recovering Local Shape of a Mirror Surface from Reflection of a Regular Grid (2004) (24)
- VUNet: Dynamic Scene View Synthesis for Traversability Estimation Using an RGB Camera (2018) (24)
- Relating Things and Stuff via ObjectProperty Interactions (2014) (23)
- Translating Navigation Instructions in Natural Language to a High-Level Plan for Behavioral Robot Navigation (2018) (23)
- Forecasting Social Navigation in Crowded Complex Scenes (2016) (22)
- image2mass: Estimating the Mass of an Object from Its Image (2017) (21)
- LASER: Learning a Latent Action Space for Efficient Reinforcement Learning (2021) (21)
- ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems (2017) (20)
- TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild (2021) (19)
- Deep View Morphing (2017) (19)
- Efficient and Exact MAP-MRF Inference using Branch and Bound (2012) (19)
- To Go or Not To Go? A Near Unsupervised Learning Approach For Robot Navigation (2017) (18)
- Object Detection with Geometrical Context Feedback Loop (2010) (18)
- Relating Things and Stuff by High-Order Potential Modeling (2012) (17)
- Merlion: A Machine Learning Library for Time Series (2021) (17)
- Interactive Gibson: A Benchmark for Interactive Navigation in Cluttered Environments (2019) (17)
- ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation (2022) (17)
- JRDB: A Dataset and Benchmark for Visual Perception for Navigation in Human Environments (2019) (17)
- Learning Multi-Arm Manipulation Through Collaborative Teleoperation (2020) (16)
- Hierarchical classification of images by sparse approximation (2013) (16)
- Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Active Tasks (2018) (16)
- D4ar- 4 dimensional augmented reality - models or automation and interactive visualization of construction progress monitoring (2010) (15)
- Adaptive Procedural Task Generation for Hard-Exploration Problems (2020) (15)
- Watch-Bot: Unsupervised learning for reminding humans of forgotten actions (2015) (15)
- Semantic and Geometric Modeling with Neural Message Passing in 3D Scene Graphs for Hierarchical Mechanical Search (2020) (15)
- Long Document Summarization with Top-down and Bottom-up Inference (2022) (15)
- Sample-Efficient Safety Assurances using Conformal Prediction (2021) (14)
- Relating Things and Stuff via Object Property Interactions. (2013) (14)
- Point-based path prediction from polar histograms (2016) (14)
- Toward mutual information based automatic registration of 3D point clouds (2012) (14)
- Unsupervised Object Pose Classification from Short Video Sequences (2009) (13)
- Object Detection using Geometrical Context Feedback (2012) (13)
- EVA: An efficient vision architecture for mobile systems (2013) (12)
- VideoGasNet: Deep Learning for Natural Gas Methane Leak Classification Using an Infrared Camera (2021) (12)
- Mobile object detection through client-server based vote transfer (2012) (12)
- Scene Semantic Reconstruction from Egocentric RGB-D-Thermal Videos (2017) (12)
- Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training (2022) (11)
- Video scene categorization by 3D hierarchical histogram matching (2009) (11)
- Reflections on praxis and facture in a devotional portrait diptych: a computer analysis of the mirror in Hans Memling's Virgin and Child and Maarten van Nieuwenhove (2008) (11)
- A Bayesian generative model for learning semantic hierarchies (2014) (10)
- Implementation of a shadow carving system for shape capture (2002) (10)
- Semantic structure from motion with object and point interactions (2011) (10)
- Gibson Env V2: Embodied Simulation Environments for Interactive Navigation (2019) (9)
- Second order local analysis for 3D reconstruction of specular surfaces (2002) (9)
- CS231A Course Notes 1: Camera Models (2017) (9)
- Sparse Reconstruction and Geo-Registration of Site Photographs for As-Built Construction Representation and Automatic Progress Data Collection (2009) (8)
- Retrospectives on the Embodied AI Workshop (2022) (8)
- Model-Based Object Recognition (2014) (8)
- Generalization Through Hand-Eye Coordination: An Action Space for Learning Spatially-Invariant Visuomotor Control (2021) (8)
- Toward Automatic 3D Generic Object Modeling from One Single Image (2011) (8)
- Biological data annotation via a human-augmenting AI-based labeling system (2021) (8)
- Unsupervised camera localization in crowded spaces (2017) (8)
- Supplemental Material : Understanding Indoor Scenes using 3 D Geometric Phrases (2013) (8)
- Object detection, shape recovery, and 3D modelling by depth-encoded hough voting (2013) (8)
- Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation (2021) (7)
- Behavioral Indoor Navigation With Natural Language Directions (2018) (7)
- Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration (2021) (7)
- JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection (2021) (7)
- Label transfer exploiting three-dimensional structure for semantic segmentation (2013) (6)
- Toward mutual information based place recognition (2014) (6)
- LAVIS: A Library for Language-Vision Intelligence (2022) (6)
- Masked Unsupervised Self-training for Zero-shot Image Classification (2022) (6)
- Unsupervised Transductive Domain Adaptation (2016) (6)
- Shape reconstruction from shadows and reflections (2005) (6)
- Testing of Depth-Encoded Hough Voting for Infrastructure Object Detection (2012) (5)
- OmniXAI: A Library for Explainable AI (2022) (5)
- Special issue on 3D representation for object and scene recognition (2009) (5)
- BEHAVIOR-1K: A Benchmark for Embodied AI with 1, 000 Everyday Activities and Realistic Simulation (2022) (5)
- GONet++: Traversability Estimation via Dynamic Scene View Synthesis (2018) (5)
- The Group and Crowd Analysis Interdisciplinary Challenge (2017) (5)
- How Trustworthy are the Existing Performance Evaluations for Basic Vision Tasks? (2020) (5)
- Adversarially Robust Policy Learning through Active Construction of Physically-Plausible Perturbations (2017) (5)
- Semantic Structure from Motion: A Novel Framework for Joint Object Recognition and 3D Reconstruction (2011) (5)
- ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding (2022) (5)
- How Trustworthy are Performance Evaluations for Basic Vision Tasks? (2020) (5)
- Generic 3 D Representation via Pose Estimation and Matching supplementary material (2016) (4)
- Localized Calibration: Metrics and Recalibration (2021) (4)
- Probabilistic Visual Navigation with Bidirectional Image Prediction (2020) (4)
- Unsupervised Semantic Action Discovery from Video Collections (2016) (4)
- Discovering Generalizable Skills via Automated Generation of Diverse Tasks (2021) (3)
- Remote assessment of pre- And post-disaster critical physical infrastructures using mobile workstation chariot and D4AR models (2019) (3)
- A Bayesian Approach to Tracking Learning Detection (2013) (3)
- Leveraging Pretrained Image Classifiers for Language-Based Segmentation (2019) (3)
- Workshop on Machine Learning for Autonomous Vehicles 2017 (2)
- Coupled Recurrent Network (CRN) (2018) (2)
- JRDB-Act: A Large-scale Multi-modal Dataset for Spatio-temporal Action, Social Group and Activity Detection (2021) (2)
- Semantic Parsing of Large-Scale Indoor Spaces Supplementary Material (2016) (2)
- Recognizing Complex Human Activities via Crowd Context (2014) (2)
- Scene Understanding for the Visually Impaired Using Visual Sonification by Visual Feature Analysis and Auditory Signatures (2012) (2)
- Local calibration: metrics and recalibration (2021) (2)
- Supplementary Material for “ Data-Driven 3 D Voxel Patterns for Object Category Recognition ” (2015) (1)
- Model-based detection of progress using D4AR models generated by daily site photologs and building information models (2019) (1)
- Human Centred Object Co-Segmentation (2016) (1)
- MVSS: Michigan Visual Sonification System (2012) (1)
- A Discriminative Model for Learning Semantic and Geometric Interactions in Indoor Scenes (2013) (1)
- Shrinkage Optimized Directed Information using Pictorial Structures for Action Recognition (2014) (1)
- Classification of Satellite Images based on Scale-Invariant Feature Transform (2012) (1)
- CS 231 A Course Notes 2 : Single View Metrology (2017) (1)
- SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning (2019) (1)
- Technical Report: Articulated Part-based Model for Joint Object Detection and Pose Estimation (2011) (1)
- Computer Vision: From 3d Reconstruction to Visual Recognition (2020) (1)
- CS 231 A Course Notes 4 : Stereo Systems and Structure from Motion (2017) (1)
- Can we see the shape of a mirror (2010) (1)
- Interactive Pedestrian Simulation in iGibson (2021) (1)
- Linear Artificial Forces for Human Dynamics in Complex Contexts (2020) (1)
- Online Distribution Shift Detection via Recency Prediction (2022) (1)
- Multi-target tracking with context from interaction feature strings (2014) (1)
- CEC : Research in visualization techniques for field construction (2010) (1)
- Privacy Preserving Recalibration under Domain Shift (2020) (1)
- Supplementary Material for the Paper “ Enriching Object Detection with 2 D-3 D Registration and Continuous Viewpoint Estimation ” (2015) (1)
- Time-Varying Interaction Estimation Using Ensemble Methods (2019) (1)
- AR – A 4-DIMENSIONAL AUGMENTED REALITY MODEL FOR AUTOMATING CONSTRUCTION (2009) (1)
- Supplemental Material : Discovering Groups of People in Images (2014) (0)
- Shape from Specularities (2014) (0)
- Indoor Scene Understanding with Geometric and Semantic Contexts (2014) (0)
- Minkowski Tracker: A Sparse Spatio-Temporal R-CNN for Joint Object Detection and Tracking (2022) (0)
- Feedback based Neural Networks (2016) (0)
- Carving from Ray-Tracing Constraints: IRT-Carving (2006) (0)
- Masked Unsupervised Self-training for Label-free Image Classification (2022) (0)
- ICRA 2019 Best Paper Award Recipients Announced [Society News] (2019) (0)
- Neural Architecture Search From Fréchet Task Distance (2021) (0)
- CS 231 A Course Notes 5 : Active and Volumetric Stereo (2017) (0)
- Perceiving the 3D World from Images (2013) (0)
- Object Pose Dataset using Discriminatively Trained Deformable Part Models (2014) (0)
- Procedure-Aware Pretraining for Instructional Video Understanding (2023) (0)
- Towards Grasp Transfer using Shape Deformation (2017) (0)
- Unsupervised Activity Learning and Parsing Learned Action 1 : Selected Visual Atoms : Selected Language Atoms (2018) (0)
- Editorial of Special Issue on Shape Representations Meet Visual Recognition (2015) (0)
- Network learns generic object tracking Neural Network Test : Frozen weights Current frame Network tracks novel objects ( no finetuning ) (2016) (0)
- Sparse Representation of Multimodality Sensing Databases for Data Mining and Retrieval (2015) (0)
- Multi-view Indoor Spatial Layout Estimation Yuanfang ( Yolanda ) (2016) (0)
- 2D AND 3D VISUAL RECOGNITION: APPROACHES AND METHODS (2011) (0)
- HIVE: Harnessing Human Feedback for Instructional Visual Editing (2023) (0)
- WS14: Challenges and opportunities in robot perception (2011) (0)
- Eye-BEHAVIOR: An Eye-Tracking Dataset for Everyday Household Activities in Virtual, Interactive, and Ecological Environments (2022) (0)
- Environment-aware Pedestrian Trajectory Prediction for Autonomous Driving (2020) (0)
- Supplementary Materials for Generative Sparse Detection Networks (2020) (0)
- Learning Hierarchical Linguistic Descriptions of Visual Datasets (2013) (0)
- CodeGen2: Lessons for Training LLMs on Programming and Natural Languages (2023) (0)
- Model-Based Object Recognition: Traditional Approach (2020) (0)
- Localizing Against Drawn Maps via Spline-Based Registration (2020) (0)
- Visual Pattern Recognition Models of Infrastructure Elements vs. Depth-Encoded Hough Voting (2012) (0)
- Generating Procedural 3D materials from Images using Neural Networks (2022) (0)
- ConvNet の 機能学習 MultiView ConvNet Feature Learning for Keypoint Detection and Matching ジュンフュン クオン (2017) (0)
- Why do we see some surfaces as reflective (2010) (0)
- Hierarchical Task Generalization with Neural Programs (2017) (0)
- Reality Capturing and Modeling with Visual and Spatial Sensing (2009) (0)
- When are reflections useful in perceiving the shape of shiny surfaces (2010) (0)
- Query Image Horizon + Semantic Segments GIS Map Recti & ied View Overlapping Tiles System Input Descriptor Extraction DescriptorN Matching (0)
- Best-k Search Algorithm for Neural Text Generation (2022) (0)
- Imitation Learning Init Demo Existing One-Shot Approaches Policy Networks Action : Pick ( A ) Symbol Gnding Networks (2019) (0)
- Learning a Visual State Representation for Generative Adversarial Imitation Learning (2017) (0)
This paper list is powered by the following services:
Other Resources About Silvio Savarese
What Schools Are Affiliated With Silvio Savarese?
Silvio Savarese is affiliated with the following schools: