Silvio Savarese

Q: What Schools Are Affiliated With Silvio Savarese?

Silvio Savarese is affiliated with the following schools: Stanford University, University of Padua, and Virginia Tech.

Silvio Savarese's AcademicInfluence.com Rankings

Engineering: World Rank #2753, Historical Rank #3727
Robotics: World Rank #65, Historical Rank #67
Electrical Engineering: World Rank #541, Historical Rank #596

Computer Science: World Rank #3726, Historical Rank #3916
Database: World Rank #1014, Historical Rank #1067

Silvio Savarese's Degrees

Bachelor's in Electrical Engineering, University of Padua

Why Is Silvio Savarese Influential?

Silvio Savarese's Published Works

[Chart: number of citations in a given year to any of this author's works]

[Chart: total number of citations to the works the author published in a given year, which highlights the author's most important publications]

Published Works (each entry lists the publication year, then the citation count)

ShapeNet: An Information-Rich 3D Model Repository (2015) (3399)
Social LSTM: Human Trajectory Prediction in Crowded Spaces (2016) (1989)
Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression (2019) (1860)
3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction (2016) (1336)
Deep Metric Learning via Lifted Structured Feature Embedding (2015) (1210)
Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks (2018) (1182)
Learning to Track at 100 FPS with Deep Regression Networks (2016) (1109)
3D Semantic Parsing of Large-Scale Indoor Spaces (2016) (1041)
Active Learning for Convolutional Neural Networks: A Core-Set Approach (2017) (1034)
Structural-RNN: Deep Learning on Spatio-Temporal Graphs (2015) (882)
Taskonomy: Disentangling Task Transfer Learning (2018) (850)
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks (2019) (819)
Beyond PASCAL: A benchmark for 3D object detection in the wild (2014) (689)
Joint 2D-3D-Semantic Data for Indoor Scene Understanding (2017) (612)
Learning to Track: Online Multi-object Tracking by Decision Making (2015) (596)
Recognizing human actions by attributes (2011) (583)
DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion (2019) (583)
SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints (2018) (574)
Learning Social Etiquette: Human Trajectory Understanding In Crowded Scenes (2016) (547)
SEGCloud: Semantic Segmentation of 3D Point Clouds (2017) (542)
Gibson Env: Real-World Perception for Embodied Agents (2018) (536)
Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies (2017) (484)
Generalizing to Unseen Domains via Adversarial Data Augmentation (2018) (468)
3D generic object categorization, localization and pose estimation (2007) (450)
Evaluation of image-based modeling and laser scanning accuracy for emerging automated performance monitoring techniques (2011) (337)
A Unified Framework for Multi-target Tracking and Collective Activity Recognition (2012) (333)
Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks (2019) (320)
Universal Correspondence Network (2016) (311)
A Hierarchical Representation for Future Action Prediction (2014) (310)
Cross-view action recognition via view knowledge transfer (2011) (282)
Data-driven 3D Voxel Patterns for object category recognition (2015) (280)
Application of D4AR - A 4-Dimensional augmented reality model for automating construction progress monitoring data collection, processing and communication (2009) (274)
Automated Progress Monitoring Using Unordered Daily Construction Photographs and IFC-Based Building Information Models (2015) (266)
Subcategory-Aware Convolutional Neural Networks for Object Proposals and Detection (2016) (265)
What are they doing?: Collective activity classification using spatio-temporal relationship among people (2009) (263)
Which Tasks Should Be Learned Together in Multi-task Learning? (2019) (255)
Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks (2018) (255)
Learning Transferrable Representations for Unsupervised Domain Adaptation (2016) (245)
Discriminative Object Class Models of Appearance and Shape by Correlatons (2006) (244)
Learning context for collective activity recognition (2011) (243)
ObjectNet3D: A Large Scale Database for 3D Object Recognition (2016) (234)
Learning an Image-Based Motion Context for Multiple People Tracking (2014) (230)
TopNet: Structural Point Cloud Decoder (2019) (220)
Spatial-Temporal correlatons for unsupervised action classification (2008) (212)
Articulated part-based model for joint object detection and pose estimation (2011) (205)
A General Framework for Tracking Multiple People from a Moving Camera (2013) (199)
Recurrent Autoregressive Networks for Online Multi-object Tracking (2017) (196)
Adversarial Feature Augmentation for Unsupervised Domain Adaptation (2017) (193)
Toward automated generation of parametric BIMs based on hybrid video and laser scanning data (2010) (186)
Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories (2009) (184)
Understanding Indoor Scenes Using 3D Geometric Phrases (2013) (181)
Automatic Targetless Extrinsic Calibration of a 3D Lidar and Camera by Maximizing Mutual Information (2012) (180)
Feedback Networks (2016) (176)
Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery (2010) (170)
Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition (2016) (168)
Integrated Sequential As-Built and As-Planned Representation with D4AR Tools in Support of Decision-Making Tasks in the AEC/FM Industry (2011) (168)
Neural Task Programming: Learning to Generalize Across Hierarchical Tasks (2017) (157)
Semantic structure from motion (2011) (155)
3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera (2019) (154)
Multiple Target Tracking in World Coordinate with Single, Minimally Calibrated Camera (2010) (148)
Automatic Extrinsic Calibration of Vision and Lidar by Maximizing Mutual Information (2015) (146)
Learning task-oriented grasping for tool manipulation from simulated self-supervision (2018) (144)
ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation (2018) (144)
Detecting and tracking people using an RGB-D camera via multiple detector fusion (2011) (139)
Comparing image classification methods: K-nearest-neighbor and support-vector-machines (2012) (138)
A multi-view probabilistic model for 3D object classes (2009) (137)
Lattice Long Short-Term Memory for Human Action Recognition (2017) (136)
Understanding Collective Activities of People from Videos (2014) (135)
3D Scene Understanding by Voxel-CRF (2013) (132)
SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark (2018) (125)
Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations (2017) (124)
Interactive Gibson Benchmark: A Benchmark for Interactive Navigation in Cluttered Environments (2019) (123)
Watch-n-patch: Unsupervised understanding of actions and relations (2015) (123)
CAR-Net: Clairvoyant Attentive Recurrent Network (2017) (123)
Estimating the aspect layout of object categories (2012) (123)
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks (2019) (122)
Extrinsic Calibration of a 3D Laser Scanner and an Omnidirectional Camera (2010) (119)
Dense Object Reconstruction with Semantic Priors (2013) (115)
Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings (2018) (114)
Semantic structure from motion with points, regions, and objects (2012) (113)
iGibson 1.0: A Simulation Environment for Interactive Tasks in Large Realistic Scenes (2020) (111)
Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration (2018) (110)
DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes (2016) (107)
Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks (2019) (105)
Research in Visualization Techniques for Field Construction (2011) (105)
Weakly Supervised 3D Reconstruction with Adversarial Constraint (2017) (102)
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks (2019) (99)
Knowledge Transfer for Scene-Specific Motion Prediction (2016) (98)
Toward coherent object detection and scene layout understanding (2010) (97)
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation (2021) (95)
DeformNet: Free-Form Deformation Network for 3D Shape Reconstruction from a Single Image (2017) (92)
6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints (2019) (90)
Deformable part models revisited: A performance evaluation for object category pose estimation (2011) (89)
Local analysis for 3D reconstruction of specular surfaces (2001) (89)
Unsupervised Semantic Parsing of Video Collections (2015) (86)
Local Shape from Mirror Reflections (2005) (84)
Generic 3D Representation via Pose Estimation and Matching (2016) (82)
A Conversational Paradigm for Program Synthesis (2022) (77)
Mechanical Search: Multi-Step Retrieval of a Target Object Occluded by Clutter (2019) (77)
A Behavioral Approach to Visual Navigation with Graph Localization Networks (2019) (77)
A coarse-to-fine model for 3D pose estimation and sub-category recognition (2015) (76)
Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses (2013) (73)
Action Recognition by Hierarchical Mid-Level Action Elements (2015) (73)
Structured Recurrent Temporal Restricted Boltzmann Machines (2014) (72)
MEVBench: A mobile computer vision benchmarking suite (2011) (71)
3D Reconstruction by Shadow Carving: Theory and Practical Evaluation (2007) (70)
Demo2Vec: Reasoning Object Affordances from Online Videos (2018) (69)
Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View (2017) (68)
iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks (2021) (68)
Discovering Groups of People in Images (2014) (67)
View Synthesis for Recognizing Unseen Poses of Object Classes (2008) (66)
Combining 3D Shape, Color, and Motion for Robust Anytime Tracking (2014) (65)
Deep Learning Under Privileged Information Using Heteroscedastic Dropout (2018) (65)
Embodied intelligence via learning and evolution (2021) (65)
ReLMoGen: Integrating Motion Generation in Reinforcement Learning for Mobile Manipulation (2020) (63)
Deep Visual MPC-Policy Learning for Navigation (2019) (61)
KETO: Learning Keypoint Representations for Tool Manipulation (2019) (60)
Large-Scale 3D Shape Reconstruction and Segmentation from ShapeNet Core55 (2017) (58)
Monitoring changes of 3D building elements from unordered photo collections (2011) (57)
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models (2023) (56)
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies (2018) (56)
Multimodal Video Indexing and Retrieval Using Directed Information (2012) (56)
HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators (2019) (55)
Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations (2020) (55)
Semantic Cross-View Matching (2015) (53)
Topological Planning with Transformers for Vision-and-Language Navigation (2020) (51)
Learning to Navigate Using Mid-Level Visual Priors (2019) (51)
JRMOT: A Real-Time 3D Multi-Object Tracker and a New Large-Scale Dataset (2020) (51)
GONet: A Semi-Supervised Deep Learning Approach For Traversability Estimation (2018) (50)
Causal Induction from Visual Observations for Goal Directed Tasks (2019) (50)
Deep Local Trajectory Replanning and Control for Robot Navigation (2019) (49)
Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation (2021) (49)
Learning to Predict Human Behavior in Crowded Scenes (2017) (48)
BEHAVIOR: Benchmark for Everyday Household Activities in Virtual, Interactive, and Ecological Environments (2021) (47)
Generative Sparse Detection Networks for 3D Single-shot Object Detection (2020) (46)
Visually bootstrapped generalized ICP (2011) (45)
A Probabilistic Framework for Real-time 3D Segmentation using Spatial, Temporal, and Semantic Cues (2016) (45)
An efficient branch-and-bound algorithm for optimal human pose estimation (2012) (45)
Situational Fusion of Visual Representation for Visual Navigation (2019) (44)
What do reflections tell us about the shape of a mirror? (2004) (44)
Machine Vision for Natural Gas Methane Emissions Detection Using an Infrared Camera (2019) (44)
Representations and Techniques for 3D Object Recognition and Scene Interpretation (2011) (43)
Shadow carving (2001) (43)
Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation (2019) (40)
Semantic Parsing of Large-Scale Indoor Spaces (2016) (40)
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis (2022) (40)
Enriching object detection with 2D-3D registration and continuous viewpoint estimation (2015) (40)
Cracking open the DNN black-box: Video Analytics with DNNs across the Camera-Cloud Boundary (2019) (39)
Robust real-time tracking combining 3D shape, color, and motion (2016) (39)
Interactive Visual Construction Progress Monitoring with D4AR - 4D Augmented Reality - Models (2009) (39)
Goal-Aware Prediction: Learning to Model What Matters (2020) (39)
JRDB: A Dataset and Benchmark of Egocentric Robot Visual Perception of Humans in Built Environments (2021) (38)
Free your Camera: 3D Indoor Scene Understanding from Arbitrary Camera Motion (2013) (37)
Deep Affordance Foresight: Planning Through What Can Be Done in the Future (2020) (37)
Indoor Scene Understanding with Geometric and Semantic Contexts (2015) (37)
Monocular Multiview Object Tracking with 3D Aspect Parts (2014) (36)
Object Co-detection (2012) (36)
A Geometric Approach to Active Learning for Convolutional Neural Networks (2017) (35)
EFFEX: An embedded processor for computer vision based feature extraction (2011) (35)
Long-term path prediction in urban scenarios using circular distributions (2018) (34)
Understanding the 3D layout of a cluttered room from multiple images (2014) (34)
Robust single-view instance recognition (2016) (34)
Automated Model-Based Recognition of Progress Using Daily Construction Photographs and IFC-Based 4D Models (2010) (34)
Human-in-the-Loop Imitation Learning using Remote Teleoperation (2020) (33)
Layout Estimation of Highly Cluttered Indoor Scenes Using Geometric and Semantic Cues (2013) (32)
Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning (2019) (31)
Robust object pose estimation via statistical manifold modeling (2011) (31)
Weakly Supervised Generative Adversarial Networks for 3D Reconstruction (2017) (30)
Deep Learning for Single-View Instance Recognition (2015) (30)
Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation (2020) (30)
Weakly Supervised Learning of Mid-Level Features with Beta-Bernoulli Process Restricted Boltzmann Machines (2013) (30)
Robot Navigation in Constrained Pedestrian Environments using Reinforcement Learning (2020) (29)
Monitoring of Construction Performance Using Daily Progress Photograph Logs and 4D As-Planned Models (2009) (28)
Object Detection by 3D Aspectlets and Occlusion Reasoning (2013) (28)
Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter (2020) (27)
Find the Best Path: An Efficient and Accurate Classifier for Image Hierarchies (2013) (27)
Multimodal Sensor Fusion with Differentiable Filters (2020) (27)
Regression Planning Networks (2019) (27)
Detecting Specular Surfaces on Natural Images (2007) (27)
AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers (2019) (27)
Watch-n-Patch: Unsupervised Learning of Actions and Relations (2016) (26)
CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (2022) (25)
Breaking the Chain: Liberation from the Temporal Markov Assumption for Tracking Human Poses (2013) (25)
Pose Estimation Errors, the Ultimate Diagnosis (2016) (25)
Multi-view Object Categorization and Pose Estimation (2010) (25)
Improving Social Awareness Through DANTE: Deep Affinity Network for Clustering Conversational Interactants (2019) (24)
Recovering Local Shape of a Mirror Surface from Reflection of a Regular Grid (2004) (24)
VUNet: Dynamic Scene View Synthesis for Traversability Estimation Using an RGB Camera (2018) (24)
Relating Things and Stuff via Object Property Interactions (2014) (23)
Translating Navigation Instructions in Natural Language to a High-Level Plan for Behavioral Robot Navigation (2018) (23)
Forecasting Social Navigation in Crowded Complex Scenes (2016) (22)
image2mass: Estimating the Mass of an Object from Its Image (2017) (21)
LASER: Learning a Latent Action Space for Efficient Reinforcement Learning (2021) (21)
ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems (2017) (20)
TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild (2021) (19)
Deep View Morphing (2017) (19)
Efficient and Exact MAP-MRF Inference using Branch and Bound (2012) (19)
To Go or Not To Go? A Near Unsupervised Learning Approach For Robot Navigation (2017) (18)
Object Detection with Geometrical Context Feedback Loop (2010) (18)
Relating Things and Stuff by High-Order Potential Modeling (2012) (17)
Merlion: A Machine Learning Library for Time Series (2021) (17)
Interactive Gibson: A Benchmark for Interactive Navigation in Cluttered Environments (2019) (17)
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation (2022) (17)
JRDB: A Dataset and Benchmark for Visual Perception for Navigation in Human Environments (2019) (17)
Learning Multi-Arm Manipulation Through Collaborative Teleoperation (2020) (16)
Hierarchical classification of images by sparse approximation (2013) (16)
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Active Tasks (2018) (16)
D4AR - 4 dimensional augmented reality - models for automation and interactive visualization of construction progress monitoring (2010) (15)
Adaptive Procedural Task Generation for Hard-Exploration Problems (2020) (15)
Watch-Bot: Unsupervised learning for reminding humans of forgotten actions (2015) (15)
Semantic and Geometric Modeling with Neural Message Passing in 3D Scene Graphs for Hierarchical Mechanical Search (2020) (15)
Long Document Summarization with Top-down and Bottom-up Inference (2022) (15)
Sample-Efficient Safety Assurances using Conformal Prediction (2021) (14)
Relating Things and Stuff via Object Property Interactions (2013) (14)
Point-based path prediction from polar histograms (2016) (14)
Toward mutual information based automatic registration of 3D point clouds (2012) (14)
Unsupervised Object Pose Classification from Short Video Sequences (2009) (13)
Object Detection using Geometrical Context Feedback (2012) (13)
EVA: An efficient vision architecture for mobile systems (2013) (12)
VideoGasNet: Deep Learning for Natural Gas Methane Leak Classification Using an Infrared Camera (2021) (12)
Mobile object detection through client-server based vote transfer (2012) (12)
Scene Semantic Reconstruction from Egocentric RGB-D-Thermal Videos (2017) (12)
Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training (2022) (11)
Video scene categorization by 3D hierarchical histogram matching (2009) (11)
Reflections on praxis and facture in a devotional portrait diptych: a computer analysis of the mirror in Hans Memling's Virgin and Child and Maarten van Nieuwenhove (2008) (11)
A Bayesian generative model for learning semantic hierarchies (2014) (10)
Implementation of a shadow carving system for shape capture (2002) (10)
Semantic structure from motion with object and point interactions (2011) (10)
Gibson Env V2: Embodied Simulation Environments for Interactive Navigation (2019) (9)
Second order local analysis for 3D reconstruction of specular surfaces (2002) (9)
CS231A Course Notes 1: Camera Models (2017) (9)
Sparse Reconstruction and Geo-Registration of Site Photographs for As-Built Construction Representation and Automatic Progress Data Collection (2009) (8)
Retrospectives on the Embodied AI Workshop (2022) (8)
Model-Based Object Recognition (2014) (8)
Generalization Through Hand-Eye Coordination: An Action Space for Learning Spatially-Invariant Visuomotor Control (2021) (8)
Toward Automatic 3D Generic Object Modeling from One Single Image (2011) (8)
Biological data annotation via a human-augmenting AI-based labeling system (2021) (8)
Unsupervised camera localization in crowded spaces (2017) (8)
Supplemental Material: Understanding Indoor Scenes using 3D Geometric Phrases (2013) (8)
Object detection, shape recovery, and 3D modelling by depth-encoded hough voting (2013) (8)
Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation (2021) (7)
Behavioral Indoor Navigation With Natural Language Directions (2018) (7)
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration (2021) (7)
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection (2021) (7)
Label transfer exploiting three-dimensional structure for semantic segmentation (2013) (6)
Toward mutual information based place recognition (2014) (6)
LAVIS: A Library for Language-Vision Intelligence (2022) (6)
Masked Unsupervised Self-training for Zero-shot Image Classification (2022) (6)
Unsupervised Transductive Domain Adaptation (2016) (6)
Shape reconstruction from shadows and reflections (2005) (6)
Testing of Depth-Encoded Hough Voting for Infrastructure Object Detection (2012) (5)
OmniXAI: A Library for Explainable AI (2022) (5)
Special issue on 3D representation for object and scene recognition (2009) (5)
BEHAVIOR-1K: A Benchmark for Embodied AI with 1,000 Everyday Activities and Realistic Simulation (2022) (5)
GONet++: Traversability Estimation via Dynamic Scene View Synthesis (2018) (5)
The Group and Crowd Analysis Interdisciplinary Challenge (2017) (5)
How Trustworthy are the Existing Performance Evaluations for Basic Vision Tasks? (2020) (5)
Adversarially Robust Policy Learning through Active Construction of Physically-Plausible Perturbations (2017) (5)
Semantic Structure from Motion: A Novel Framework for Joint Object Recognition and 3D Reconstruction (2011) (5)
ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding (2022) (5)
How Trustworthy are Performance Evaluations for Basic Vision Tasks? (2020) (5)
Generic 3D Representation via Pose Estimation and Matching: Supplementary Material (2016) (4)
Localized Calibration: Metrics and Recalibration (2021) (4)
Probabilistic Visual Navigation with Bidirectional Image Prediction (2020) (4)
Unsupervised Semantic Action Discovery from Video Collections (2016) (4)
Discovering Generalizable Skills via Automated Generation of Diverse Tasks (2021) (3)
Remote assessment of pre- and post-disaster critical physical infrastructures using mobile workstation chariot and D4AR models (2019) (3)
A Bayesian Approach to Tracking Learning Detection (2013) (3)
Leveraging Pretrained Image Classifiers for Language-Based Segmentation (2019) (3)
Workshop on Machine Learning for Autonomous Vehicles 2017 (2)
Coupled Recurrent Network (CRN) (2018) (2)
JRDB-Act: A Large-scale Multi-modal Dataset for Spatio-temporal Action, Social Group and Activity Detection (2021) (2)
Semantic Parsing of Large-Scale Indoor Spaces Supplementary Material (2016) (2)
Recognizing Complex Human Activities via Crowd Context (2014) (2)
Scene Understanding for the Visually Impaired Using Visual Sonification by Visual Feature Analysis and Auditory Signatures (2012) (2)
Local calibration: metrics and recalibration (2021) (2)
Supplementary Material for “Data-Driven 3D Voxel Patterns for Object Category Recognition” (2015) (1)
Model-based detection of progress using D4AR models generated by daily site photologs and building information models (2019) (1)
Human Centred Object Co-Segmentation (2016) (1)
MVSS: Michigan Visual Sonification System (2012) (1)
A Discriminative Model for Learning Semantic and Geometric Interactions in Indoor Scenes (2013) (1)
Shrinkage Optimized Directed Information using Pictorial Structures for Action Recognition (2014) (1)
Classification of Satellite Images based on Scale-Invariant Feature Transform (2012) (1)
CS231A Course Notes 2: Single View Metrology (2017) (1)
SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning (2019) (1)
Technical Report: Articulated Part-based Model for Joint Object Detection and Pose Estimation (2011) (1)
Computer Vision: From 3d Reconstruction to Visual Recognition (2020) (1)
CS231A Course Notes 4: Stereo Systems and Structure from Motion (2017) (1)
Can we see the shape of a mirror (2010) (1)
Interactive Pedestrian Simulation in iGibson (2021) (1)
Linear Artificial Forces for Human Dynamics in Complex Contexts (2020) (1)
Online Distribution Shift Detection via Recency Prediction (2022) (1)
Multi-target tracking with context from interaction feature strings (2014) (1)
CEC: Research in visualization techniques for field construction (2010) (1)
Privacy Preserving Recalibration under Domain Shift (2020) (1)
Supplementary Material for the Paper “Enriching Object Detection with 2D-3D Registration and Continuous Viewpoint Estimation” (2015) (1)
Time-Varying Interaction Estimation Using Ensemble Methods (2019) (1)
AR – A 4-DIMENSIONAL AUGMENTED REALITY MODEL FOR AUTOMATING CONSTRUCTION (2009) (1)
Supplemental Material: Discovering Groups of People in Images (2014) (0)
Shape from Specularities (2014) (0)
Indoor Scene Understanding with Geometric and Semantic Contexts (2014) (0)
Minkowski Tracker: A Sparse Spatio-Temporal R-CNN for Joint Object Detection and Tracking (2022) (0)
Feedback based Neural Networks (2016) (0)
Carving from Ray-Tracing Constraints: IRT-Carving (2006) (0)
Masked Unsupervised Self-training for Label-free Image Classification (2022) (0)
ICRA 2019 Best Paper Award Recipients Announced [Society News] (2019) (0)
Neural Architecture Search From Fréchet Task Distance (2021) (0)
CS231A Course Notes 5: Active and Volumetric Stereo (2017) (0)
Perceiving the 3D World from Images (2013) (0)
Object Pose Dataset using Discriminatively Trained Deformable Part Models (2014) (0)
Procedure-Aware Pretraining for Instructional Video Understanding (2023) (0)
Towards Grasp Transfer using Shape Deformation (2017) (0)
Unsupervised Activity Learning and Parsing Learned Action 1 : Selected Visual Atoms : Selected Language Atoms (2018) (0)
Editorial of Special Issue on Shape Representations Meet Visual Recognition (2015) (0)
Network learns generic object tracking Neural Network Test : Frozen weights Current frame Network tracks novel objects ( no finetuning ) (2016) (0)
Sparse Representation of Multimodality Sensing Databases for Data Mining and Retrieval (2015) (0)
Multi-view Indoor Spatial Layout Estimation Yuanfang ( Yolanda ) (2016) (0)
2D AND 3D VISUAL RECOGNITION: APPROACHES AND METHODS (2011) (0)
HIVE: Harnessing Human Feedback for Instructional Visual Editing (2023) (0)
WS14: Challenges and opportunities in robot perception (2011) (0)
Eye-BEHAVIOR: An Eye-Tracking Dataset for Everyday Household Activities in Virtual, Interactive, and Ecological Environments (2022) (0)
Environment-aware Pedestrian Trajectory Prediction for Autonomous Driving (2020) (0)
Supplementary Materials for Generative Sparse Detection Networks (2020) (0)
Learning Hierarchical Linguistic Descriptions of Visual Datasets (2013) (0)
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages (2023) (0)
Model-Based Object Recognition: Traditional Approach (2020) (0)
Localizing Against Drawn Maps via Spline-Based Registration (2020) (0)
Visual Pattern Recognition Models of Infrastructure Elements vs. Depth-Encoded Hough Voting (2012) (0)
Generating Procedural 3D materials from Images using Neural Networks (2022) (0)
MultiView ConvNet Feature Learning for Keypoint Detection and Matching, ジュンフュンクオン (2017) (0)
Why do we see some surfaces as reflective (2010) (0)
Hierarchical Task Generalization with Neural Programs (2017) (0)
Reality Capturing and Modeling with Visual and Spatial Sensing (2009) (0)
When are reflections useful in perceiving the shape of shiny surfaces (2010) (0)
Query Image Horizon + Semantic Segments GIS Map Rectified View Overlapping Tiles System Input Descriptor Extraction DescriptorN Matching (0)
Best-k Search Algorithm for Neural Text Generation (2022) (0)
Imitation Learning Init Demo Existing One-Shot Approaches Policy Networks Action : Pick ( A ) Symbol Gnding Networks (2019) (0)
Learning a Visual State Representation for Generative Adversarial Imitation Learning (2017) (0)

Other Resources About Silvio Savarese

profiles.stanford.edu
