Sanja Fidler
#130,980
Most Influential Person Now
Researcher
Sanja Fidler's AcademicInfluence.com Rankings
Sanja Fidler's Computer Science Degrees
Computer Science
#5756
World Rank
#6075
Historical Rank
Machine Learning
#1641
World Rank
#1663
Historical Rank
Database
#2888
World Rank
#3013
Historical Rank

Computer Science
Sanja Fidler's Degrees
- PhD Computer Science University of Ljubljana
- Masters Computer Science University of Ljubljana
- Bachelors Computer Science University of Ljubljana
Why Is Sanja Fidler Influential?
Sanja Fidler's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Skip-Thought Vectors (2015) (2100)
- Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books (2015) (1817)
- Scene Parsing through ADE20K Dataset (2017) (1730)
- The Role of Context for Object Detection and Semantic Segmentation in the Wild (2014) (1080)
- Semantic Understanding of Scenes Through the ADE20K Dataset (2016) (958)
- VSE++: Improving Visual-Semantic Embeddings with Hard Negatives (2017) (748)
- Monocular 3D Object Detection for Autonomous Driving (2016) (725)
- 3D Object Proposals for Accurate Object Class Detection (2015) (689)
- Scaling Egocentric Vision: The EPIC-KITCHENS Dataset (2018) (582)
- MovieQA: Understanding Stories in Movies through Question-Answering (2015) (574)
- Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts (2014) (486)
- Order-Embeddings of Images and Language (2015) (477)
- Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation (2012) (428)
- Gated-SCNN: Gated Shape CNNs for Semantic Segmentation (2019) (417)
- Towards Diverse and Natural Image Descriptions via a Conditional GAN (2017) (390)
- Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions (2015) (378)
- 3D Graph Neural Networks for RGBD Semantic Segmentation (2017) (370)
- Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN++ (2018) (305)
- 3D Object Proposals Using Stereo Imagery for Accurate Object Class Detection (2016) (292)
- Holistic Scene Understanding for 3D Object Detection with RGBD Cameras (2013) (286)
- Annotating Object Instances with a Polygon-RNN (2017) (267)
- Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer (2019) (249)
- Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (2020) (245)
- SGN: Sequential Grouping Networks for Instance Segmentation (2017) (229)
- Be Your Own Prada: Fashion Synthesis with Structural Coherence (2017) (223)
- Towards Scalable Representations of Object Categories: Learning a Hierarchy of Parts (2007) (221)
- Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (2021) (218)
- Neuroaesthetics in fashion: Modeling the perception of fashionability (2015) (209)
- VirtualHome: Simulating Household Activities Via Programs (2018) (207)
- NerveNet: Learning Structured Policy with Graph Neural Networks (2018) (199)
- Combining reconstructive and discriminative subspace methods for robust classification and regression by subsampling (2006) (198)
- 3D Object Detection and Viewpoint Estimation with a Deformable 3D Cuboid Model (2012) (194)
- What Are You Talking About? Text-to-Image Coreference (2014) (184)
- Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs (2015) (173)
- Meta-Sim: Learning to Generate Synthetic Datasets (2019) (173)
- Fast Interactive Object Annotation With Curve-GCN (2019) (171)
- VSE++: Improved Visual-Semantic Embeddings (2017) (167)
- TorontoCity: Seeing the World with a Million Eyes (2016) (157)
- Monocular Object Instance Segmentation and Depth Ordering with CNNs (2015) (149)
- segDeepM: Exploiting segmentation and context in deep neural networks for object detection (2015) (147)
- Bottom-Up Segmentation for Top-Down Detection (2013) (143)
- Visual Semantic Search: Retrieving Videos via Complex Textual Queries (2014) (143)
- DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort (2021) (133)
- Video In Sentences Out (2012) (133)
- Box in the Box: Joint 3D Layout and Object Reasoning from Single Images (2013) (131)
- HD Maps: Fine-Grained Road Segmentation by Parsing Ground and Aerial Images (2016) (128)
- Devil Is in the Edges: Learning Semantic Boundaries From Noisy Annotations (2019) (117)
- Song From PI: A Musically Plausible Network for Pop Music Generation (2016) (116)
- Enhancing Road Maps by Parsing Aerial Images Around the World (2015) (111)
- Real-time coarse-to-fine topologically preserving segmentation (2015) (110)
- Situation Recognition with Graph Neural Networks (2017) (109)
- Personalized Federated Learning with First Order Model Optimization (2020) (107)
- A High Performance CRF Model for Clothes Parsing (2014) (93)
- Efficient Summarization with Read-Again and Copy Mechanism (2016) (93)
- The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines (2020) (91)
- Rent3D: Floor-plan priors for monocular layout estimation (2015) (91)
- Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering (2020) (90)
- MovieGraphs: Towards Understanding Human-Centric Situations from Videos (2017) (89)
- EditGAN: High-Precision Semantic Image Editing (2021) (87)
- Beat the MTurkers: Automatic Image Labeling from Weak 3D Supervision (2014) (86)
- Proximal Deep Structured Models (2016) (85)
- Holistic 3D scene understanding from a single geo-tagged image (2015) (81)
- Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization (2021) (80)
- Similarity-based cross-layered hierarchical representation for object categorization (2008) (77)
- Sports Field Localization via Deep Structured Models (2017) (74)
- Lost Shopping! Monocular Localization in Large Indoor Spaces (2015) (72)
- EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis (2019) (72)
- Extracting Triangular 3D Models, Materials, and Lighting From Images (2021) (71)
- DARNet: Deep Active Ray Network for Building Segmentation (2019) (69)
- Open Vocabulary Scene Parsing (2017) (68)
- Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research (2019) (68)
- DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation (2019) (63)
- GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images (2022) (63)
- Object Instance Annotation With Deep Extreme Level Set Evolution (2019) (61)
- Learning to Simulate Dynamic Environments With GameGAN (2020) (61)
- Magic3D: High-Resolution Text-to-3D Content Creation (2022) (61)
- LION: Latent Point Diffusion Models for 3D Shape Generation (2022) (61)
- gradSim: Differentiable simulation for system identification and visuomotor control (2021) (59)
- Learning to Generate Diverse Dance Motions with Transformer (2020) (57)
- Hierarchical Statistical Learning of Generic Parts of Object Structure (2006) (56)
- Learning to Act Properly: Predicting and Explaining Affordances from Images (2017) (56)
- Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation (2020) (54)
- Teaching Machines to Describe Images with Natural Language Feedback (2017) (54)
- Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks (2021) (53)
- Scaling Egocentric Vision: The EPIC-KITCHENS Dataset (2018) (52)
- Efficient and Information-Preserving Future Frame Prediction and Beyond (2020) (51)
- Learning Deformable Tetrahedral Meshes for 3D Reconstruction (2020) (48)
- Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis (2021) (46)
- M2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation (2022) (46)
- Video Face Clustering With Unknown Number of Clusters (2019) (46)
- A Sentence Is Worth a Thousand Pixels (2013) (45)
- A Theoretical Analysis of the Number of Shots in Few-Shot Learning (2019) (44)
- Detecting Curved Symmetric Parts Using a Deformable Disc Model (2013) (43)
- Neural Turtle Graphics for Modeling City Road Layouts (2019) (43)
- Object Categorization: Learning Hierarchical Compositional Representations of Object Structure (2009) (42)
- Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration (2020) (42)
- Identifying Clinical Terms in Medical Text Using Ontology-Guided Machine Learning (2019) (38)
- DriveGAN: Towards a Controllable High-Quality Neural Simulation (2021) (36)
- Find your way by observing the sun and other semantic cues (2016) (36)
- Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs (2013) (36)
- Learning to Evaluate Perception Models Using Planner-Centric Metrics (2020) (35)
- Action Recognition From Single Timestamp Supervision in Untrimmed Videos (2019) (34)
- Pose Estimation for Objects with Rotational Symmetry (2018) (34)
- f-Domain-Adversarial Learning: Theory and Algorithms (2021) (33)
- ATISS: Autoregressive Transformers for Indoor Scene Synthesis (2021) (33)
- Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data (2020) (32)
- A Neural Compositional Paradigm for Image Captioning (2018) (32)
- Learning to Combine Mid-Level Cues for Object Proposal Generation (2015) (32)
- Neural Graph Evolution: Towards Efficient Automatic Robot Design (2019) (31)
- Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting (2021) (30)
- A Coarse-to-Fine Taxonomy of Constellations for Fast Multi-class Object Detection (2010) (29)
- ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters (2022) (29)
- Instance-Level Segmentation with Deep Densely Connected MRFs (2015) (29)
- 3DStyleNet: Creating 3D Shapes with Geometric and Texture Style Variations (2021) (29)
- Physics-based Human Motion Estimation and Synthesis from Videos (2021) (27)
- Robust LDA Classification by Subsampling (2003) (26)
- Learning a Hierarchical Compositional Shape Vocabulary for Multi-class Object Representation (2014) (25)
- DIB-R++: Learning to Predict Lighting and Material with a Hybrid Differentiable Renderer (2021) (24)
- Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior (2021) (22)
- ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning (2019) (22)
- Human-Machine CRFs for Identifying Bottlenecks in Scene Understanding (2016) (22)
- Don't Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence (2021) (22)
- Learning to Caption Images Through a Lifetime by Asking Questions (2018) (22)
- Color Builder: A Direct Manipulation Interface for Versatile Color Theme Authoring (2019) (22)
- Generating Multi-Sentence Lingual Descriptions of Indoor Scenes (2015) (21)
- Expressive Telepresence via Modular Codec Avatars (2020) (21)
- Synthesizing Environment-Aware Activities via Activity Sketches (2019) (21)
- Selecting features for object detection using an AdaBoost-compatible evaluation function (2008) (21)
- Variable Bitrate Neural Fields (2022) (20)
- BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations (2022) (20)
- Federated Simulation for Medical Imaging (2020) (20)
- Learning Smooth Neural Functions via Lipschitz Regularization (2022) (20)
- Evaluating multi-class learning strategies in a generative hierarchical framework for object detection (2009) (19)
- Scaling Egocentric Vision: The EPIC-KITCHENS Dataset (2018) (19)
- Generating Multi-sentence Natural Language Descriptions of Indoor Scenes (2015) (18)
- SurfConv: Bridging 3D and 2D Convolution for RGBD Images (2018) (18)
- A Face-to-Face Neural Conversation Model (2018) (18)
- Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid (2020) (17)
- UniCon: Universal Neural Controller For Physics-based Character Motion (2020) (17)
- Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection (2021) (17)
- Neural Fields as Learnable Kernels for 3D Reconstruction (2021) (16)
- HouseCraft: Building Houses from Rental Ads and Street Views (2016) (16)
- Now You Shake Me: Towards Automatic 4D Cinema (2018) (15)
- Nonlinear color triads for approximation, learning and direct manipulation of color distributions (2020) (15)
- Superedge grouping for object localization by combining appearance and shape information (2012) (15)
- ScribbleBox: Interactive Annotation Framework for Video Object Segmentation (2020) (15)
- Variational Amodal Object Completion (2020) (15)
- Learning Hierarchical Representations of Object Categories for Robot Vision (2007) (14)
- Creative Flow+ Dataset (2019) (14)
- Optimization Framework for Learning a Hierarchical Shape Vocabulary for Object Class Detection (2009) (14)
- Low Budget Active Learning via Wasserstein Distance: An Integer Programming Approach (2021) (13)
- Federated Learning with Heterogeneous Architectures using Graph HyperNetworks (2022) (11)
- CrevNet: Conditionally Reversible Video Prediction (2019) (11)
- Auto-Tuning Structured Light by Optical Stochastic Gradient Descent (2020) (11)
- Evaluating multi-class learning strategies in a hierarchical framework for object detection (2009) (10)
- Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets (2021) (10)
- Visual Reasoning by Progressive Module Networks (2018) (10)
- EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations (2022) (10)
- A probabilistic model for recursive factorized image features (2011) (9)
- VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion (2023) (8)
- Multi-cue Mid-level Grouping (2014) (8)
- Emergent Road Rules In Multi-Agent Driving Environments (2020) (8)
- Interactive Annotation of 3D Object Geometry using 2D Scribbles (2020) (7)
- ∇Sim: Differentiable Simulation for System Identification and Visuomotor Control (2020) (7)
- Soccer Field Localization from a Single Image (2016) (6)
- Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion (2022) (6)
- The efficacy of Neural Planning Metrics: A meta-analysis of PKL on nuScenes (2020) (6)
- Unsupervised Disambiguation of Image Captions (2012) (5)
- Color Sails: Discrete-Continuous Palettes for Deep Color Exploration (2018) (5)
- The Shmoop Corpus: A Dataset of Stories with Loosely Aligned Summaries (2019) (5)
- Domain Adversarial Training: A Game Perspective (2022) (4)
- Towards Optimal Strategies for Training Self-Driving Perception Models in Simulation (2021) (4)
- Fed-Sim: Federated Simulation for Medical Imaging (2020) (4)
- Lifelong Learning for Image Captioning by Asking Natural Language Questions (2018) (4)
- Frame Averaging for Equivariant Shape Space Learning (2021) (4)
- Optimizing Data Collection for Machine Learning (2022) (4)
- 3D Object Detection with a Deformable 3D Cuboid Model (2013) (3)
- How Much More Data Do I Need? Estimating Requirements for Downstream Tasks (2022) (3)
- MvDeCor: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation (2022) (3)
- AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis (2022) (3)
- Situation Recognition with Graph Neural Networks Supplementary Material (2017) (3)
- A Framework for Symmetric Part Detection in Cluttered Scenes (2015) (3)
- Scalable Neural Data Server: A Data Recommender for Transfer Learning (2022) (3)
- Mimicking the In-Camera Color Pipeline for Camera-Aware Object Compositing (2019) (2)
- Causal BERT: Improving object detection by searching for challenging groups (2021) (2)
- Learning Categorical Shape from Captioned Images (2012) (2)
- Categorical Perception (2010) (2)
- Hierarchical Neural Implicit Pose Network for Animation and Motion Retargeting (2021) (2)
- ACTRCE: Augmenting Experience via Teacher’s Advice (2018) (2)
- Progressive Reasoning by Module Composition (2018) (2)
- NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation (2021) (2)
- XDGAN: Multi-Modal 3D Shape Generation in 2D Space (2022) (1)
- Selecting features for object detection using (2008) (1)
- Causal Scene BERT: Improving object detection by searching for challenging groups of data (2022) (1)
- HLS Typing Technologies (2010) (1)
- Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding (2014) (1)
- Differentially Private Generative Models Through Optimal Transport (2020) (1)
- Recognizing visual object categories with subspace methods and a learned hierarchical shape vocabulary (2010) (1)
- Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting (2023) (1)
- Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention (2022) (1)
- PADL: Language-Directed Physics-Based Character Control (2022) (1)
- Polymorphic-GAN: Generating Aligned Samples across Multiple Domains with Learned Morph Maps (2022) (1)
- Appendix: Learning Deformable Tetrahedral Meshes for 3D Reconstruction (2020) (0)
- Neural Fields meet Explicit Geometric Representation for Inverse Rendering of Urban Scenes (2023) (0)
- Supplementary Material: Variational Amodal Object Completion (2020) (0)
- ASE (2022) (0)
- Vocabulary Scene Parsing (0)
- Creative Flow+ Dataset Supplemental Material (2019) (0)
- Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models (2023) (0)
- Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration (2021) (0)
- Supplementary Material for ATISS: Autoregressive Transformers for Indoor Scene Synthesis (2021) (0)
- Neural Brushstroke Engine (2022) (0)
- A bottom-up and top-down optimization framework for learning a compositional hierarchy of object classes (2009) (0)
- Appreciation to IJCV Reviewers (2012) (0)
- NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation (Supplementary material) (2021) (0)
- Keynote Talk: Generative Hierarchical Models for Image Analysis (2009) (0)
- Neural LiDAR Fields for Novel View Synthesis (2023) (0)
- Bridging the Sim-to-Real Gap: Unsupervised Learning of Scene Structure for Synthetic Data Generation Supplementary Material (2020) (0)
- Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion (2023) (0)
- NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models (2023) (0)
- What Data Is Useful for My Data: Transfer Learning with a Mixture of Self-Supervised Experts (2019) (0)
- Progressive Module Networks (2018) (0)
- Local Descriptors (2009) (0)
- First Order Model Optimization (2021) (0)
- Identifying Clinical Terms in Free-Text Notes Using Ontology-Guided Machine Learning (2019) (0)
- Implementing Planning KL-Divergence (2020) (0)
- Creative Flow+ Dataset Errata and Data Details (2019) (0)
- Supplementary Material: EditGAN: High-Precision Semantic Image Editing (2021) (0)
- Efficient transfer learning for NLP with ELECTRA (2021) (0)
- Supplementary Material: Annotating Object Instances with a Polygon-RNN (2017) (0)
- Synthesizing Physical Character-Scene Interactions (2023) (0)
What Schools Are Affiliated With Sanja Fidler?
Sanja Fidler is affiliated with the following schools: