Ross Girshick
#69,760 Most Influential Person Now
Ross Girshick's AcademicInfluence.com Rankings
Ross Girshick Computer Science Degrees
- Computer Science: World Rank #2309, Historical Rank #2406
- Algorithms: World Rank #45, Historical Rank #45
- Artificial Intelligence: World Rank #237, Historical Rank #242
- Database: World Rank #236, Historical Rank #245
Computer Science
Ross Girshick's Degrees
- PhD Computer Science University of California, Berkeley
- Bachelors Computer Science Stanford University
Ross Girshick's Published Works
- Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks (2015) (42725)
- You Only Look Once: Unified, Real-Time Object Detection (2015) (22888)
- Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation (2013) (20872)
- Fast R-CNN (2015) (17626)
- Mask R-CNN (2017) (14940)
- Caffe: Convolutional Architecture for Fast Feature Embedding (2014) (14406)
- Feature Pyramid Networks for Object Detection (2016) (13673)
- Focal Loss for Dense Object Detection (2017) (13618)
- Object Detection with Discriminatively Trained Part Based Models (2010) (9938)
- Aggregated Residual Transformations for Deep Neural Networks (2016) (7323)
- Momentum Contrast for Unsupervised Visual Representation Learning (2019) (5986)
- Non-local Neural Networks (2017) (5920)
- Focal Loss for Dense Object Detection (2017) (3951)
- Mask R-CNN (2017) (3018)
- Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour (2017) (2753)
- Training Region-Based Object Detectors with Online Hard Example Mining (2016) (1921)
- Unsupervised Deep Embedding for Clustering Analysis (2015) (1908)
- Improved Baselines with Momentum Contrastive Learning (2020) (1886)
- Region-Based Convolutional Networks for Accurate Object Detection and Segmentation (2016) (1845)
- Masked Autoencoders Are Scalable Vision Learners (2021) (1687)
- CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (2016) (1569)
- Hypercolumns for object segmentation and fine-grained localization (2014) (1488)
- Learning Rich Features from RGB-D Images for Object Detection and Segmentation (2014) (1408)
- Simultaneous Detection and Segmentation (2014) (1161)
- Part-Based R-CNNs for Fine-Grained Category Detection (2014) (1063)
- Exploring the Limits of Weakly Supervised Pretraining (2018) (1048)
- Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks (2015) (1047)
- Cascade object detection with deformable part models (2010) (936)
- Designing Network Design Spaces (2020) (843)
- Panoptic Segmentation (2018) (774)
- Rethinking ImageNet Pre-Training (2018) (760)
- Panoptic Feature Pyramid Networks (2019) (728)
- DenseNet: Implementing Efficient ConvNet Descriptor Pyramids (2014) (618)
- Low-Shot Visual Recognition by Shrinking and Hallucinating Features (2016) (616)
- LVIS: A Dataset for Large Vocabulary Instance Segmentation (2019) (579)
- Low-Shot Learning from Imaginary Data (2018) (566)
- Learning Features by Watching Objects Move (2016) (467)
- PointRend: Image Segmentation As Rendering (2019) (461)
- Inferring and Executing Programs for Visual Reasoning (2017) (458)
- Detecting and Recognizing Human-Object Interactions (2017) (426)
- Efficient regression of general-activity human poses from depth images (2011) (426)
- Analyzing the Performance of Multilayer Neural Networks for Object Recognition (2014) (416)
- Deformable part models are convolutional neural networks (2014) (409)
- Efficient Human Pose Estimation from Single Depth Images (2013) (392)
- Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks (2016) (375)
- Contextual Action Recognition with R*CNN (2015) (374)
- Object Detection Networks on Convolutional Feature Maps (2015) (356)
- Long-Term Feature Banks for Detailed Video Understanding (2018) (356)
- Data Distillation: Towards Omni-Supervised Learning (2017) (344)
- Reducing Overfitting in Deep Networks by Decorrelating Representations (2015) (338)
- LSDA: Large Scale Detection through Adaptation (2014) (327)
- Early Convolutions Help Transformers See Better (2021) (319)
- Visual Storytelling (2016) (312)
- Exploring Randomly Wired Neural Networks for Image Recognition (2019) (303)
- Object Detection with Grammar Models (2011) (290)
- Learning to Segment Every Thing (2017) (261)
- On learning to localize objects with minimal supervision (2014) (254)
- TensorMask: A Foundation for Dense Object Segmentation (2019) (245)
- Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation (2015) (236)
- Aligning 3D models to RGB-D images of cluttered scenes (2015) (226)
- Efficient Human Pose Estimation from Single Depth Images (2013) (218)
- Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels (2015) (181)
- Exploring Nearest Neighbor Approaches for Image Captioning (2015) (179)
- Using k-Poselets for Detecting People and Localizing Their Keypoints (2014) (158)
- Exploring Plain Vision Transformer Backbones for Object Detection (2022) (146)
- R-CNNs for Pose Estimation and Action Detection (2014) (144)
- A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning (2021) (142)
- Actions and Attributes from Wholes and Parts (2014) (134)
- Understanding Objects in Detail with Fine-Grained Attributes (2014) (109)
- Sparselet Models for Efficient Multiclass Object Detection (2012) (92)
- Boundary IoU: Improving Object-Centric Image Segmentation Evaluation (2021) (85)
- PHYRE: A New Benchmark for Physical Reasoning (2019) (77)
- Object Instance Segmentation and Fine-Grained Localization Using Hypercolumns (2017) (76)
- A Multigrid Method for Efficiently Training Video Models (2019) (70)
- Learning by Asking Questions (2017) (63)
- Benchmarking Detection Transfer Learning with Vision Transformers (2021) (60)
- Visual object detection with deformable part models (2013) (58)
- Are Labels Necessary for Neural Architecture Search? (2020) (58)
- From rigid templates to grammars: object detection with structured models (2012) (52)
- Fast and Accurate Model Scaling (2021) (50)
- Low-shot visual object recognition (2016) (49)
- The three R's of computer vision: Recognition, reconstruction and reorganization (2016) (43)
- Revisiting Weakly Supervised Pre-Training of Visual Perception Models (2022) (40)
- Training Deformable Part Models with Decorrelated Features (2013) (37)
- Inferring 3D Object Pose in RGB-D Images (2015) (33)
- Discriminatively Activated Sparselets (2013) (32)
- PyTorchVideo: A Deep Learning Library for Video Understanding (2021) (26)
- Impact of data on generalization of AI for surgical intelligence applications (2020) (26)
- Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details (2021) (25)
- Generalized Sparselet Models for Real-Time Multiclass Object Recognition (2015) (22)
- Segment Anything (2023) (20)
- Visibility constraints on features of 3D objects (2009) (14)
- Discriminatively Trained Mixtures of Deformable Part Models (2008) (9)
- Editorial- Deep Learning for Computer Vision (2017) (9)
- Large scale weakly and semi-supervised learning for low-resource video ASR (2020) (9)
- Simulating Chinese brush painting: the parametric hairy brush (2004) (9)
- Training ASR Models By Generation of Contextual Information (2019) (6)
- Object Detection with Heuristic Coarse-to-Fine Search (2009) (5)
- Towards a Detailed Understanding of Objects and Scenes in Natural Images (2012) (4)
- DenseNet: Implementing Efficient ConvNet Descriptor Pyramids Technical Report (2014) (4)
- From Large-Scale Object Classifiers to Large-Scale Object Detectors: An Adaptation Approach (2014) (3)
- Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization (2018) (2)
- Discriminative Latent Variable Models for Object Detection (2010) (2)
- LSVM-MDPM Release 4 Notes (2010) (2)
- The effectiveness of MAE pre-pretraining for billion-scale pretraining (2023) (2)
- I1.4: Invited Paper: Indoor Scene Understanding from RGB-D Images (2015) (2)
- Learning Visual Classifiers using Human-centric Annotations (2015) (2)
- Inferring and Executing Programs for Visual Reasoning Supplementary Material (2017) (2)
- PyTorchVideo (2021) (1)
- Learning and transferring movie styles (2017) (0)
- Supplementary materials: PointRend: Image Segmentation as Rendering (2020) (0)
- Simulating Chinese brush painting: a geometric model (2004) (0)
- Transforming the output of GANs by fine-tuning them with features from different datasets (2019) (0)
- Indoor Scene Understanding with RGB-D Images: Bottom-up Segmentation, Object Detection and Semantic Segmentation (2014) (0)
- Study of Entity Detection and Identification using Deep Learning Techniques a Survey (2020) (0)
- Exploiting and Introducing Parallelism for Efficient Object Detection (2013) (0)
- Deformable Part Models are Convolutional Neural Networks Tech report (2014) (0)
- Snack time in the lab (2013) (0)
- Training deformable part models with decorrelated features: Supplementary material (2013) (0)