Hongsheng Li
#156,277
Most Influential Person Now
Hongsheng Li's AcademicInfluence.com Rankings
Hongsheng Licomputer-science Degrees
Computer Science
#8622
World Rank
#9064
Historical Rank
Machine Learning
#3579
World Rank
#3623
Historical Rank
Artificial Intelligence
#3888
World Rank
#3944
Historical Rank
Database
#5617
World Rank
#5828
Historical Rank

Download Badge
Computer Science
Hongsheng Li's Degrees
- PhD Computer Science Stanford University
- Masters Computer Science Stanford University
- Bachelors Computer Science Tsinghua University
Similar Degrees You Can Earn
Why Is Hongsheng Li Influential?
(Suggest an Edit or Addition)Hongsheng Li's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks (2016) (2179)
- PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud (2018) (1330)
- Cross-scene crowd counting via deep convolutional neural networks (2015) (974)
- PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection (2019) (536)
- From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network (2019) (417)
- Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification (2020) (297)
- Group-Wise Correlation Stereo Network (2019) (284)
- Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID (2020) (267)
- Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals (2017) (257)
- Person Search with Natural Language Description (2017) (238)
- FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification (2018) (232)
- Group Consistent Similarity Learning via Deep CRF for Person Re-identification (2018) (206)
- Balanced Meta-Softmax for Long-Tailed Visual Recognition (2020) (191)
- CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval (2019) (182)
- Identity-Aware Textual-Visual Matching with Latent Co-attention (2017) (174)
- Efficient Attention: Attention with Linear Complexities (2018) (168)
- Interpolated Convolutional Networks for 3D Point Cloud Understanding (2019) (167)
- Object Detection in Videos with Tubelet Proposal Networks (2017) (162)
- CLIP-Adapter: Better Vision-Language Models with Feature Adapters (2021) (148)
- Video Person Re-identification with Competitive Snippet-Similarity Aggregation and Co-attentive Snippet Embedding (2018) (143)
- Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints (2019) (142)
- Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis (2019) (140)
- Eliminating Background-bias for Robust Person Re-identification (2018) (128)
- End-to-End Deep Kronecker-Product Matching for Person Re-identification (2018) (118)
- Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing (2019) (115)
- Deep Group-Shuffling Random Walk for Person Re-identification (2018) (110)
- End-to-End Object Detection with Adaptive Clustering Transformer (2020) (109)
- Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association (2018) (100)
- Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation (2020) (98)
- Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling (2021) (93)
- Data-Driven Crowd Understanding: A Baseline for a Large-Scale Crowd Dataset (2016) (90)
- Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation (2020) (79)
- PointCLIP: Point Cloud Understanding by CLIP (2021) (79)
- PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection (2021) (79)
- Learning N: M Fine-grained Structured Sparse Neural Networks From Scratch (2021) (78)
- Part-A2 Net: 3D Part-Aware and Aggregation Neural Network for Object Detection from Point Cloud (2019) (77)
- UniFormer: Unifying Convolution and Self-attention for Visual Recognition (2022) (72)
- ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection (2021) (69)
- Monocular 3D Object Detection with Decoupled Structured Polygon Estimation and Height-Guided Depth Estimation (2020) (64)
- UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning (2022) (64)
- Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization (2020) (62)
- Self-supervising Fine-grained Region Similarities for Large-scale Image Localization (2020) (60)
- Multi-Modality Latent Interaction Network for Visual Question Answering (2019) (60)
- Question-Guided Hybrid Convolution for Visual Question Answering (2018) (54)
- LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector (2021) (48)
- ConvMAE: Masked Convolution Meets Masked Autoencoders (2022) (47)
- DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network (2021) (46)
- FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting (2021) (44)
- 3D Sketch-Aware Semantic Scene Completion via Semi-Supervised Structure Prior (2020) (43)
- LiDAR-based Panoptic Segmentation via Dynamic Shifting Network (2020) (41)
- Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency (2021) (41)
- Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training (2022) (40)
- FlowFormer: A Transformer Architecture for Optical Flow (2022) (39)
- Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification (2021) (39)
- Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks (2021) (37)
- Conditional Adversarial Generative Flow for Controllable Image Synthesis (2019) (34)
- Container: Context Aggregation Network (2021) (34)
- Generalizing Monocular 3D Human Pose Estimation in the Wild (2019) (32)
- Multi-organ Segmentation via Co-training Weight-averaged Models from Few-organ Datasets (2020) (31)
- Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID. (2020) (30)
- EfficientFCN: Holistically-guided Decoding for Semantic Segmentation (2020) (29)
- Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-Based Perception (2021) (29)
- EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers (2022) (27)
- Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers (2021) (25)
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization (2021) (24)
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning (2022) (23)
- Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation (2021) (22)
- ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition (2022) (22)
- Semantic Scene Completion via Integrating Instances and Scene in-the-Loop (2021) (21)
- MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection (2022) (20)
- StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching (2020) (20)
- Rethinking Noise Synthesis and Modeling in Raw Denoising (2021) (19)
- Learning to Predict Context-adaptive Convolution for Semantic Segmentation (2020) (19)
- Decoupled Spatial-Temporal Transformer for Video Inpainting (2021) (19)
- MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection (2022) (18)
- Semi-Supervised Monocular 3D Face Reconstruction With End-to-End Shape-Preserved Domain Transfer (2019) (17)
- DominoSearch: Find layer-wise fine-grained N: M sparse schemes from dense neural networks (2021) (17)
- A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes (2019) (17)
- LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation (2021) (17)
- Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions (2020) (16)
- Progressive Correspondence Pruning by Consensus Learning (2021) (14)
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (2022) (13)
- Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders (2022) (13)
- RBGNet: Ray-based Grouping for 3D Object Detection (2022) (13)
- Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer (2022) (13)
- VS-Net: Voting with Segmentation for Visual Localization (2021) (13)
- IDR: Self-Supervised Image Denoising via Iterative Data Refinement (2021) (13)
- UniNet: Unified Architecture Search with Convolution, Transformer, and MLP (2021) (12)
- Unsupervised Cross-spectral Stereo Matching by Learning to Synthesize (2019) (12)
- ST3D++: Denoised Self-Training for Unsupervised Domain Adaptation on 3D Object Detection (2021) (11)
- Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning (2022) (11)
- SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks (2020) (11)
- Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification (2021) (11)
- AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks (2021) (11)
- Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields (2022) (10)
- Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners (2023) (10)
- Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification (2022) (9)
- Generalizable Neural Performer: Learning Robust Radiance Fields for Human Novel View Synthesis (2022) (9)
- Structured Domain Adaptation for Unsupervised Person Re-identification (2020) (9)
- RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax (2020) (9)
- Categorical Relation-Preserving Contrastive Knowledge Distillation for Medical Image Classification (2021) (8)
- SymReg-GAN: Symmetric Image Registration With Generative Adversarial Networks (2021) (8)
- REFINE: Prediction Fusion Network for Panoptic Segmentation (2021) (8)
- Multi-Modality Self-Distillation for Weakly Supervised Temporal Action Localization (2022) (7)
- LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention (2023) (7)
- Hybrid Supervision Learning for Pathology Whole Slide Image Classification (2021) (7)
- PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges (2020) (6)
- Scalable Transformers for Neural Machine Translation (2021) (6)
- Collaboration of Pre-trained Models Makes Better Few-shot Learner (2022) (6)
- A Simple Long-Tailed Recognition Baseline via Vision-Language Model (2021) (6)
- Person Re-Identification With Deep Kronecker-Product Matching and Group-Shuffling Random Walk (2019) (6)
- Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud Analysis (2023) (6)
- LIFE: Lighting Invariant Flow Estimation (2021) (5)
- Pyramid Fusion Transformer for Semantic Segmentation (2022) (5)
- A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection (2020) (4)
- A Unified Multi-Scenario Attacking Network for Visual Object Tracking (2021) (4)
- Instance-weighted Central Similarity for Multi-label Image Retrieval (2021) (4)
- Meta Knowledge Distillation (2022) (4)
- FNAS: Uncertainty-Aware Fast Neural Architecture Search (2021) (4)
- Parameter-Efficient Image-to-Video Transfer Learning (2022) (4)
- Robust Self-Supervised LiDAR Odometry Via Representative Structure Discovery and 3D Inherent Error Modeling (2022) (4)
- Decomposed Attention: Self-Attention with Linear Complexities (2018) (3)
- 1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020 (2020) (3)
- MagnifierNet: Towards Semantic Adversary and Fusion for Person Re-identification (2020) (2)
- Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking (2023) (2)
- Better Aligning Text-to-Image Models with Human Preference (2023) (2)
- You Only Need End-to-End Training for Long-Tailed Recognition (2021) (2)
- Guest editorial: Deep learning for medical image analysis (2021) (2)
- Towards Overcoming False Positives in Visual Relationship Detection (2020) (2)
- Learning Deep Representations for Scene Labeling with Guided Supervision (2017) (1)
- MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers (2022) (1)
- MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection (2022) (1)
- Mixed Supervision Learning for Whole Slide Image Classification (2021) (1)
- Guest Editorial: Generative Adversarial Networks for Computer Vision (2020) (1)
- Fixing the Teacher-Student Knowledge Discrepancy in Distillation (2021) (1)
- LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model (2023) (1)
- Consensus-Guided Correspondence Denoising (2021) (0)
- Inverting Generative Adversarial Renderer for Face Reconstruction-Supplementary Material - (2021) (0)
- Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020 (2020) (0)
- MagnifierNet: Towards Semantic Regularization and Fusion for Person Re-identification (2020) (0)
- Personalize Segment Anything Model with One Shot (2023) (0)
- FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation (2023) (0)
- Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels (2023) (0)
- Towards Robust Face Recognition with Comprehensive Search (2022) (0)
- Supplementary Material for ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection (2021) (0)
- Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction (2023) (0)
- Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles (2023) (0)
- CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching (2023) (0)
- Environment-aware Pedestrian Trajectory Prediction for Autonomous Driving (2020) (0)
- Question : What is on the plate ? S of tm ax Linear Tanh ResNet Faster-RCNN GRU Linear Tanh (2017) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Hongsheng Li?
Hongsheng Li is affiliated with the following schools: