Gang Hua
#141,846
Most Influential Person Now
Gang Hua's AcademicInfluence.com Rankings
Gang Hua's Computer Science Degrees
Computer Science
#6856
World Rank
#7222
Historical Rank
Algorithms
#249
World Rank
#252
Historical Rank
Artificial Intelligence
#2641
World Rank
#2682
Historical Rank
Database
#3934
World Rank
#4092
Historical Rank

Computer Science
Gang Hua's Degrees
- Bachelors Computer Science Peking University
Why Is Gang Hua Influential?
Gang Hua's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- A convolutional neural network cascade for face detection (2015) (1168)
- Stacked Cross Attention for Image-Text Matching (2018) (721)
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks (2018) (528)
- Ordinal Regression with Multiple Output CNN for Age Estimation (2016) (479)
- Discriminative Learning of Local Image Descriptors (2011) (473)
- Visual attribute transfer through deep image analogy (2017) (426)
- Labeled Faces in the Wild: A Survey (2016) (424)
- CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training (2017) (389)
- StyleBank: An Explicit Representation for Neural Image Style Transfer (2017) (357)
- Neural Aggregation Network for Video Face Recognition (2016) (338)
- Gated Context Aggregation Network for Image Dehazing and Deraining (2018) (315)
- Picking the best DAISY (2009) (308)
- Descriptive visual words and visual phrases for image applications (2009) (255)
- Context-Aware Visual Tracking (2009) (254)
- Learning Discriminative Reconstructions for Unsupervised Outlier Removal (2015) (239)
- Coherent Online Video Style Transfer (2017) (235)
- Probabilistic Elastic Matching for Pose Variant Face Verification (2013) (228)
- How to Train a Compact Binary Neural Network with High Accuracy? (2017) (219)
- Similarity learning on an explicit polynomial kernel feature map for person re-identification (2015) (208)
- A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing (2017) (208)
- Discriminant Embedding for Local Image Descriptors (2007) (203)
- Towards Open-Set Identity Preserving Face Synthesis (2018) (194)
- Integrated feature selection and higher-order spatial feature extraction for object categorization (2008) (165)
- Semantic Model Vectors for Complex Video Event Recognition (2012) (163)
- Supervised Transformer Network for Efficient Face Detection (2016) (159)
- Tracking articulated body by dynamic Markov network (2003) (145)
- Building contextual visual vocabulary for large-scale image applications (2010) (139)
- Hierarchical Multimodal LSTM for Dense Visual-Semantic Embedding (2017) (137)
- Learning to estimate human pose with data driven belief propagation (2005) (135)
- Eigen-PEP for Video Face Recognition (2014) (130)
- Automatic salient object extraction with contextual cue (2011) (130)
- A Hierarchical Visual Model for Video Object Summarization (2010) (127)
- Face Relighting from a Single Image under Arbitrary Unknown Lighting Conditions (2009) (124)
- Discriminative Tracking by Metric Learning (2010) (107)
- Tracking appearances with occlusions (2003) (106)
- Stereoscopic Neural Style Transfer (2018) (102)
- Context aware topic model for scene recognition (2012) (99)
- Efficient Boosted Exemplar-Based Face Detection (2014) (99)
- Unsupervised One-Class Learning for Automatic Outlier Removal (2014) (99)
- Hyperspectral Image Classification Through Bilayer Graph-Based Learning (2014) (99)
- Implicit elastic matching with random projections for pose-variant face recognition (2009) (95)
- Probabilistic Elastic Part Model for Unsupervised Face Detector Adaptation (2013) (85)
- Supervised Matrix Factorization for Cross-Modality Hashing (2016) (82)
- Revisiting Deep Intrinsic Image Decompositions (2017) (82)
- Face Re-Lighting from a Single Image under Harsh Lighting Conditions (2007) (82)
- Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks (2019) (82)
- Dynamic hand gesture recognition: An exemplar-based approach from motion divergence fields (2012) (81)
- Video Object Discovery and Co-Segmentation with Extremely Weak Supervision (2017) (79)
- Introduction to the Special Section on Real-World Face Recognition (2011) (76)
- Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications (2011) (76)
- Multiple instance feature for robust part-based object detection (2009) (75)
- A robust elastic and partial matching metric for face recognition (2009) (74)
- Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization (2020) (71)
- Hierarchical-PEP model for real-world face recognition (2015) (71)
- Collaborative Deep Reinforcement Learning for Joint Object Search (2017) (70)
- Connections with Robust PCA and the Role of Emergent Sparsity in Variational Autoencoder Models (2018) (67)
- Multi-class Multi-annotator Active Learning with Robust Gaussian Process for Visual Recognition (2015) (67)
- What characterizes a shadow boundary under the sun and sky? (2011) (65)
- Multi-scale visual tracking by sequential belief propagation (2004) (63)
- Joint People, Event, and Location Recognition in Personal Photo Collections Using Cross-Domain Context (2010) (62)
- Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition (2018) (59)
- Face Recognition using Discriminatively Trained Orthogonal Rank One Tensor Projections (2007) (57)
- LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks (2020) (56)
- Hash-SVM: Scalable Kernel Machines for Large-Scale Visual Classification (2014) (55)
- Decouple Learning for Parameterized Image Operators (2018) (54)
- Scene Aligned Pooling for Complex Video Recognition (2012) (54)
- SGCN: Sparse Graph Convolution Network for Pedestrian Trajectory Prediction (2021) (53)
- A Joint Gaussian Process Model for Active Visual Recognition with Expertise Estimation in Crowdsourcing (2013) (50)
- Report on the FG 2015 Video Person Recognition Evaluation (2015) (50)
- Detection by detections: Non-parametric detector adaptation for a video (2012) (49)
- Switching observation models for contour tracking in clutter (2003) (49)
- Which faces to tag: Adding prior constraints into active learning (2009) (48)
- A Joint Gaussian Process Model for Active Visual Recognition with Expertise Estimation in Crowdsourcing (2013) (48)
- Accurate Object Detection with Location Relaxation and Regionlets Re-localization (2014) (47)
- Semi-supervised FusedGAN for Conditional Image Generation (2018) (47)
- Enriching Local and Global Contexts for Temporal Action Localization (2021) (43)
- Efficient Optimal Kernel Placement for Reliable Visual Tracking (2006) (40)
- Deep Model Intellectual Property Protection via Deep Watermarking (2021) (40)
- Few-Shot Open-Set Recognition Using Meta-Learning (2020) (40)
- IBM Research TRECVID-2010 Video Copy Detection and Multimedia Event Detection System (2010) (39)
- Collaborative Active Learning of a Kernel Machine Ensemble for Recognition (2013) (39)
- Video Object Discovery and Co-Segmentation with Extremely Weak Supervision (2014) (38)
- Collaborative Active Visual Recognition from Crowds: A Distributed Ensemble Approach (2018) (37)
- Semi-supervised Relational Topic Model for Weakly Annotated Image Recognition in Social Media (2014) (37)
- Measurement integration under inconsistency for robust tracking (2006) (37)
- Weakly Supervised Visual Dictionary Learning by Harnessing Image Attributes (2014) (37)
- IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System (2011) (36)
- Self-Robust 3D Point Recognition via Gather-Vector Guidance (2020) (36)
- A statistical field model for pedestrian detection (2005) (35)
- Spatial-DiscLDA for visual recognition (2011) (35)
- Order-Preserving Wasserstein Distance for Sequence Matching (2017) (34)
- ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization (2021) (33)
- Efficient Semantic Image Synthesis via Class-Adaptive Normalization (2020) (33)
- Passport-aware Normalization for Deep Model Protection (2020) (32)
- A Comprehensive Approach to Image Spam Detection: From Server to Client Solution (2010) (32)
- Automatic salient object extraction with contextual cue and its applications to recognition and alpha matting (2013) (31)
- Can Visual Recognition Benefit from Auxiliary Information in Training? (2014) (30)
- Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network (2018) (30)
- Any-Precision Deep Neural Networks (2019) (29)
- A Simple Baseline for StyleGAN Inversion (2021) (29)
- A Multi-level Contextual Model for Person Recognition in Photo Albums (2016) (28)
- Learning Dynamics via Graph Neural Networks for Human Pose Estimation and Tracking (2021) (27)
- Joint Segmentation and Recognition of Categorized Objects From Noisy Web Image Collection (2014) (27)
- Topical video object discovery from key frames by modeling word co-occurrence prior (2013) (27)
- The IJCB 2014 PaSC video face and person recognition competition (2014) (26)
- Multi-View Visual Recognition of Imperfect Testing Data (2015) (26)
- Three-Dimensional Traffic Scenes Simulation From Road Image Sequences (2016) (26)
- ObjectPatchNet: Towards scalable and semantic image annotation and retrieval (2014) (25)
- Ladder Loss for Coherent Visual-Semantic Embedding (2019) (25)
- SaccadeNet: A Fast and Accurate Object Detector (2020) (25)
- Implicit Autoencoder for Point Cloud Self-supervised Representation Learning (2022) (25)
- Description-Discrimination Collaborative Tracking (2014) (25)
- Semi-Supervised Learning with Manifold Fitted Graphs (2013) (25)
- Semi-online Multi-people Tracking by Re-identification (2020) (25)
- Hidden Talents of the Variational Autoencoder (2017) (24)
- Adversarial Ranking Attack and Defense (2020) (24)
- ER3: A Unified Framework for Event Retrieval, Recognition and Recounting (2017) (24)
- Diverse Semantic Image Synthesis via Probability Distribution Modeling (2021) (24)
- IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems (2012) (22)
- Knowledge-Based Topic Model for Unsupervised Object Discovery and Localization (2018) (22)
- Iterative Local-Global Energy Minimization for Automatic Extraction of Objects of Interest (2006) (22)
- A General Decoupled Learning Framework for Parameterized Image Operators (2019) (22)
- Introduction to the Special Issue on Mobile Vision (2012) (22)
- Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos (2019) (21)
- Correlational Gaussian Processes for Cross-Domain Visual Recognition (2017) (21)
- DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval (2021) (21)
- What can visual content analysis do for text based image search? (2009) (20)
- Order-Preserving Optimal Transport for Distances between Sequences (2019) (20)
- Automatic Business Card Scanning with a Camera (2006) (20)
- Variational maximum a posteriori by annealed mean field analysis (2005) (19)
- Auxiliary Training Information Assisted Visual Recognition (2015) (19)
- Efficient Scale-Space Spatiotemporal Saliency Tracking for Distortion-Free Video Retargeting (2009) (18)
- Calibrated Domain-Invariant Learning for Highly Generalizable Large Scale Re-Identification (2019) (17)
- Meta-tag propagation by co-training an ensemble classifier for improving image search relevance (2008) (16)
- Poison Ink: Robust and Invisible Backdoor Attack (2021) (16)
- Action Coherence Network for Weakly Supervised Temporal Action Localization (2019) (16)
- Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction (2021) (15)
- Meta Pairwise Relationship Distillation for Unsupervised Person Re-identification (2021) (14)
- Sequential mean field variational analysis of structured deformable shapes (2006) (14)
- Towards large scale land-cover recognition of satellite images (2011) (14)
- A decentralized probabilistic approach to articulated body tracking (2007) (14)
- Motion divergence fields for dynamic hand gesture recognition (2011) (14)
- Modeling spatial and semantic cues for large-scale near-duplicated image retrieval (2011) (14)
- GistNet: a Geometric Structure Transfer Network for Long-Tailed Recognition (2021) (14)
- A nonnegative sparsity induced similarity measure with application to cluster analysis of spam images (2010) (14)
- Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks (2018) (13)
- PEYE: Toward a Visual Motion Based Perceptual Interface for Mobile Devices (2007) (13)
- Controllable Image Processing via Adaptive FilterBank Pyramid (2020) (12)
- An egocentric vision based assistive co-robot (2013) (12)
- An egocentric computer vision based co-robot wheelchair (2016) (12)
- Visual Tracking via Joint Discriminative Appearance Learning (2017) (12)
- Exemplar-Guided Similarity Learning on Polynomial Kernel Feature Map for Person Re-identification (2017) (12)
- Joint Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation (2018) (11)
- Recurrent Variational Autoencoders for Learning Nonlinear Generative Models in the Presence of Outliers (2018) (11)
- Visual quality assessment for web videos (2010) (11)
- Video Object Co-Segmentation from Noisy Videos by a Multi-Level Hypergraph Model (2018) (11)
- Texture Synthesis (2020) (11)
- Fast, Accurate Thin-Structure Obstacle Detection for Autonomous Mobile Robots (2017) (10)
- Interest seam image (2010) (10)
- Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context (2021) (10)
- Practical Relative Order Attack in Deep Ranking (2021) (10)
- Explicit Filterbank Learning for Neural Image Style Transfer and Image Processing (2020) (10)
- Probabilistic Elastic Part Model: A Pose-Invariant Representation for Real-World Face Verification (2018) (10)
- IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems (2013) (9)
- Temporal Keypoint Matching and Refinement Network for Pose Estimation and Tracking (2020) (9)
- Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation (2018) (9)
- Revisiting Deep Image Smoothing and Intrinsic Image Decomposition (2017) (9)
- Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace Boosting (2012) (9)
- Green Generative Modeling: Recycling Dirty Data using Recurrent Variational Autoencoders (2017) (8)
- Multi-Timescale Collaborative Tracking (2017) (8)
- Face and Facial Expression Recognition from Real World Videos (2015) (8)
- Object Affordances Graph Network for Action Recognition (2019) (8)
- gDLS*: Generalized Pose-and-Scale Estimation Given Scale and Gravity Priors (2020) (8)
- Social Interpretable Tree for Pedestrian Trajectory Prediction (2022) (7)
- VideoCut: Removing Irrelevant Frames by Discovering the Object of Interest (2008) (7)
- Loss functions for pose guided person image generation (2022) (7)
- Improving Person Re-Identification With Iterative Impression Aggregation (2020) (7)
- CANNET: Context aware nonlocal convolutional networks for semantic image segmentation (2015) (7)
- Egocentric Object Recognition Leveraging the 3D Shape of the Grasping Hand (2014) (7)
- Proceedings of the 2010 ACM multimedia workshop on Mobile cloud media computing (2010) (7)
- Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference (2021) (6)
- Computer Vision – ECCV 2016 Workshops (2016) (6)
- Loss Functions for Person Image Generation (2020) (6)
- Giant Panda Identification (2021) (5)
- Complementary Attention Gated Network for Pedestrian Trajectory Prediction (2022) (5)
- ACM workshop on mobile cloud media computing (2010) (5)
- The VLSI implementation of a high-resolution depth-sensing SoC based on active structured light (2015) (5)
- Face Recognition by Discriminative Orthogonal Rank-one Tensor Decomposition (2008) (5)
- Semantic Image Synthesis via Efficient Class-Adaptive Normalization (2020) (5)
- Multi-timescale Collaborative Tracking (2016) (4)
- Video Imprint (2019) (4)
- Video Analytics. Face and Facial Expression Recognition and Audience Measurement - Third International Workshop, VAAM 2016, and Second International Workshop, FFER 2016, Cancun, Mexico, December 4, 2016, Revised Selected Papers (2017) (4)
- Graph-based temporal action co-localization from an untrimmed video (2021) (3)
- How to make Face Recognition Work: The Power of Modeling Context (2012) (3)
- Analyzing Structured Deformable Shapes Via Mean Field Monte Carlo (2004) (3)
- Exploring Structure Consistency for Deep Model Watermarking (2021) (3)
- Visual Topic Network: Building better image representations for images in social media (2015) (3)
- Learning Disentangled Classification and Localization Representations for Temporal Action Localization (2022) (3)
- Semantic Probability Distribution Modeling for Diverse Semantic Image Synthesis (2022) (3)
- Mobile Cloud Visual Media Computing: From Interaction to Service (2015) (3)
- Fine-Grained Giant Panda Identification (2020) (3)
- Exploring Discrete Diffusion Models for Image Captioning (2022) (3)
- Dual relation network for temporal action localization (2022) (3)
- Concurrent segmentation of categorized objects from an image collection (2012) (3)
- Multi-scale shared features for cascade object detection (2012) (2)
- Video Analytics. Face and Facial Expression Recognition (2018) (2)
- Semi-supervised Long-tailed Recognition using Alternate Sampling (2021) (2)
- Beyond Visual Attractiveness: Physically Plausible Single Image HDR Reconstruction for Spherical Panoramas (2021) (2)
- Capturing Human Body Motion from Video for Perceptual Interfaces by Sequential Variational MAP (2005) (2)
- Adversarial Attack and Defense in Deep Ranking (2021) (2)
- Breadcrumbs: Adversarial Class-Balanced Sampling for Long-tailed Recognition (2021) (2)
- Action Co-localization in an Untrimmed Video by Graph Neural Networks (2020) (2)
- Attention-driven Egocentric Computer Vision for Robotic Wheelchair Navigation (2016) (2)
- Vision-Based Interaction (2013) (1)
- Hyperspectral Image Segmentation Through Bilayer Graph Based Learning (2014) (1)
- Large-scale video event classification using dynamic temporal pyramid matching of visual semantics (2013) (1)
- Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization (2022) (1)
- Mobile Cloud Visual Media Computing (2015) (1)
- Local to Global Feature Learning for Salient Object Detection (2022) (1)
- Multiple instance boosting with global smoothness regularization (2011) (1)
- Counting Grid Aggregation for Event Retrieval and Recognition (2016) (1)
- Understanding and Predicting The Attractiveness of Human Action Shot (2017) (1)
- Instance Motion Tendency Learning for Video Panoptic Segmentation (2022) (1)
- Deep Style Transfer (2020) (1)
- Probabilistic Elastic Part Model for Real-World Face Recognition (2014) (1)
- Action Coherence Network for Weakly-Supervised Temporal Action Localization (2022) (1)
- Automatic Segmentation of Objects of Interest from an Image (2006) (1)
- Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks (2021) (1)
- Veiled Attributes of the Variational Autoencoder (2017) (1)
- Memory-augmented appearance-motion network for video anomaly detection (2023) (1)
- Discriminative Multiple Target Tracking (2011) (1)
- Video Demo: An Egocentric Vision Based Assistive Co-robot (2013) (1)
- Exploring Action Centers for Temporal Action Localization (2023) (1)
- An Efficient Visual Representation and Indexing Scheme for Large Scale Video Retrieval (2013) (0)
- Introduction to the special section of best papers of ACM multimedia 2011 (2012) (0)
- Modeling Inter- and Intra-Part Deformations for Object Structure Parsing (2015) (0)
- The VLSI implementation of a high-resolution depth-sensing SoC based on active structured light (2015) (0)
- ContextLoc++: A Unified Context Model for Temporal Action Localization (2023) (0)
- Object Cosegmentation in Noisy Videos With Multilevel Hypergraph (2021) (0)
- Revisiting Deep Intrinsic Image Decompositions (Supplementary Material) (2018) (0)
- Semi-online Multi-people Tracking by Re-identification (2020) (0)
- Automatic Extraction of Objects of Interest by Minimizing a Local-Global Variational Energy (2006) (0)
- Practical Order Attack in Deep Ranking (2020) (0)
- Egocentric Computer Vision based Wheelchair Robot Control (2015) (0)
- Boosted Dynamic Neural Networks (2022) (0)
- The Art of Detection (2016) (0)
- Introduction to the special section of best papers of ACM multimedia 2012 (2013) (0)
- Visual Topic Network: Building an Image Representation for Image Recognition in Social Media (2015) (0)
- Priming Deep Pedestrian Detection with Geometric Context (2019) (0)
- Decouple learning framework ... 7 residual blocks (2019) (0)
- Reinforced Pipeline Optimization: Behaving Optimally with Non-Differentiabilities (2018) (0)
- TxVAD: Improved Video Action Detection by Transformers (2022) (0)
- Weakly-guided Self-supervised Pretraining for Temporal Activity Detection (2021) (0)
- Erratum to: Video Analytics (2016) (0)
- Guest editorial: selected papers from ICIMCS 2013 (2015) (0)
- Collaborative Active Learning of a Kernel Machine Ensemble for Visual Classification (2012) (0)
- Representing Multimodal Behaviors With Mean Location for Pedestrian Trajectory Prediction (2023) (0)
- An Integrated Model for Bayesian Learning of Sparse Representation and Classifier Training (2013) (0)
- Probabilistic variational methods for vision based complex motion analysis (2006) (0)
- Exemplar-Guided Similarity Learning on Polynomial Kernel Feature Map for Person Re-identification (2017) (0)
- A Compositional Textual Model for Recognition of Imperfect Word Images (2018) (0)
- Conscious Inference for Object Detection (2018) (0)
- Learning Disentangled Classification and Localization Representations for Temporal Action Localization (2022) (0)
- Monocular Visual-IMU Odometry: A Comparative Evaluation of the Detector-Descriptor Based Methods (2016) (0)
- Progressive Backdoor Erasing via connecting Backdoor and Adversarial Attacks (2022) (0)
- Regularizing Second-Order Influences for Continual Learning (2023) (0)
- Descriptive Visual Words: the Visual Correspondences of Text Words (2009) (0)
- MotionTrack: Learning Robust Short-term and Long-term Motions for Multi-Object Tracking (2023) (0)
- Deep Learning for Video Face Recognition (2021) (0)
- Adversarial Fine-tuning for Backdoor Defense: Connecting Backdoor Attacks to Adversarial Attacks (2022) (0)
- Joint Multi-object Detection and Segmentation from an Untrimmed Video (2020) (0)
- Preface (2019) (0)
- Sparse Pose Trajectory Completion (2021) (0)
- E^2TAD: An Energy-Efficient Tracking-based Action Detector (2022) (0)
What Schools Are Affiliated With Gang Hua?
Gang Hua is affiliated with the following schools: