Qing-ming Huang

Qing-ming Huang's AcademicInfluence.com Rankings

Qing-ming Huang

Computer Science

#4156

World Rank

#4372

Historical Rank

Artificial Intelligence

#936

World Rank

#953

Historical Rank

Database

#1392

World Rank

#1465

Historical Rank

computer-science Degrees

Download Badge

Computer Science

Qing-ming Huang's Degrees

PhD Computer Science University of California, Berkeley
Masters Computer Science University of California, Berkeley
Bachelors Computer Science Tsinghua University

Similar Degrees You Can Earn

Best Online PhD of Computer Science (Doctorates) 2026
Best Online Master's in Computer Science
10 Fastest Accelerated Online Bachelor's of Computer Science
Best Online Bachelor's in Computer Science 2026

Why Is Qing-ming Huang Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Qing-ming Huang's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

CenterNet: Keypoint Triplets for Object Detection (2019) (1381)
The Visual Object Tracking VOT2016 Challenge Results (2016) (702)
Hedged Deep Tracking (2016) (644)
Cascaded Partial Decoder for Fast and Accurate Salient Object Detection (2019) (519)
The Visual Object Tracking VOT2017 Challenge Results (2017) (424)
Fast and robust text detection in images and video frames (2005) (372)
The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking (2018) (347)
The Visual Object Tracking VOT2013 Challenge Results (2013) (333)
Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks (2015) (315)
F3Net: Fusion, Feedback and Focus for Salient Object Detection (2019) (310)
Descriptive visual words and visual phrases for image applications (2009) (255)
Review of Visual Saliency Detection With Comprehensive Information (2018) (242)
Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition (2020) (242)
Image Matching by Normalized Cross-Correlation (2006) (229)
Stacked Cross Refinement Network for Edge-Aware Salient Object Detection (2019) (227)
Global Context-Aware Progressive Aggregation Network for Salient Object Detection (2020) (220)
Multimodal Transformer With Multi-View Visual Representation for Image Captioning (2019) (217)
Measuring visual saliency by Site Entropy Rate (2010) (202)
Saliency Detection for Stereoscopic Images Based on Depth Confidence Analysis and Multiple Cues Fusion (2016) (185)
A configurable method for multi-style license plate recognition (2009) (185)
RAM: A Region-Aware Deep Model for Vehicle Re-Identification (2018) (175)
Using Webcast Text for Semantic Event Detection in Broadcast Sports Video (2008) (162)
VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results (2018) (158)
Joint Source-Channel Rate-Distortion Optimization for H.264 Video Coding Over Error-Prone Networks (2007) (158)
Gradually Vanishing Bridge for Adversarial Domain Adaptation (2020) (156)
Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations (2020) (156)
Less Is More: Picking Informative Frames for Video Captioning (2018) (149)
Label Decoupling Framework for Salient Object Detection (2020) (143)
Building contextual visual vocabulary for large-scale image applications (2010) (139)
Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification (2016) (138)
Human Daily Action Analysis with Multi-view and Color-Depth Data (2012) (133)
The Thermal Infrared Visual Object Tracking VOT-TIR2015 Challenge Results (2015) (132)
Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model (2005) (123)
Thresholding technique with adaptive window selection for uneven lighting image (2005) (121)
Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model (2009) (119)
ASIF-Net: Attention Steered Interweave Fusion Network for RGB-D Salient Object Detection (2020) (119)
F³Net: Fusion, Feedback and Focus for Salient Object Detection (2020) (117)
Joint Feature Selection and Classification for Multilabel Learning (2018) (111)
Hedging Deep Features for Visual Tracking (2019) (111)
Trajectory based event tactics analysis in broadcast sports video (2007) (109)
Going From RGB to RGBD Saliency: A Depth-Guided Transformation Model (2020) (107)
A Useful Visualization Technique: A Literature Review for Augmented Reality and its Application, limitation & future direction (2009) (105)
Affective Visualization and Retrieval for Music Video (2010) (105)
Event Tactic Analysis Based on Broadcast Sports Video (2009) (100)
Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition (2017) (99)
An Iterative Co-Saliency Framework for RGBD Images (2017) (97)
Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification (2020) (93)
Player action recognition in broadcast tennis video with applications to semantic analysis of sports game (2006) (93)
Learning Label Specific Features for Multi-label Classification (2015) (92)
Co-Saliency Detection for RGBD Images Based on Multi-Constraint Feature Matching and Cross Label Propagation (2017) (92)
Blind image quality prediction by exploiting multi-level deep representations (2018) (90)
Deep Unsupervised Convolutional Domain Adaptation (2017) (87)
Improving multi-label classification with missing labels by learning label-specific features (2019) (84)
Reverse Perspective Network for Perspective-Aware Object Counting (2020) (82)
Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval (2018) (82)
Learning Fragment Self-Attention Embeddings for Image-Text Matching (2019) (80)
Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video (2007) (80)
Region-based visual attention analysis with its application in image browsing on small displays (2007) (79)
Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications (2011) (76)
State-Relabeling Adversarial Active Learning (2020) (75)
Cross-media analysis and reasoning: advances and directions (2017) (74)
Detecting Violent Scenes in Movies by Auditory and Visual Cues (2008) (73)
The Visual Object Tracking VOT 2016 Challenge Results (2018) (73)
The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline (2019) (72)
Recognizing human group action by layered model with multiple cues (2014) (72)
CenterNet: Object Detection with Keypoint Triplets (2019) (70)
Video Saliency Detection via Sparsity-Based Reconstruction and Propagation (2019) (70)
A generic virtual content insertion system based on visual attention analysis (2008) (69)
DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection (2020) (69)
Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search (2013) (69)
An effective method to detect and categorize digitized traditional Chinese paintings (2006) (68)
HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images (2018) (68)
Spatiotemporal CNN for Video Object Segmentation (2019) (67)
USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval (2014) (66)
Image classification by non-negative sparse coding, correlation constrained low-rank and sparse decomposition (2014) (65)
Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding (2012) (65)
Affective Image Content Analysis: A Comprehensive Survey (2018) (64)
Multi-label classification by exploiting local positive and negative pairwise label correlation (2017) (64)
HodgeRank on Random Graphs for Subjective Video Quality Assessment (2012) (64)
A Novel Rate Control Technique for Multiview Video Plus Depth Based 3D Video Coding (2011) (62)
Affective MTV analysis based on arousal and valence features (2008) (62)
A Scheme for Ball Detection and Tracking in Broadcast Soccer Video (2005) (62)
Toward Realistic Face Photo–Sketch Synthesis via Composition-Aided GANs (2017) (62)
Social Attribute-Aware Force Model: Exploiting Richness of Interaction for Abnormal Crowd Detection (2015) (60)
Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning (2017) (59)
Online crowdsourcing subjective image quality assessment (2012) (58)
VisDrone-VDT2018: The Vision Meets Drone Video Detection and Tracking Challenge Results (2018) (57)
Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval (2018) (57)
Dual Quaternion Knowledge Graph Embeddings (2021) (57)
Corner Proposal Network for Anchor-free, Two-stage Object Detection (2020) (57)
VisDrone-SOT2019: The Vision Meets Drone Single Object Tracking Challenge Results (2018) (56)
Highlight Summarization in Sports Video Based on Replay Detection (2006) (55)
Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching (2013) (54)
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding (2019) (53)
Automatic Multi-Player Detection and Tracking in Broadcast Sports Video using Support Vector Machine and Particle Filter (2006) (52)
Fast and effective text detection (2008) (52)
Extracting 3D information from broadcast soccer video (2006) (52)
Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval (2013) (52)
Transferring Boosted Detectors Towards Viewpoint and Scene Adaptiveness (2011) (52)
Jersey number detection in sports video for athlete identification (2005) (51)
Abnormal crowd behavior detection based on social attribute-aware force model (2012) (51)
Multi-Level Discriminative Dictionary Learning With Application to Large Scale Image Classification (2015) (49)
Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization (2013) (49)
Error resilience video coding in H.264 encoder with potential distortion tracking (2004) (49)
Action Recognition in Broadcast Tennis Video (2006) (49)
A framework for flexible summarization of racquet sports video using multiple modalities (2009) (48)
Automatic text segmentation from complex background (2004) (48)
Online Deformable Object Tracking Based on Structure-Aware Hyper-Graph (2016) (48)
Split Multiplicative Multi-View Subspace Clustering (2019) (46)
Discrete Probability Distribution Prediction of Image Emotions with Shared Sparse Learning (2020) (45)
Learning to Predict Bus Arrival Time From Heterogeneous Measurements via Recurrent Neural Network (2019) (45)
Image classification using spatial pyramid robust sparse coding (2013) (45)
Geometric Hypergraph Learning for Visual Tracking (2016) (45)
A Low-Cost Very Large Scale Integration Architecture for Multistandard Inverse Transform (2010) (45)
Structure-Aware Local Sparse Coding for Visual Tracking (2018) (45)
Utilizing affective analysis for efficient movie browsing (2009) (45)
Improving particle filter with support vector regression for efficient visual tracking (2005) (43)
Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine (2006) (43)
Exciting event detection in broadcast soccer video with mid-level description and incremental learning (2005) (43)
Random partial paired comparison for subjective video quality assessment via hodgerank (2011) (42)
Playfield detection using adaptive GMM and its application (2005) (42)
Mean-Shift Blob Tracking with Adaptive Feature Selection and Scale Adaptation (2007) (41)
Deep Spatial-Spectral Subspace Clustering for Hyperspectral Image (2021) (40)
Object tracking using incremental 2D-LDA learning and Bayes inference (2008) (40)
Learning Attribute-Specific Representations for Visual Tracking (2019) (40)
Group Activity Recognition by Gaussian Processes Estimation (2010) (40)
Semantically-Based Human Scanpath Estimation with HMMs (2013) (40)
Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations (2020) (39)
PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval (2016) (38)
Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks (2017) (37)
Multimodal Similarity Gaussian Process Latent Variable Model (2017) (36)
The Visual Object Tracking VOT 2017 challenge results (2018) (36)
Effective Multimodality Fusion Framework for Cross-Media Topic Detection (2016) (36)
Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification (2018) (35)
${\rm S}^{3}{\rm MKL}$: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications (2012) (35)
Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation (2014) (35)
Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval (2010) (35)
Multi-feature metric learning with knowledge transfer among semantics and social tagging (2012) (34)
Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos (2005) (34)
SCAN: Spatial and Channel Attention Network for Vehicle Re-Identification (2018) (33)
Image classification using Harr-like transformation of local features with coding residuals (2013) (32)
Automatic sports genre categorization and view-type classification over large-scale dataset (2009) (32)
Contextual Exemplar Classifier-Based Image Representation for Classification (2017) (32)
Video Shot Detection Using Hidden Markov Models with Complementary Features (2006) (32)
SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning (2019) (31)
Set-label modeling and deep metric learning on person re-identification (2015) (31)
Online Asymmetric Similarity Learning for Cross-Modal Retrieval (2017) (31)
Multiple Instance Boost Using Graph Embedding Based Decision Stump for Pedestrian Detection (2008) (31)
Multi-modal semantic autoencoder for cross-modal retrieval (2019) (30)
Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval (2017) (29)
Cross-media topic detection: A multi-modality fusion framework (2013) (29)
Semantic invariant cross-domain image generation with generative adversarial networks (2018) (29)
Unsupervised Texture Classification: Automatically Discover and Classify Texture Patterns (2006) (28)
S3MKL: scalable semi-supervised multiple kernel learning for image data mining (2010) (28)
When to Learn What: Deep Cognitive Subspace Clustering (2018) (28)
Heuristic Domain Adaptation (2020) (27)
Exploring Coherent Motion Patterns via Structured Trajectory Learning for Crowd Mood Modeling (2017) (27)
Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval (2018) (27)
A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning (2017) (27)
Mode mapping method for H.264/AVC spatial downscaling transcoding (2004) (27)
Treat samples differently: Object tracking with semi-supervised online CovBoost (2011) (27)
Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization (2019) (26)
Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting (2020) (26)
Object categorization in sub-semantic space (2014) (26)
Robust evaluation for quality of experience in crowdsourcing (2013) (25)
ObjectPatchNet: Towards scalable and semantic image annotation and retrieval (2014) (25)
JDL at TRECVID 2006 Shot Boundary Detection (2006) (25)
Action Recognition Using Spatial-Temporal Context (2010) (24)
Global-and-Local Collaborative Learning for Co-Salient Object Detection (2022) (24)
Beyond particle flow: Bag of Trajectory Graphs for dense crowd event recognition (2013) (24)
Query sensitive dynamic web video thumbnail generation (2011) (24)
Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks (2015) (24)
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding (2019) (24)
Robust visual tracking via scale-and-state-awareness (2019) (24)
Greedy Gradient Ensemble for Robust Visual Question Answering (2021) (24)
RD-optimized interactive streaming of multiview video with multiple encodings (2010) (23)
Similarity Gaussian Process Latent Variable Model for Multi-modal Data Analysis (2015) (23)
Abnormal event detection in crowded scenes based on Structural Multi-scale Motion Interrelated Patterns (2013) (23)
Rethinking Graph Neural Architecture Search from Message-passing (2021) (23)
Viewpoint and Scale Consistency Reinforcement for UAV Vehicle Re-Identification (2020) (22)
Distributed image understanding with semantic dictionary and semantic expansion (2016) (22)
TINA: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation (2016) (22)
Context-based 2D-VLC for video coding (2004) (22)
Improved error concealment algorithms based on H.264/AVC non-normative decoder (2004) (22)
Web video thumbnail recommendation with content-aware analysis and query-sensitive matching (2014) (22)
Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval (2019) (22)
Image Class Prediction by Joint Object, Context, and Background Modeling (2018) (21)
Beyond visual features: A weak semantic image representation using exemplar classifiers for classification (2013) (21)
Syntax-Guided Hierarchical Attention Network for Video Captioning (2022) (21)
Self-Regulated Learning for Egocentric Video Activity Anticipation (2021) (21)
Unsupervised Web Topic Detection Using A Ranked Clustering-Like Pattern Across Similarity Cascades (2015) (21)
A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video (2007) (21)
An effective multi-clue fusion approach for web video topic detection (2012) (21)
Decomposition and Completion Network for Salient Object Detection (2021) (21)
Naming faces in broadcast news video by image google (2008) (21)
Online Selection of Discriminative Features Using Bayes Error Rate for Visual Tracking (2006) (21)
Iterative Graph Seeking for Object Tracking (2018) (20)
Image classification by search with explicitly and implicitly semantic representations (2017) (20)
Long-Term Video Question Answering via Multimodal Hierarchical Memory Attentive Networks (2021) (20)
Multi-View Spatial Attention Embedding for Vehicle Re-Identification (2021) (20)
Group sensitive Classifier Chains for multi-label classification (2015) (20)
New bi-prediction techniques for B pictures coding [video coding] (2004) (20)
Detecting Small Objects Using a Channel-Aware Deconvolutional Network (2020) (20)
Video2Cartoon: A System for Converting Broadcast Soccer Video into 3D Cartoon Animation (2007) (19)
A New Text Detection Algorithm in Images/Video Frames (2004) (19)
DM2C: Deep Mixed-Modal Clustering (2019) (19)
Learning With Multiclass AUC: Theory and Algorithms (2021) (19)
Friend recommendation according to appearances on photos (2009) (18)
Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval (2017) (18)
Boosted random contextual semantic space based representation for visual recognition (2016) (18)
Nearest-neighbor method using multiple neighborhood similarities for social media data mining (2012) (18)
GOMES: A group-aware multi-view fusion approach towards real-world image clustering (2015) (18)
Replay Detection Based on Semi-automatic Logo Template Sequence Extraction in Sports Video (2007) (18)
Image Matching by Multiscale Oriented Corner Correlation (2006) (17)
Joint multi-view representation and image annotation via optimal predictive subspace learning (2018) (17)
Near-duplicate video matching with transformation recognition (2009) (17)
Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval (2017) (17)
Video2Cartoon: generating 3D cartoon from broadcast soccer video (2005) (17)
VisDrone-MOT2019: The Vision Meets Drone Multiple Object Tracking Challenge Results (2019) (17)
A Recursive Constrained Framework for Unsupervised Video Action Clustering (2020) (17)
Key Techniques of Bit Rate Reduction for H.264 Streams (2004) (17)
Online selection of the best k-feature subset for object tracking (2012) (17)
Person Re-Identification by Semantic Region Representation and Topology Constraint (2018) (17)
Location-Sensitive Visual Recognition with Cross-IOU Loss (2021) (16)
Online web video topic detection and tracking with semi-supervised learning (2016) (16)
Unsupervised sports video scene clustering and its applications to story units detection (2005) (16)
ALID: Scalable Dominant Cluster Detection (2014) (16)
A Multiple Targets Appearance Tracker Based on Object Interaction Models (2012) (15)
Augmented Adversarial Training for Cross-Modal Retrieval (2021) (15)
A new method to calculate the camera focusing area and player position on playfield in soccer video (2005) (15)
Joint Multi-View Representation Learning and Image Tagging (2016) (15)
A pixel-wise local information-based background subtraction approach (2008) (15)
Quaternion-Based Knowledge Graph Network for Recommendation (2020) (15)
Matching images more efficiently with local descriptors (2008) (14)
Hierarchical deep semantic representation for visual categorization (2017) (14)
Cross-modal Retrieval by Real Label Partial Least Squares (2016) (14)
Human group activity analysis with fusion of motion and appearance information (2011) (14)
MOCC: A Fast and Robust Correlation-Based Method for Interest Point Matching under Large Scale Changes (2010) (14)
Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment (2022) (14)
Event tactic analysis based on player and ball trajectory in broadcast video (2008) (14)
A hybrid text segmentation approach (2009) (14)
Deep Stereoscopic Image Super-Resolution via Interaction Module (2020) (14)
Online multiple object tracking via exchanging object context (2018) (14)
Modeling spatial and semantic cues for large-scale near-duplicated image retrieval (2011) (14)
Exploiting sample correlation for crowd counting with multi-expert network (2021) (14)
Harmonized Multimodal Learning with Gaussian Process Latent Variable Models (2019) (13)
Image classification using boosted local features with random orientation and location selection (2015) (13)
A Generic Approach for Systematic Analysis of Sports Videos (2012) (13)
From Social to Individuals: A Parsimonious Path of Multi-Level Models for Crowdsourced Preference Aggregation (2018) (13)
Coarse-to-fine video text detection (2008) (13)
Video saliency prediction with optimized optical flow and gravity center bias (2016) (13)
Spatial-temporal attention analysis for home video (2008) (13)
Relative image similarity learning with contextual information for Internet cross-media retrieval (2014) (13)
Detection and location of near-duplicate video sub-clips by finding dense subgraphs (2011) (13)
Representing dense crowd patterns using bag of trajectory graphs (2014) (13)
Low-delay View Random Access for Multi-view Video Coding (2007) (13)
Online low-rank similarity function learning with adaptive relative margin for cross-modal retrieval (2017) (13)
Webpage saliency prediction with multi-features fusion (2016) (13)
Nearest-neighbor classification using unlabeled data for real world image application (2010) (13)
Compression-Induced Rendering Distortion Analysis for Texture/Depth Rate Allocation in 3D Video Compression (2009) (13)
DMVOS: Discriminative Matching for Real-time Video Object Segmentation (2020) (13)
Aesthetic composition represetation for portrait photographing recommendation (2012) (13)
A Simulation Analysis on the Existence of Network Traffic Flow Equilibria (2014) (13)
A Survey on Visual Human Action Recognition: A Survey on Visual Human Action Recognition (2014) (12)
Exploring Outliers in Crowdsourced Ranking for QoE (2017) (12)
Fast Batch Nuclear-norm Maximization and Minimization for Robust Domain Adaptation (2021) (12)
Generating Video Sequence from Photo Image for Mobile Screens by Content Analysis (2007) (12)
Weakly Supervised Bilinear Attention Network for Fine-Grained Visual Classification (2018) (12)
A Hierarchical CNN-RNN Approach for Visual Emotion Classification (2019) (12)
Multi-View Multi-Label Learning With View-Label-Specific Features (2019) (12)
When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC (2021) (12)
People re-detection using Adaboost with sift and color correlogram (2008) (12)
Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association (2021) (12)
Viewpoint switching in multiview video streaming (2005) (12)
Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection (2020) (12)
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning (2020) (12)
Vicept: link visual features to concepts for large-scale image understanding (2010) (11)
Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval (2020) (11)
Personalized MTV Affective Analysis Using User Profile (2008) (11)
Multiview Video Coding Based on Global Motion Model (2004) (11)
SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently (2013) (11)
Joint image representation and classification in random semantic spaces (2015) (11)
Online HodgeRank on Random Graphs for Crowdsourceable QoE Evaluation (2014) (11)
Deep Constrained Low-Rank Subspace Learning for Multi-View Semi-Supervised Classification (2019) (11)
Facial Landmarks Detection by Self-Iterative Regression based Landmarks-Attention Network (2018) (11)
Visual perception based Lagrangian rate distortion optimization for video coding (2011) (11)
Training Efficient Saliency Prediction Models with Knowledge Distillation (2019) (11)
Release the Power of Online-Training for Robust Visual Tracking (2020) (11)
Bilevel Multiview Latent Space Learning (2018) (11)
Building pair-wise visual word tree for efficent image re-ranking (2010) (11)
Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending (2020) (10)
Long Short-Term Relation Transformer With Global Gating for Video Captioning (2022) (10)
Semantic-aware Hashing for Social Image Retrieval (2015) (10)
Does Thermal Really Always Matter for RGB-T Salient Object Detection? (2022) (10)
Depth image segmentation for improved virtual view image quality in 3-DTV (2007) (10)
LSH-based semantic dictionary learning for large scale image understanding (2015) (10)
Shot classification for action movies based on motion characteristics (2008) (10)
Summarization in Soccer Video based on Goalmouth Detection (2006) (10)
Composition-aided Sketch-realistic Portrait Generation (2017) (10)
Conditional GAN based individual and global motion fusion for multiple object tracking in UAV videos (2020) (10)
Robust copy detection by mining temporal self-similarities (2009) (10)
Matching Content-based Saliency Regions for partial-duplicate image retrieval (2011) (10)
Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval (2015) (10)
Learning Coupled Convolutional Networks Fusion for Video Saliency Prediction (2019) (10)
Interpretable Visual Reasoning via Probabilistic Formulation Under Natural Supervision (2020) (10)
CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection (2022) (10)
Laplacian affine sparse coding with tilt and orientation consistency for image classification (2013) (10)
Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model (2015) (10)
Online Discriminative Structured Output SVM Learning for Multi-Target Tracking (2014) (9)
Multimodal Entity Linking: A New Dataset and A Baseline (2021) (9)
Fusing cross-media for topic detection by dense keyword groups (2015) (9)
Semi-Autoregressive Image Captioning (2021) (9)
Weighted visual vocabulary to balance the descriptive ability on general dataset (2013) (9)
Visual-aural attention modeling for talk show video highlight detection (2008) (9)
Learning Self-Supervised Space-Time CNN for Fast Video Style Transfer (2021) (9)
Multi-order visual phrase for scalable image search (2013) (9)
Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval (2014) (9)
Geometry Interaction Knowledge Graph Embeddings (2022) (9)
Pornographic Image Detection Based on Multilevel Representation (2009) (9)
i.MTV: an integrated system for mtv affective analysis (2008) (9)
Monocular Tracking 3D People By Gaussian Process Spatio-Temporal Variable Model (2007) (9)
Subjective evaluation criterion for selecting affective features and modeling highlights (2006) (9)
Learning Deep Convolutional Neural Networks for Places2 Scene Recognition (2015) (9)
Adversarial Preference Learning with Pairwise Comparisons (2019) (9)
Style-adaptive photo aesthetic rating via convolutional neural networks and multi-task learning (2020) (8)
Visual ContextRank for web image re-ranking (2009) (8)
Topic detection in cross-media: a semi-supervised co-clustering approach (2014) (8)
A Fast Intra Mode Decision Algorithm for AVS to H.264 Transcoding (2006) (8)
Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization (2018) (8)
Extracting Story Units in Sports Video Based on Unsupervised Video Scene Clustering (2006) (8)
Visual Ontology Construction for Digitized Art Image Retrieval (2005) (8)
Online Deformable Object Tracking Based on Structure-Aware Hyper-Graph. (2016) (8)
Advertise gently - in-image advertising with low intrusiveness (2009) (8)
Robust Latent Poisson Deconvolution From Multiple Features for Web Topic Detection (2016) (8)
A scheme for racquet sports video analysis with the combination of audio-visual information (2005) (8)
Justifying the Importance of Color Cues in Object Detection: A Case Study on Pedestrian (2013) (8)
Towards More Explainability: Concept Knowledge Mining Network for Event Recognition (2020) (8)
Reverse Densely Connected Feature Pyramid Network for Object Detection (2018) (8)
Beyond global fusion: A group-aware fusion approach for multi-view image clustering (2019) (8)
Vehicle Detection in UAV Traffic Video Based on Convolution Neural Network (2018) (8)
Hierarchical Modular Network for Video Captioning (2021) (8)
CNN-MR for No Reference Video Quality Assessment (2017) (8)
Multiple Kernel Learning with High Order Kernels (2010) (8)
Advances in Multimedia Information Processing – PCM 2017 (2017) (8)
Reducing Spatial Resolution for MPEG-2 to H.264/AVC Transcoding (2005) (8)
A novel rate control scheme for video streaming over wireless networks (2004) (8)
Saliency-Based Spatiotemporal Attention for Video Captioning (2018) (7)
Implicit Feedbacks are Not Always Favorable: Iterative Relabeled One-Class Collaborative Filtering against Noisy Interactions (2021) (7)
Categorizing Social Multimedia by Neighborhood Decision Using Local Pairwise Label Correlation (2014) (7)
Cascade Category-Aware Visual Search (2014) (7)
Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval (2015) (7)
Pedestrian detection via logistic multiple instance boosting (2008) (7)
The Demo: A Real-Time Score Detection and Recognition Approach in Broadcast Basketball Sports Video (2007) (7)
Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence (2021) (7)
Cross-media topic detection associated with hot search queries (2013) (7)
A Rotation Invariant Descriptor for Robust Video Copy Detection (2013) (7)
Self-Supervised Deep TripleNet for Video Object Segmentation (2021) (7)
Click data guided query modeling with click propagation and sparse coding (2018) (7)
HodgeRank With Information Maximization for Crowdsourced Pairwise Ranking Aggregation (2017) (7)
Collaborative Preference Embedding against Sparse Labels (2019) (7)
LVE-S2D: Low-Light Video Enhancement From Static to Dynamic (2022) (7)
Fast copy detection based on Slice Entropy Scattergraph (2010) (7)
Multimodal Gaussian Process Latent Variable Models with Harmonization (2017) (7)
Structure-aware multi-object discovery for weakly supervised tracking (2014) (7)
Rethinking Graph Neural Network Search from Message-passing (2021) (7)
Deep Affine Motion Compensation Network for Inter Prediction in VVC (2021) (7)
Crowd video retrieval via deep attribute-embedding graph ranking (2016) (7)
Learning image Vicept description via mixed-norm regularization for large scale semantic image search (2011) (6)
Automatic video genre categorization and event detection techniques on large-scale sports data (2010) (6)
Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID (2020) (6)
Label Correlation Guided Deep Multi-View Image Annotation (2019) (6)
From Common to Special: When Multi-Attribute Learning Meets Personalized Opinions (2017) (6)
Composition-Aided Face Photo-Sketch Synthesis (2017) (6)
Linear transform based motion compensated prediction for luminance intensity changes (2005) (6)
Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis (2020) (6)
Nearest Neighbor Classifier Embedded Network for Active Learning (2021) (6)
Multi-view Video Coding with Flexible View-Temporal Prediction Structure for Fast Random Access (2006) (6)
Sports Video Analysis: From Semantics to Tactics (2009) (6)
Structured Stochastic Recurrent Network for Linguistic Video Prediction (2019) (6)
Channel-wise Temporal Attention Network for Video Action Recognition (2019) (6)
ER: Equivariance Regularizer for Knowledge Graph Completion (2022) (6)
A Margin-based MLE for Crowdsourced Partial Ranking (2018) (6)
Mining Information of Attack-Defense Status from Soccer Video Based on Scene Analysis (2007) (6)
Generalized Zero-Shot Video Classification via Generative Adversarial Networks (2020) (6)
Undo the codebook bias by linear transformation for visual applications (2013) (6)
Novel observation model for probabilistic object tracking (2010) (6)
CenterNet++ for Object Detection (2022) (6)
Intra- and Inter-modal Multilinear Pooling with Multitask Learning for Video Grounding (2020) (6)
Symmetric segment-based stereo matching of motion blurred images with illumination variations (2008) (6)
MULTFRC-LERD: An Improved Rate Control Scheme for Video Streaming over Wireless (2004) (6)
Personalized online video recommendation by neighborhood score propagation based global ranking (2009) (5)
Who to Ask: An Intelligent Fashion Consultant (2018) (5)
DA-CCD: A novel action representation by Deep Architecture of local depth feature (2014) (5)
Coarse-to-Fine Dissolve Detection Based on Image Quality Assessment (2013) (5)
The third eye: mining the visual cognition across multi-language communities (2010) (5)
Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data (2021) (5)
Diverter-Guider Recurrent Network for Diverse Poems Generation from Image (2020) (5)
Recognizing Realistic Action Using Contextual Feature Group (2013) (5)
WIKI-CMR: A web cross modality dataset for studying and evaluation of cross modality retrieval models (2013) (5)
Fine-Grained Image Quality Assessment: A Revisit and Further Thinking (2021) (5)
Cross media topic analytics based on synergetic content and user behavior modeling (2014) (5)
Toward Understanding and Boosting Adversarial Transferability From a Distribution Perspective (2022) (5)
Motion Based Perceptual Distortion and Rate Optimization for Video Coding (2012) (5)
Bridging the gap between objective score and subjective preference in video quality assessment (2010) (5)
Saliency detection with two-level fully convolutional networks (2017) (5)
Lower attentive region detection for virtual content insertion in broadcast video (2008) (5)
Self-balance Motion and Appearance Model for Multi-object Tracking in UAV (2019) (5)
A two-step approach to describing web topics via probable keywords and prototype images from background-removed similarities (2018) (5)
A novel FGS base-layer encoding model and weight-based rate adaptation for constant-quality streaming (2004) (5)
Accurate and efficient cross-domain visual matching leveraging multiple feature representations (2013) (5)
Task-Feature Collaborative Learning with Application to Personalized Attribute Prediction (2020) (5)
Memory matrix: a novel user experience for home video (2010) (5)
Structural Semantic Adversarial Active Learning for Image Captioning (2020) (5)
Pareto Optimality for Fairness-constrained Collaborative Filtering (2021) (5)
Formation Period Matters: Towards Socially Consistent Group Detection via Dense Subgraph Seeking (2015) (5)
Image Inpainting Based on Multi-frequency Probabilistic Inference Model (2020) (5)
A model-based demand-balancing control for dynamically divided multiple urban subnetworks (2016) (4)
Socio-mobile landmark recognition using local features with adaptive region selection (2016) (4)
A Novel Framework for Web Video Thumbnail Generation (2012) (4)
ObjectBook construction for large-scale semantic-aware image retrieval (2011) (4)
Multi-Stream Region Proposal Network for Pedestrian Detection (2018) (4)
I2Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning (2022) (4)
News video story sentiment classification and ranking (2011) (4)
Weakly Supervised Local Attention Network for Fine-Grained Visual Classification (2018) (4)
Multi-description of local interest point for partial-duplicate image retrieval (2010) (4)
Task-distribution-aware Meta-learning for Cold-start CTR Prediction (2020) (4)
Improving Image Distance Metric Learning by Embedding Semantic Relations (2012) (4)
An efficient occlusion detection method to improve object trackers (2013) (4)
Local-binarized very deep residual network for visual categorization (2020) (4)
Beyond visual word ambiguity: Weighted local feature encoding with governing region (2014) (4)
Graph-Density-based visual word vocabulary for image retrieval (2014) (4)
A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC (2017) (4)
Coupling Reranking and Structured Output SVM Co-Train for Multitarget Tracking (2016) (4)
An Edge-Based Median Filtering Algorithm with Consideration of Motion Vector Reliability for Adaptive Video Deinterlacing (2006) (4)
Content-Based Video Semantic Analysis (2009) (4)
Image-regulated graph topic model for cross-media topic detection (2015) (4)
Human reappearance detection based on on-line learning (2008) (4)
Improving cross-modal correlation learning with hyperlinks (2015) (4)
Beyond bag of words: image representation in sub-semantic space (2013) (4)
Self-calibration Based 3D Information Extraction and Application in Broadcast Soccer Video (2006) (4)
Multi-label double-layer learning for cross-modal retrieval (2018) (4)
Proposal Complementary Action Detection (2020) (4)
Online learning affinity measure with CovBoost for multi-target tracking (2015) (4)
Learning Sparse Prototypes for Crowd Perception via Ensemble Coding Mechanisms (2014) (3)
SIEV-Net: A Structure-Information Enhanced Voxel Network for 3D Object Detection From LiDAR Point Clouds (2022) (3)
Large scale image understanding with non-convex multi-task learning (2014) (3)
Stereoscopic Image Retargeting Based on Deep Convolutional Neural Network (2021) (3)
Beyond appearance model: Learning appearance variations for object tracking (2016) (3)
Highlight Ranking for Racquet Sports Video in User Attention Subspaces Based on Relevance Feedback (2007) (3)
Attribute Group Editing for Reliable Few-shot Image Generation (2022) (3)
Rotative maximal pattern: A local coloring descriptor for object classification and recognition (2017) (3)
Advances in Multimedia Information Processing – PCM 2017 (2017) (3)
Fast common visual pattern detection via radiate geometric model (2011) (3)
Abnormal Event Detection Based on Multi-scale Markov Random Field (2015) (3)
Spatial-temporal video browsing for mobile environment based on visual attention analysis (2009) (3)
Evaluating Visual Properties via Robust HodgeRank (2021) (3)
Deep Robust Subjective Visual Property Prediction in Crowdsourcing (2019) (3)
Adaptive Moving Cast Shadow Detection (2013) (3)
Video Anomaly Detection Using Open Data Filter and Domain Adaptation (2020) (3)
Multi-Attention Network for Compressed Video Referring Object Segmentation (2022) (3)
Effective algorithms for fast transcoding of AVS to H.264/AVC in the spatial domain (2007) (3)
Visual Saliency and Distortion Weighting Based Video Quality Assessment (2012) (3)
Macroblock-level Reduced Resolution Video Coding Allowing Adaptive DCT Coefficients Selection (2007) (3)
Face Distortion Recovery Based on Online Learning Database for Conversational Video (2014) (3)
Effective scene matching with local feature representatives (2008) (3)
Meta-Wrapper: Differentiable Wrapping Operator for User Interest Selection in CTR Prediction (2021) (3)
Cross-media retrieval with semantics clustering and enhancement (2017) (3)
Accelerate convolutional neural networks for binary classification via cascading cost-sensitive feature (2016) (3)
Set-based classification for person re-identification utilizing mutual-information (2013) (3)
Dist-PU: Positive-Unlabeled Learning from a Label Distribution Perspective (2022) (3)
Graph Regularized Encoder-Decoder Networks for Image Representation Learning (2021) (3)
Theoretical analysis of learning local anchors for classification (2012) (3)
Learning Personalized Attribute Preference via Multi-task AUC Optimization (2019) (3)
Robust real-time transmission of scalable multimedia for heterogeneous client bandwidths (2005) (3)
Edge-featured Graph Neural Architecture Search (2021) (3)
Attention Based Album Slideshow (2010) (3)
Online dictionary learning for Local Coordinate Coding with Locality Coding Adaptors (2015) (3)
Deep Partial Rank Aggregation for Personalized Attributes (2021) (2)
A generic approach to classify sports video shots and its application in event detection (2009) (2)
Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection (2021) (2)
Joint learning for side information and correlation model based on linear regression model in distributed video coding (2009) (2)
Event based news video people classification and ranking using multimodality features (2010) (2)
Coupling Multiple Alignments and Re-ranking for Low-Latency Online Multi-target Tracking (2014) (2)
Justify role of Similarity Diffusion Process in cross-media topic ranking: an empirical evaluation (2017) (2)
Action recognition using trajectories of spatio-tempral feature points (2014) (2)
Two-stream deep sparse network for accurate and efficient image restoration (2020) (2)
Undoing the codebook bias by linear transformation with sparsity and F-norm constraints for image classification (2014) (2)
A System for Automatic Generation of Music Sports-Video (2005) (2)
Video frame prediction with dual-stream deep network emphasizing motions and content details (2022) (2)
Metric based on multi-order spaces for cross-modal retrieval (2017) (2)
iSplit LBI: Individualized Partial Ranking with Ties via Split LBI (2019) (2)
Learning Unified Embeddings for Recommendation via Meta-path Semantics (2021) (2)
Edge Guided Generation Network for Video Prediction (2018) (2)
The Minority Matters: A Diversity-Promoting Collaborative Metric Learning Algorithm (2022) (2)
Semantic Manifold Alignment in Visual Feature Space for Zero-Shot Learning (2018) (2)
RGB-D Human Matting: A Real-World Benchmark Dataset and A Baseline Method (2023) (2)
Monocular Tracking 3 D People with Back Constrained Scaled Gaussian Process Latent Variable Models (2007) (2)
Fast and Accurately Measuring Crack Width via Cascade Principal Component Analysis (2019) (2)
When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking (2021) (2)
Optimum End-to-End Distortion Estimation for Error Resilient Video Coding (2004) (2)
FNet: Fusion, Feedback and Focus for Salient Object Detection (2020) (2)
Adaptive Sharing for Image Classification (2015) (2)
Robust Statistical Ranking: Theory and Algorithms (2014) (2)
Local Laplacian Coding From Theoretical Analysis of Local Coding Schemes for Locally Linear Classification (2015) (2)
A Novel Story Unit Segmentation Algorithm Avoiding Voice Cutting (2007) (2)
Poisoning Attack Against Estimating From Pairwise Comparisons (2021) (2)
From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks (2016) (2)
Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer (2019) (2)
Two Birds With One Stone: A Coupled Poisson Deconvolution for Detecting and Describing Topics From Multimodal Web Data (2019) (2)
Online Learning Based Face Distortion Recovery for Conversational Video Coding (2013) (2)
Multi-order visual phrase for scalable partial-duplicate visual search (2015) (2)
User Attention Analysis Based Video Summarization and Highlight Ranking: User Attention Analysis Based Video Summarization and Highlight Ranking (2009) (2)
Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding (2017) (2)
Moving Object Segmentation: A Block-Based Moving Region Detection Approach (2004) (2)
S2L: Single-Streamline For Complex Video Event Detection (2018) (2)
C2FNet: A Coarse-to-Fine Network for Multi-View 3D Point Cloud Generation (2022) (2)
A close-up detection method for movies (2010) (1)
Span-based Audio-Visual Localization (2022) (1)
Error-resistance and Low-complexity Integer Inverse Discrete Cosine Transform (2010) (1)
Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval (2022) (1)
A Sparse-Motif Ensemble Graph Convolutional Network against Over-smoothing (2022) (1)
Latent influence propagation on dynamic networks (2015) (1)
Not All Samples are Trustworthy: Towards Deep Robust SVP Prediction (2020) (1)
How Functions Evolve in Deep Convolutional Neural Network (2018) (1)
Semantic Editing On Segmentation Map Via Multi-Expansion Loss (2020) (1)
@ICT: attention-based virtual content insertion (2012) (1)
Sports video summarization and adaptation for application in mobile communication (2006) (1)
Polysemious visual representation based on feature aggregation for large scale image applications (2014) (1)
MaxMatch: Semi-Supervised Learning With Worst-Case Consistency (2022) (1)
Story Unit Segmentation with Friendly Acoustic Perception (2007) (1)
JEREMIE: Joint Semantic Feature Learning via Multi-relational Matrix Completion (2017) (1)
Efficient lp-norm multiple feature metric learning for image categorization (2011) (1)
Cross community news event summary generation based on collaborative ranking (2012) (1)
Neural Collaborative Preference Learning With Pairwise Comparisons (2020) (1)
Using timing to detect horror shots in horror movies (2007) (1)
Introduction to the Special Issue on Fine-Grained Visual Recognition and Re-Identification (2022) (1)
Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data (2021) (1)
A Fast Approach for Natural Image Matting using Structure Information (2007) (1)
Learning-to-Share Based on Finding Groups for Large Scale Image Classification (2013) (1)
Optimizing Two-way Partial AUC with an End-to-end Framework (2022) (1)
Who Likes What? - SplitLBI in Exploring Preferential Diversity of Ratings (2020) (1)
Fusing multi-cues description for partial-duplicate image retrieval (2014) (1)
A Two-Stage Approach to Highlight Extraction in Sports Video by Using AdaBoost and Multi-modal (2008) (1)
Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID (2022) (1)
Embedded Packetization Framework for Layered Multiple Description Coding (2004) (1)
Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation (2021) (1)
Strategy for aesthetic photography recommendation via collaborative composition model (2015) (1)
Transfer pedestrian detector towards view-adaptiveness and efficiency (2009) (1)
What to Select: Pursuing Consistent Motion Segmentation from Multiple Geometric Models (2021) (1)
Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments (2022) (1)
Color Maximal-Dissimilarity Pattern for pedestrian detection (2012) (1)
Drift-compensated coding optimization for fast bit-rate reduction transcoding (2007) (1)
FEC-based multiple description coding for heterogeneous client bandwidths (2004) (1)
Real-time interactive multi-target tracking using kernel-based trackers (2010) (1)
Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification (2023) (1)
Stochastic boosting for large-scale image classification (2013) (1)
Web topic detection using a ranked clustering-like pattern across similarity cascades (2014) (1)
A fast intra 4×4 mode decision algorithm for H.264/AVC down rate transcoding (2010) (1)
Rethinking Collaborative Metric Learning: Toward an Efficient Alternative Without Negative Sampling (2022) (1)
Active Sampling for Subjective Video Quality Assessment (2018) (1)
Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer (2022) (1)
Online web video topic detection and tracking with semi-supervised learning (2013) (1)
Transferrable Referring Expression Grounding with Concept Transfer and Context Inheritance (2020) (1)
A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments (2020) (1)
Fine-Grained Image Classification Using Color Exemplar Classifiers (2013) (1)
Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability (2022) (0)
Interactive event detection in crowd scenes (2012) (0)
A Study of Neural Collapse Phenomenon: Grassmannian Frame, Symmetry, Generalization (2023) (0)
HIGHLIGHTRANKINGFORRACQUET SPORTSVIDEOINUSERATTENTION SUBSPACESBASEDON RELEVANCE FEEDBACK (2007) (0)
Action Category and Phase Consistency Regularization for High-Quality Temporal Action Proposal Generation (2021) (0)
Content-based intelligent video recorder with its implementation on sports video (2011) (0)
Self Supervised Progressive Network for High Performance Video Object Segmentation. (2022) (0)
Online Vicept learning for web-scale image understanding (2011) (0)
A REAL-TIMESCORE DETECTIONAND RECOGNITIONAPPROACH FOR BROADCAST BASKETBALLVIDEO (2007) (0)
Online multi-target tracking via depth range segmentation (2017) (0)
Accurate and efficient cross-domain visual matching leveraging multiple feature representations (2013) (0)
Language Attention Proposal Attention + Training Inference man in white on the left holding a bat Subject Location Context Input query Input image (2019) (0)
Spatio-temporal Visual Distortion and Rate Optimization for Video Coding (2012) (0)
Confederated Learning: Going Beyond Centralization (2022) (0)
A Unified Framework against Topology and Class Imbalance (2022) (0)
Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution (2018) (0)
Active Perception Network for Salient Object Detection (2019) (0)
Cross Concept Local Fisher Discriminant Analysis for Image Classification (2013) (0)
Tri-level Combination for Image Representation (2016) (0)
ZS-SBPRnet: A Zero-Shot Sketch-Based Point Cloud Retrieval Network Based on Feature Projection and Cross-Reconstruction (2022) (0)
OTKGE: Multi-modal Knowledge Graph Embeddings via Optimal Transport (2022) (0)
Fixation guided network for salient object detection (2021) (0)
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop (2018) (0)
Web video thumbnail recommendation with content-aware analysis and query-sensitive matching (2013) (0)
Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image (2022) (0)
Sharing model with multi-level feature representations (2014) (0)
The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline (2019) (0)
Progressive Multi-resolution Loss for Crowd Counting (2022) (0)
CSCNet: A Shallow Single Column Network for Crowd Counting (2020) (0)
Robust latent poisson deconvolution from multiple imperfect features for web topic detection (2016) (0)
Inferential Visual Question Generation (2022) (0)
Online learning af fi nity measure with CovBoost for multi-target tracking (2015) (0)
View Sequence Coding using Warping-based Image Alignment for Multiview Video (2006) (0)
General Greedy De-bias Learning (2021) (0)
DMVOS (2020) (0)
Justify role of Similarity Diffusion Process in cross-media topic ranking: an empirical evaluation (2017) (0)
Intra- and Inter-modal Multilinear Pooling with Multitask Learning for Video Grounding (2020) (0)
Rethinking Label Flipping Attack: From Sample Masking to Sample Thresholding (2023) (0)
Consistency-Aware Anchor Pyramid Network for Crowd Localization (2022) (0)
Two-Stream Sparse Network for Accurate Image Super-Resolution (2019) (0)
Topic detection in cross-media: a semi-supervised co-clustering approach (2014) (0)
Automatic Relation-aware Graph Network Proliferation (2022) (0)
Recurrent Meta-Learning against Generalized Cold-start Problem in CTR Prediction (2022) (0)
The Face Object based HEVC System for Video Call (2015) (0)
Relative image similarity learning with contextual information for Internet cross-media retrieval (2013) (0)
Modeling Long-Range Dependencies and Epipolar Geometry for Multi-View Stereo (2023) (0)
THE DEMO:A REAL-TIMESCOREDETECTIONAND RECOGNITIONAPPROACH IN BROADCAST BASKETBALLSPORTSVIDEO (2007) (0)
Deep neural networks for emerging multimedia computing and applications (2020) (0)
Asymptotically Unbiased Instance-wise Regularized Partial AUC Optimization: Theory and Algorithm (2022) (0)
Quaternion Ordinal Embedding (2022) (0)
DVCFlow: Modeling Information Flow Towards Human-like Video Captioning (2021) (0)
Learning Linguistic Association Towards Efficient Text-Video Retrieval (2022) (0)
Descriptive VisualWords: the Visual Correspondences of Text Words (2009) (0)
Visual Object Tracking using Sparse Representation and Interest Points in a Double Step Approach (2020) (0)
Proceedings of the 2nd International Conference on Internet Multimedia Computing and Service, ICIMCS'10 (2010) (0)
Optimizing Partial Area Under the Top-k Curve: Theory and Practice (2022) (0)
On Discriminability and Diversity in Domain Adaptation (2021) (0)
Pay Attention to Your Positive Pairs: Positive Pair Aware Contrastive Knowledge Distillation (2022) (0)
Automatic Shadow Generation via Exposure Fusion (2023) (0)
Highlight Ranking for Broadcast Tennis Video Based on Multi-modality Analysis and Relevance Feedback (2008) (0)
Domain Specific and Idiom Adaptive Video Summarization (2019) (0)
Recurrent Interaction Network for Stereoscopic Image Super-Resolution (2023) (0)
Regularized topic-aware latent influence propagation in dynamic relational networks (2019) (0)
Discriminative Spatial Codebook Generation for Image Classification (2013) (0)
Click data guided query modeling with click propagation and sparse coding (2018) (0)
Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation (2023) (0)
Two-Stage Polishing Network for Camouflaged Object Detection (2021) (0)
Efficient Cross-Modal Retrieval Using Social Tag Information Towards Mobile Applications (2017) (0)
Spatial-Temporal Graph Network for Video Crowd Counting (2023) (0)
Bandwidth adaptive quality smoothing for unequal error protected scalable video streaming (2005) (0)
Correction to: Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation (2023) (0)
OpenAUC: Towards AUC-Oriented Open-Set Recognition (2022) (0)
Cross modal metric learning with multi-level semantic relevance (2014) (0)
MININGINFORMATION OF ATTACK-DEFENSESTATUSFROM SOCCER VIDEOBASEDON SCENEANALYSIS (2007) (0)
Video Shrinking by Auditory and Visual Cues (2009) (0)
Multi-order visual phrase for scalable partial-duplicate visual search (2014) (0)
Increasing Interpretation of Web Topic Detection via Prototype Learning From Sparse Poisson Deconvolution (2019) (0)
Estimating the value of θ in the intra frame for ρ-domain rate control algorithms (2009) (0)
Multi-modal Multi-grained Embedding Learning for Generalized Zero-Shot Video Classification (2023) (0)
Graph-Based Structural Deep Spectral-Spatial Clustering for Hyperspectral Image (2023) (0)
The 2nd International Conference on Internet Multimedia Computing and Service, ICIMCS'10: Preface (2010) (0)
Weakly Supervised Anomaly Detection in Videos Considering the Openness of Events (2022) (0)
Weakly supervised cross-view action recognition via sequential motion accumulation (2014) (0)
A Tale of HodgeRank and Spectral Method: Target Attack Against Rank Aggregation is the Fixed Point of Adversarial Game (2022) (0)
@ICT: attention-based virtual content insertion (2011) (0)
DBAM: Dense Boundary and Actionness Map for Action Localization in Videos via Sentence Query (2021) (0)
CRNet: Collaborative Refinement Network for Self-Supervised Video Object Segmentation (2022) (0)
Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition (2022) (0)
One-Shot Example Videos Localization Network for Weakly-Supervised Temporal Action Localization (2021) (0)
Localized Image Matte Evaluation by Gradient Correlation (2010) (0)
Message from the DIKW 2021 Program Chairs (2021) (0)
Human tracking by structured body parts (2011) (0)
Representing dense crowd patterns using bag of trajectory graphs (2014) (0)
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection (2022) (0)
AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems (2022) (0)
Enhanced Semantic Head for Cascade Instance Segmentation (2022) (0)

This paper list is powered by the following services:

Qing-ming Huang's Academic­Influence.com Rankings

Qing-ming Huang's Degrees

Similar Degrees You Can Earn

Why Is Qing-ming Huang Influential?

Qing-ming Huang's Published Works

Published Works

Qing-ming Huang's AcademicInfluence.com Rankings