Qing-ming Huang
#110,438
Most Influential Person Now
Qing-ming Huang's AcademicInfluence.com Rankings
Qing-ming Huangcomputer-science Degrees
Computer Science
#4156
World Rank
#4372
Historical Rank
Artificial Intelligence
#936
World Rank
#953
Historical Rank
Database
#1392
World Rank
#1465
Historical Rank

Download Badge
Computer Science
Qing-ming Huang's Degrees
- PhD Computer Science University of California, Berkeley
- Masters Computer Science University of California, Berkeley
- Bachelors Computer Science Tsinghua University
Similar Degrees You Can Earn
Why Is Qing-ming Huang Influential?
(Suggest an Edit or Addition)Qing-ming Huang's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- CenterNet: Keypoint Triplets for Object Detection (2019) (1381)
- The Visual Object Tracking VOT2016 Challenge Results (2016) (702)
- Hedged Deep Tracking (2016) (644)
- Cascaded Partial Decoder for Fast and Accurate Salient Object Detection (2019) (519)
- The Visual Object Tracking VOT2017 Challenge Results (2017) (424)
- Fast and robust text detection in images and video frames (2005) (372)
- The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking (2018) (347)
- The Visual Object Tracking VOT2013 Challenge Results (2013) (333)
- Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks (2015) (315)
- F3Net: Fusion, Feedback and Focus for Salient Object Detection (2019) (310)
- Descriptive visual words and visual phrases for image applications (2009) (255)
- Review of Visual Saliency Detection With Comprehensive Information (2018) (242)
- Spatial Pyramid-Enhanced NetVLAD With Weighted Triplet Loss for Place Recognition (2020) (242)
- Image Matching by Normalized Cross-Correlation (2006) (229)
- Stacked Cross Refinement Network for Edge-Aware Salient Object Detection (2019) (227)
- Global Context-Aware Progressive Aggregation Network for Salient Object Detection (2020) (220)
- Multimodal Transformer With Multi-View Visual Representation for Image Captioning (2019) (217)
- Measuring visual saliency by Site Entropy Rate (2010) (202)
- Saliency Detection for Stereoscopic Images Based on Depth Confidence Analysis and Multiple Cues Fusion (2016) (185)
- A configurable method for multi-style license plate recognition (2009) (185)
- RAM: A Region-Aware Deep Model for Vehicle Re-Identification (2018) (175)
- Using Webcast Text for Semantic Event Detection in Broadcast Sports Video (2008) (162)
- VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results (2018) (158)
- Joint Source-Channel Rate-Distortion Optimization for H.264 Video Coding Over Error-Prone Networks (2007) (158)
- Gradually Vanishing Bridge for Adversarial Domain Adaptation (2020) (156)
- Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations (2020) (156)
- Less Is More: Picking Informative Frames for Video Captioning (2018) (149)
- Label Decoupling Framework for Salient Object Detection (2020) (143)
- Building contextual visual vocabulary for large-scale image applications (2010) (139)
- Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification (2016) (138)
- Human Daily Action Analysis with Multi-view and Color-Depth Data (2012) (133)
- The Thermal Infrared Visual Object Tracking VOT-TIR2015 Challenge Results (2015) (132)
- Robust moving object segmentation on H.264/AVC compressed video using the block-based MRF model (2005) (123)
- Thresholding technique with adaptive window selection for uneven lighting image (2005) (121)
- Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model (2009) (119)
- ASIF-Net: Attention Steered Interweave Fusion Network for RGB-D Salient Object Detection (2020) (119)
- F³Net: Fusion, Feedback and Focus for Salient Object Detection (2020) (117)
- Joint Feature Selection and Classification for Multilabel Learning (2018) (111)
- Hedging Deep Features for Visual Tracking (2019) (111)
- Trajectory based event tactics analysis in broadcast sports video (2007) (109)
- Going From RGB to RGBD Saliency: A Depth-Guided Transformation Model (2020) (107)
- A Useful Visualization Technique: A Literature Review for Augmented Reality and its Application, limitation & future direction (2009) (105)
- Affective Visualization and Retrieval for Music Video (2010) (105)
- Event Tactic Analysis Based on Broadcast Sports Video (2009) (100)
- Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition (2017) (99)
- An Iterative Co-Saliency Framework for RGBD Images (2017) (97)
- Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification (2020) (93)
- Player action recognition in broadcast tennis video with applications to semantic analysis of sports game (2006) (93)
- Learning Label Specific Features for Multi-label Classification (2015) (92)
- Co-Saliency Detection for RGBD Images Based on Multi-Constraint Feature Matching and Cross Label Propagation (2017) (92)
- Blind image quality prediction by exploiting multi-level deep representations (2018) (90)
- Deep Unsupervised Convolutional Domain Adaptation (2017) (87)
- Improving multi-label classification with missing labels by learning label-specific features (2019) (84)
- Reverse Perspective Network for Perspective-Aware Object Counting (2020) (82)
- Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval (2018) (82)
- Learning Fragment Self-Attention Embeddings for Image-Text Matching (2019) (80)
- Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video (2007) (80)
- Region-based visual attention analysis with its application in image browsing on small displays (2007) (79)
- Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications (2011) (76)
- State-Relabeling Adversarial Active Learning (2020) (75)
- Cross-media analysis and reasoning: advances and directions (2017) (74)
- Detecting Violent Scenes in Movies by Auditory and Visual Cues (2008) (73)
- The Visual Object Tracking VOT 2016 Challenge Results (2018) (73)
- The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline (2019) (72)
- Recognizing human group action by layered model with multiple cues (2014) (72)
- CenterNet: Object Detection with Keypoint Triplets (2019) (70)
- Video Saliency Detection via Sparsity-Based Reconstruction and Propagation (2019) (70)
- A generic virtual content insertion system based on visual attention analysis (2008) (69)
- DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection (2020) (69)
- Edge-SIFT: Discriminative Binary Descriptor for Scalable Partial-Duplicate Mobile Search (2013) (69)
- An effective method to detect and categorize digitized traditional Chinese paintings (2006) (68)
- HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images (2018) (68)
- Spatiotemporal CNN for Video Object Segmentation (2019) (67)
- USB: Ultrashort Binary Descriptor for Fast Visual Matching and Retrieval (2014) (66)
- Image classification by non-negative sparse coding, correlation constrained low-rank and sparse decomposition (2014) (65)
- Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding (2012) (65)
- Affective Image Content Analysis: A Comprehensive Survey (2018) (64)
- Multi-label classification by exploiting local positive and negative pairwise label correlation (2017) (64)
- HodgeRank on Random Graphs for Subjective Video Quality Assessment (2012) (64)
- A Novel Rate Control Technique for Multiview Video Plus Depth Based 3D Video Coding (2011) (62)
- Affective MTV analysis based on arousal and valence features (2008) (62)
- A Scheme for Ball Detection and Tracking in Broadcast Soccer Video (2005) (62)
- Toward Realistic Face Photo–Sketch Synthesis via Composition-Aided GANs (2017) (62)
- Social Attribute-Aware Force Model: Exploiting Richness of Interaction for Abnormal Crowd Detection (2015) (60)
- Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning (2017) (59)
- Online crowdsourcing subjective image quality assessment (2012) (58)
- VisDrone-VDT2018: The Vision Meets Drone Video Detection and Tracking Challenge Results (2018) (57)
- Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval (2018) (57)
- Dual Quaternion Knowledge Graph Embeddings (2021) (57)
- Corner Proposal Network for Anchor-free, Two-stage Object Detection (2020) (57)
- VisDrone-SOT2019: The Vision Meets Drone Single Object Tracking Challenge Results (2018) (56)
- Highlight Summarization in Sports Video Based on Replay Detection (2006) (55)
- Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching (2013) (54)
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding (2019) (53)
- Automatic Multi-Player Detection and Tracking in Broadcast Sports Video using Support Vector Machine and Particle Filter (2006) (52)
- Fast and effective text detection (2008) (52)
- Extracting 3D information from broadcast soccer video (2006) (52)
- Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval (2013) (52)
- Transferring Boosted Detectors Towards Viewpoint and Scene Adaptiveness (2011) (52)
- Jersey number detection in sports video for athlete identification (2005) (51)
- Abnormal crowd behavior detection based on social attribute-aware force model (2012) (51)
- Multi-Level Discriminative Dictionary Learning With Application to Large Scale Image Classification (2015) (49)
- Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization (2013) (49)
- Error resilience video coding in H.264 encoder with potential distortion tracking (2004) (49)
- Action Recognition in Broadcast Tennis Video (2006) (49)
- A framework for flexible summarization of racquet sports video using multiple modalities (2009) (48)
- Automatic text segmentation from complex background (2004) (48)
- Online Deformable Object Tracking Based on Structure-Aware Hyper-Graph (2016) (48)
- Split Multiplicative Multi-View Subspace Clustering (2019) (46)
- Discrete Probability Distribution Prediction of Image Emotions with Shared Sparse Learning (2020) (45)
- Learning to Predict Bus Arrival Time From Heterogeneous Measurements via Recurrent Neural Network (2019) (45)
- Image classification using spatial pyramid robust sparse coding (2013) (45)
- Geometric Hypergraph Learning for Visual Tracking (2016) (45)
- A Low-Cost Very Large Scale Integration Architecture for Multistandard Inverse Transform (2010) (45)
- Structure-Aware Local Sparse Coding for Visual Tracking (2018) (45)
- Utilizing affective analysis for efficient movie browsing (2009) (45)
- Improving particle filter with support vector regression for efficient visual tracking (2005) (43)
- Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine (2006) (43)
- Exciting event detection in broadcast soccer video with mid-level description and incremental learning (2005) (43)
- Random partial paired comparison for subjective video quality assessment via hodgerank (2011) (42)
- Playfield detection using adaptive GMM and its application (2005) (42)
- Mean-Shift Blob Tracking with Adaptive Feature Selection and Scale Adaptation (2007) (41)
- Deep Spatial-Spectral Subspace Clustering for Hyperspectral Image (2021) (40)
- Object tracking using incremental 2D-LDA learning and Bayes inference (2008) (40)
- Learning Attribute-Specific Representations for Visual Tracking (2019) (40)
- Group Activity Recognition by Gaussian Processes Estimation (2010) (40)
- Semantically-Based Human Scanpath Estimation with HMMs (2013) (40)
- Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations (2020) (39)
- PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval (2016) (38)
- Fine-Grained Image Classification via Low-Rank Sparse Coding With General and Class-Specific Codebooks (2017) (37)
- Multimodal Similarity Gaussian Process Latent Variable Model (2017) (36)
- The Visual Object Tracking VOT 2017 challenge results (2018) (36)
- Effective Multimodality Fusion Framework for Cross-Media Topic Detection (2016) (36)
- Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification (2018) (35)
- ${\rm S}^{3}{\rm MKL}$: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications (2012) (35)
- Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation (2014) (35)
- Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval (2010) (35)
- Multi-feature metric learning with knowledge transfer among semantics and social tagging (2012) (34)
- Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos (2005) (34)
- SCAN: Spatial and Channel Attention Network for Vehicle Re-Identification (2018) (33)
- Image classification using Harr-like transformation of local features with coding residuals (2013) (32)
- Automatic sports genre categorization and view-type classification over large-scale dataset (2009) (32)
- Contextual Exemplar Classifier-Based Image Representation for Classification (2017) (32)
- Video Shot Detection Using Hidden Markov Models with Complementary Features (2006) (32)
- SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning (2019) (31)
- Set-label modeling and deep metric learning on person re-identification (2015) (31)
- Online Asymmetric Similarity Learning for Cross-Modal Retrieval (2017) (31)
- Multiple Instance Boost Using Graph Embedding Based Decision Stump for Pedestrian Detection (2008) (31)
- Multi-modal semantic autoencoder for cross-modal retrieval (2019) (30)
- Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval (2017) (29)
- Cross-media topic detection: A multi-modality fusion framework (2013) (29)
- Semantic invariant cross-domain image generation with generative adversarial networks (2018) (29)
- Unsupervised Texture Classification: Automatically Discover and Classify Texture Patterns (2006) (28)
- S3MKL: scalable semi-supervised multiple kernel learning for image data mining (2010) (28)
- When to Learn What: Deep Cognitive Subspace Clustering (2018) (28)
- Heuristic Domain Adaptation (2020) (27)
- Exploring Coherent Motion Patterns via Structured Trajectory Learning for Crowd Mood Modeling (2017) (27)
- Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval (2018) (27)
- A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning (2017) (27)
- Mode mapping method for H.264/AVC spatial downscaling transcoding (2004) (27)
- Treat samples differently: Object tracking with semi-supervised online CovBoost (2011) (27)
- Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization (2019) (26)
- Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting (2020) (26)
- Object categorization in sub-semantic space (2014) (26)
- Robust evaluation for quality of experience in crowdsourcing (2013) (25)
- ObjectPatchNet: Towards scalable and semantic image annotation and retrieval (2014) (25)
- JDL at TRECVID 2006 Shot Boundary Detection (2006) (25)
- Action Recognition Using Spatial-Temporal Context (2010) (24)
- Global-and-Local Collaborative Learning for Co-Salient Object Detection (2022) (24)
- Beyond particle flow: Bag of Trajectory Graphs for dense crowd event recognition (2013) (24)
- Query sensitive dynamic web video thumbnail generation (2011) (24)
- Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks (2015) (24)
- Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding (2019) (24)
- Robust visual tracking via scale-and-state-awareness (2019) (24)
- Greedy Gradient Ensemble for Robust Visual Question Answering (2021) (24)
- RD-optimized interactive streaming of multiview video with multiple encodings (2010) (23)
- Similarity Gaussian Process Latent Variable Model for Multi-modal Data Analysis (2015) (23)
- Abnormal event detection in crowded scenes based on Structural Multi-scale Motion Interrelated Patterns (2013) (23)
- Rethinking Graph Neural Architecture Search from Message-passing (2021) (23)
- Viewpoint and Scale Consistency Reinforcement for UAV Vehicle Re-Identification (2020) (22)
- Distributed image understanding with semantic dictionary and semantic expansion (2016) (22)
- TINA: Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation (2016) (22)
- Context-based 2D-VLC for video coding (2004) (22)
- Improved error concealment algorithms based on H.264/AVC non-normative decoder (2004) (22)
- Web video thumbnail recommendation with content-aware analysis and query-sensitive matching (2014) (22)
- Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval (2019) (22)
- Image Class Prediction by Joint Object, Context, and Background Modeling (2018) (21)
- Beyond visual features: A weak semantic image representation using exemplar classifiers for classification (2013) (21)
- Syntax-Guided Hierarchical Attention Network for Video Captioning (2022) (21)
- Self-Regulated Learning for Egocentric Video Activity Anticipation (2021) (21)
- Unsupervised Web Topic Detection Using A Ranked Clustering-Like Pattern Across Similarity Cascades (2015) (21)
- A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video (2007) (21)
- An effective multi-clue fusion approach for web video topic detection (2012) (21)
- Decomposition and Completion Network for Salient Object Detection (2021) (21)
- Naming faces in broadcast news video by image google (2008) (21)
- Online Selection of Discriminative Features Using Bayes Error Rate for Visual Tracking (2006) (21)
- Iterative Graph Seeking for Object Tracking (2018) (20)
- Image classification by search with explicitly and implicitly semantic representations (2017) (20)
- Long-Term Video Question Answering via Multimodal Hierarchical Memory Attentive Networks (2021) (20)
- Multi-View Spatial Attention Embedding for Vehicle Re-Identification (2021) (20)
- Group sensitive Classifier Chains for multi-label classification (2015) (20)
- New bi-prediction techniques for B pictures coding [video coding] (2004) (20)
- Detecting Small Objects Using a Channel-Aware Deconvolutional Network (2020) (20)
- Video2Cartoon: A System for Converting Broadcast Soccer Video into 3D Cartoon Animation (2007) (19)
- A New Text Detection Algorithm in Images/Video Frames (2004) (19)
- DM2C: Deep Mixed-Modal Clustering (2019) (19)
- Learning With Multiclass AUC: Theory and Algorithms (2021) (19)
- Friend recommendation according to appearances on photos (2009) (18)
- Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval (2017) (18)
- Boosted random contextual semantic space based representation for visual recognition (2016) (18)
- Nearest-neighbor method using multiple neighborhood similarities for social media data mining (2012) (18)
- GOMES: A group-aware multi-view fusion approach towards real-world image clustering (2015) (18)
- Replay Detection Based on Semi-automatic Logo Template Sequence Extraction in Sports Video (2007) (18)
- Image Matching by Multiscale Oriented Corner Correlation (2006) (17)
- Joint multi-view representation and image annotation via optimal predictive subspace learning (2018) (17)
- Near-duplicate video matching with transformation recognition (2009) (17)
- Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval (2017) (17)
- Video2Cartoon: generating 3D cartoon from broadcast soccer video (2005) (17)
- VisDrone-MOT2019: The Vision Meets Drone Multiple Object Tracking Challenge Results (2019) (17)
- A Recursive Constrained Framework for Unsupervised Video Action Clustering (2020) (17)
- Key Techniques of Bit Rate Reduction for H.264 Streams (2004) (17)
- Online selection of the best k-feature subset for object tracking (2012) (17)
- Person Re-Identification by Semantic Region Representation and Topology Constraint (2018) (17)
- Location-Sensitive Visual Recognition with Cross-IOU Loss (2021) (16)
- Online web video topic detection and tracking with semi-supervised learning (2016) (16)
- Unsupervised sports video scene clustering and its applications to story units detection (2005) (16)
- ALID: Scalable Dominant Cluster Detection (2014) (16)
- A Multiple Targets Appearance Tracker Based on Object Interaction Models (2012) (15)
- Augmented Adversarial Training for Cross-Modal Retrieval (2021) (15)
- A new method to calculate the camera focusing area and player position on playfield in soccer video (2005) (15)
- Joint Multi-View Representation Learning and Image Tagging (2016) (15)
- A pixel-wise local information-based background subtraction approach (2008) (15)
- Quaternion-Based Knowledge Graph Network for Recommendation (2020) (15)
- Matching images more efficiently with local descriptors (2008) (14)
- Hierarchical deep semantic representation for visual categorization (2017) (14)
- Cross-modal Retrieval by Real Label Partial Least Squares (2016) (14)
- Human group activity analysis with fusion of motion and appearance information (2011) (14)
- MOCC: A Fast and Robust Correlation-Based Method for Interest Point Matching under Large Scale Changes (2010) (14)
- Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment (2022) (14)
- Event tactic analysis based on player and ball trajectory in broadcast video (2008) (14)
- A hybrid text segmentation approach (2009) (14)
- Deep Stereoscopic Image Super-Resolution via Interaction Module (2020) (14)
- Online multiple object tracking via exchanging object context (2018) (14)
- Modeling spatial and semantic cues for large-scale near-duplicated image retrieval (2011) (14)
- Exploiting sample correlation for crowd counting with multi-expert network (2021) (14)
- Harmonized Multimodal Learning with Gaussian Process Latent Variable Models (2019) (13)
- Image classification using boosted local features with random orientation and location selection (2015) (13)
- A Generic Approach for Systematic Analysis of Sports Videos (2012) (13)
- From Social to Individuals: A Parsimonious Path of Multi-Level Models for Crowdsourced Preference Aggregation (2018) (13)
- Coarse-to-fine video text detection (2008) (13)
- Video saliency prediction with optimized optical flow and gravity center bias (2016) (13)
- Spatial-temporal attention analysis for home video (2008) (13)
- Relative image similarity learning with contextual information for Internet cross-media retrieval (2014) (13)
- Detection and location of near-duplicate video sub-clips by finding dense subgraphs (2011) (13)
- Representing dense crowd patterns using bag of trajectory graphs (2014) (13)
- Low-delay View Random Access for Multi-view Video Coding (2007) (13)
- Online low-rank similarity function learning with adaptive relative margin for cross-modal retrieval (2017) (13)
- Webpage saliency prediction with multi-features fusion (2016) (13)
- Nearest-neighbor classification using unlabeled data for real world image application (2010) (13)
- Compression-Induced Rendering Distortion Analysis for Texture/Depth Rate Allocation in 3D Video Compression (2009) (13)
- DMVOS: Discriminative Matching for Real-time Video Object Segmentation (2020) (13)
- Aesthetic composition represetation for portrait photographing recommendation (2012) (13)
- A Simulation Analysis on the Existence of Network Traffic Flow Equilibria (2014) (13)
- A Survey on Visual Human Action Recognition: A Survey on Visual Human Action Recognition (2014) (12)
- Exploring Outliers in Crowdsourced Ranking for QoE (2017) (12)
- Fast Batch Nuclear-norm Maximization and Minimization for Robust Domain Adaptation (2021) (12)
- Generating Video Sequence from Photo Image for Mobile Screens by Content Analysis (2007) (12)
- Weakly Supervised Bilinear Attention Network for Fine-Grained Visual Classification (2018) (12)
- A Hierarchical CNN-RNN Approach for Visual Emotion Classification (2019) (12)
- Multi-View Multi-Label Learning With View-Label-Specific Features (2019) (12)
- When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC (2021) (12)
- People re-detection using Adaboost with sift and color correlogram (2008) (12)
- Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association (2021) (12)
- Viewpoint switching in multiview video streaming (2005) (12)
- Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection (2020) (12)
- IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning (2020) (12)
- Vicept: link visual features to concepts for large-scale image understanding (2010) (11)
- Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval (2020) (11)
- Personalized MTV Affective Analysis Using User Profile (2008) (11)
- Multiview Video Coding Based on Global Motion Model (2004) (11)
- SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently (2013) (11)
- Joint image representation and classification in random semantic spaces (2015) (11)
- Online HodgeRank on Random Graphs for Crowdsourceable QoE Evaluation (2014) (11)
- Deep Constrained Low-Rank Subspace Learning for Multi-View Semi-Supervised Classification (2019) (11)
- Facial Landmarks Detection by Self-Iterative Regression based Landmarks-Attention Network (2018) (11)
- Visual perception based Lagrangian rate distortion optimization for video coding (2011) (11)
- Training Efficient Saliency Prediction Models with Knowledge Distillation (2019) (11)
- Release the Power of Online-Training for Robust Visual Tracking (2020) (11)
- Bilevel Multiview Latent Space Learning (2018) (11)
- Building pair-wise visual word tree for efficent image re-ranking (2010) (11)
- Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending (2020) (10)
- Long Short-Term Relation Transformer With Global Gating for Video Captioning (2022) (10)
- Semantic-aware Hashing for Social Image Retrieval (2015) (10)
- Does Thermal Really Always Matter for RGB-T Salient Object Detection? (2022) (10)
- Depth image segmentation for improved virtual view image quality in 3-DTV (2007) (10)
- LSH-based semantic dictionary learning for large scale image understanding (2015) (10)
- Shot classification for action movies based on motion characteristics (2008) (10)
- Summarization in Soccer Video based on Goalmouth Detection (2006) (10)
- Composition-aided Sketch-realistic Portrait Generation (2017) (10)
- Conditional GAN based individual and global motion fusion for multiple object tracking in UAV videos (2020) (10)
- Robust copy detection by mining temporal self-similarities (2009) (10)
- Matching Content-based Saliency Regions for partial-duplicate image retrieval (2011) (10)
- Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval (2015) (10)
- Learning Coupled Convolutional Networks Fusion for Video Saliency Prediction (2019) (10)
- Interpretable Visual Reasoning via Probabilistic Formulation Under Natural Supervision (2020) (10)
- CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection (2022) (10)
- Laplacian affine sparse coding with tilt and orientation consistency for image classification (2013) (10)
- Cross-media Topic Detection with Refined CNN based Image-Dominant Topic Model (2015) (10)
- Online Discriminative Structured Output SVM Learning for Multi-Target Tracking (2014) (9)
- Multimodal Entity Linking: A New Dataset and A Baseline (2021) (9)
- Fusing cross-media for topic detection by dense keyword groups (2015) (9)
- Semi-Autoregressive Image Captioning (2021) (9)
- Weighted visual vocabulary to balance the descriptive ability on general dataset (2013) (9)
- Visual-aural attention modeling for talk show video highlight detection (2008) (9)
- Learning Self-Supervised Space-Time CNN for Fast Video Style Transfer (2021) (9)
- Multi-order visual phrase for scalable image search (2013) (9)
- Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval (2014) (9)
- Geometry Interaction Knowledge Graph Embeddings (2022) (9)
- Pornographic Image Detection Based on Multilevel Representation (2009) (9)
- i.MTV: an integrated system for mtv affective analysis (2008) (9)
- Monocular Tracking 3D People By Gaussian Process Spatio-Temporal Variable Model (2007) (9)
- Subjective evaluation criterion for selecting affective features and modeling highlights (2006) (9)
- Learning Deep Convolutional Neural Networks for Places2 Scene Recognition (2015) (9)
- Adversarial Preference Learning with Pairwise Comparisons (2019) (9)
- Style-adaptive photo aesthetic rating via convolutional neural networks and multi-task learning (2020) (8)
- Visual ContextRank for web image re-ranking (2009) (8)
- Topic detection in cross-media: a semi-supervised co-clustering approach (2014) (8)
- A Fast Intra Mode Decision Algorithm for AVS to H.264 Transcoding (2006) (8)
- Discovering Fine-Grained Spatial Pattern From Taxi Trips: Where Point Process Meets Matrix Decomposition and Factorization (2018) (8)
- Extracting Story Units in Sports Video Based on Unsupervised Video Scene Clustering (2006) (8)
- Visual Ontology Construction for Digitized Art Image Retrieval (2005) (8)
- Online Deformable Object Tracking Based on Structure-Aware Hyper-Graph. (2016) (8)
- Advertise gently - in-image advertising with low intrusiveness (2009) (8)
- Robust Latent Poisson Deconvolution From Multiple Features for Web Topic Detection (2016) (8)
- A scheme for racquet sports video analysis with the combination of audio-visual information (2005) (8)
- Justifying the Importance of Color Cues in Object Detection: A Case Study on Pedestrian (2013) (8)
- Towards More Explainability: Concept Knowledge Mining Network for Event Recognition (2020) (8)
- Reverse Densely Connected Feature Pyramid Network for Object Detection (2018) (8)
- Beyond global fusion: A group-aware fusion approach for multi-view image clustering (2019) (8)
- Vehicle Detection in UAV Traffic Video Based on Convolution Neural Network (2018) (8)
- Hierarchical Modular Network for Video Captioning (2021) (8)
- CNN-MR for No Reference Video Quality Assessment (2017) (8)
- Multiple Kernel Learning with High Order Kernels (2010) (8)
- Advances in Multimedia Information Processing – PCM 2017 (2017) (8)
- Reducing Spatial Resolution for MPEG-2 to H.264/AVC Transcoding (2005) (8)
- A novel rate control scheme for video streaming over wireless networks (2004) (8)
- Saliency-Based Spatiotemporal Attention for Video Captioning (2018) (7)
- Implicit Feedbacks are Not Always Favorable: Iterative Relabeled One-Class Collaborative Filtering against Noisy Interactions (2021) (7)
- Categorizing Social Multimedia by Neighborhood Decision Using Local Pairwise Label Correlation (2014) (7)
- Cascade Category-Aware Visual Search (2014) (7)
- Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval (2015) (7)
- Pedestrian detection via logistic multiple instance boosting (2008) (7)
- The Demo: A Real-Time Score Detection and Recognition Approach in Broadcast Basketball Sports Video (2007) (7)
- Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence (2021) (7)
- Cross-media topic detection associated with hot search queries (2013) (7)
- A Rotation Invariant Descriptor for Robust Video Copy Detection (2013) (7)
- Self-Supervised Deep TripleNet for Video Object Segmentation (2021) (7)
- Click data guided query modeling with click propagation and sparse coding (2018) (7)
- HodgeRank With Information Maximization for Crowdsourced Pairwise Ranking Aggregation (2017) (7)
- Collaborative Preference Embedding against Sparse Labels (2019) (7)
- LVE-S2D: Low-Light Video Enhancement From Static to Dynamic (2022) (7)
- Fast copy detection based on Slice Entropy Scattergraph (2010) (7)
- Multimodal Gaussian Process Latent Variable Models with Harmonization (2017) (7)
- Structure-aware multi-object discovery for weakly supervised tracking (2014) (7)
- Rethinking Graph Neural Network Search from Message-passing (2021) (7)
- Deep Affine Motion Compensation Network for Inter Prediction in VVC (2021) (7)
- Crowd video retrieval via deep attribute-embedding graph ranking (2016) (7)
- Learning image Vicept description via mixed-norm regularization for large scale semantic image search (2011) (6)
- Automatic video genre categorization and event detection techniques on large-scale sports data (2010) (6)
- Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID (2020) (6)
- Label Correlation Guided Deep Multi-View Image Annotation (2019) (6)
- From Common to Special: When Multi-Attribute Learning Meets Personalized Opinions (2017) (6)
- Composition-Aided Face Photo-Sketch Synthesis (2017) (6)
- Linear transform based motion compensated prediction for luminance intensity changes (2005) (6)
- Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis (2020) (6)
- Nearest Neighbor Classifier Embedded Network for Active Learning (2021) (6)
- Multi-view Video Coding with Flexible View-Temporal Prediction Structure for Fast Random Access (2006) (6)
- Sports Video Analysis: From Semantics to Tactics (2009) (6)
- Structured Stochastic Recurrent Network for Linguistic Video Prediction (2019) (6)
- Channel-wise Temporal Attention Network for Video Action Recognition (2019) (6)
- ER: Equivariance Regularizer for Knowledge Graph Completion (2022) (6)
- A Margin-based MLE for Crowdsourced Partial Ranking (2018) (6)
- Mining Information of Attack-Defense Status from Soccer Video Based on Scene Analysis (2007) (6)
- Generalized Zero-Shot Video Classification via Generative Adversarial Networks (2020) (6)
- Undo the codebook bias by linear transformation for visual applications (2013) (6)
- Novel observation model for probabilistic object tracking (2010) (6)
- CenterNet++ for Object Detection (2022) (6)
- Intra- and Inter-modal Multilinear Pooling with Multitask Learning for Video Grounding (2020) (6)
- Symmetric segment-based stereo matching of motion blurred images with illumination variations (2008) (6)
- MULTFRC-LERD: An Improved Rate Control Scheme for Video Streaming over Wireless (2004) (6)
- Personalized online video recommendation by neighborhood score propagation based global ranking (2009) (5)
- Who to Ask: An Intelligent Fashion Consultant (2018) (5)
- DA-CCD: A novel action representation by Deep Architecture of local depth feature (2014) (5)
- Coarse-to-Fine Dissolve Detection Based on Image Quality Assessment (2013) (5)
- The third eye: mining the visual cognition across multi-language communities (2010) (5)
- Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data (2021) (5)
- Diverter-Guider Recurrent Network for Diverse Poems Generation from Image (2020) (5)
- Recognizing Realistic Action Using Contextual Feature Group (2013) (5)
- WIKI-CMR: A web cross modality dataset for studying and evaluation of cross modality retrieval models (2013) (5)
- Fine-Grained Image Quality Assessment: A Revisit and Further Thinking (2021) (5)
- Cross media topic analytics based on synergetic content and user behavior modeling (2014) (5)
- Toward Understanding and Boosting Adversarial Transferability From a Distribution Perspective (2022) (5)
- Motion Based Perceptual Distortion and Rate Optimization for Video Coding (2012) (5)
- Bridging the gap between objective score and subjective preference in video quality assessment (2010) (5)
- Saliency detection with two-level fully convolutional networks (2017) (5)
- Lower attentive region detection for virtual content insertion in broadcast video (2008) (5)
- Self-balance Motion and Appearance Model for Multi-object Tracking in UAV (2019) (5)
- A two-step approach to describing web topics via probable keywords and prototype images from background-removed similarities (2018) (5)
- A novel FGS base-layer encoding model and weight-based rate adaptation for constant-quality streaming (2004) (5)
- Accurate and efficient cross-domain visual matching leveraging multiple feature representations (2013) (5)
- Task-Feature Collaborative Learning with Application to Personalized Attribute Prediction (2020) (5)
- Memory matrix: a novel user experience for home video (2010) (5)
- Structural Semantic Adversarial Active Learning for Image Captioning (2020) (5)
- Pareto Optimality for Fairness-constrained Collaborative Filtering (2021) (5)
- Formation Period Matters: Towards Socially Consistent Group Detection via Dense Subgraph Seeking (2015) (5)
- Image Inpainting Based on Multi-frequency Probabilistic Inference Model (2020) (5)
- A model-based demand-balancing control for dynamically divided multiple urban subnetworks (2016) (4)
- Socio-mobile landmark recognition using local features with adaptive region selection (2016) (4)
- A Novel Framework for Web Video Thumbnail Generation (2012) (4)
- ObjectBook construction for large-scale semantic-aware image retrieval (2011) (4)
- Multi-Stream Region Proposal Network for Pedestrian Detection (2018) (4)
- I2Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning (2022) (4)
- News video story sentiment classification and ranking (2011) (4)
- Weakly Supervised Local Attention Network for Fine-Grained Visual Classification (2018) (4)
- Multi-description of local interest point for partial-duplicate image retrieval (2010) (4)
- Task-distribution-aware Meta-learning for Cold-start CTR Prediction (2020) (4)
- Improving Image Distance Metric Learning by Embedding Semantic Relations (2012) (4)
- An efficient occlusion detection method to improve object trackers (2013) (4)
- Local-binarized very deep residual network for visual categorization (2020) (4)
- Beyond visual word ambiguity: Weighted local feature encoding with governing region (2014) (4)
- Graph-Density-based visual word vocabulary for image retrieval (2014) (4)
- A Bit-Plane Decomposition Matrix-Based VLSI Integer Transform Architecture for HEVC (2017) (4)
- Coupling Reranking and Structured Output SVM Co-Train for Multitarget Tracking (2016) (4)
- An Edge-Based Median Filtering Algorithm with Consideration of Motion Vector Reliability for Adaptive Video Deinterlacing (2006) (4)
- Content-Based Video Semantic Analysis (2009) (4)
- Image-regulated graph topic model for cross-media topic detection (2015) (4)
- Human reappearance detection based on on-line learning (2008) (4)
- Improving cross-modal correlation learning with hyperlinks (2015) (4)
- Beyond bag of words: image representation in sub-semantic space (2013) (4)
- Self-calibration Based 3D Information Extraction and Application in Broadcast Soccer Video (2006) (4)
- Multi-label double-layer learning for cross-modal retrieval (2018) (4)
- Proposal Complementary Action Detection (2020) (4)
- Online learning affinity measure with CovBoost for multi-target tracking (2015) (4)
- Learning Sparse Prototypes for Crowd Perception via Ensemble Coding Mechanisms (2014) (3)
- SIEV-Net: A Structure-Information Enhanced Voxel Network for 3D Object Detection From LiDAR Point Clouds (2022) (3)
- Large scale image understanding with non-convex multi-task learning (2014) (3)
- Stereoscopic Image Retargeting Based on Deep Convolutional Neural Network (2021) (3)
- Beyond appearance model: Learning appearance variations for object tracking (2016) (3)
- Highlight Ranking for Racquet Sports Video in User Attention Subspaces Based on Relevance Feedback (2007) (3)
- Attribute Group Editing for Reliable Few-shot Image Generation (2022) (3)
- Rotative maximal pattern: A local coloring descriptor for object classification and recognition (2017) (3)
- Advances in Multimedia Information Processing – PCM 2017 (2017) (3)
- Fast common visual pattern detection via radiate geometric model (2011) (3)
- Abnormal Event Detection Based on Multi-scale Markov Random Field (2015) (3)
- Spatial-temporal video browsing for mobile environment based on visual attention analysis (2009) (3)
- Evaluating Visual Properties via Robust HodgeRank (2021) (3)
- Deep Robust Subjective Visual Property Prediction in Crowdsourcing (2019) (3)
- Adaptive Moving Cast Shadow Detection (2013) (3)
- Video Anomaly Detection Using Open Data Filter and Domain Adaptation (2020) (3)
- Multi-Attention Network for Compressed Video Referring Object Segmentation (2022) (3)
- Effective algorithms for fast transcoding of AVS to H.264/AVC in the spatial domain (2007) (3)
- Visual Saliency and Distortion Weighting Based Video Quality Assessment (2012) (3)
- Macroblock-level Reduced Resolution Video Coding Allowing Adaptive DCT Coefficients Selection (2007) (3)
- Face Distortion Recovery Based on Online Learning Database for Conversational Video (2014) (3)
- Effective scene matching with local feature representatives (2008) (3)
- Meta-Wrapper: Differentiable Wrapping Operator for User Interest Selection in CTR Prediction (2021) (3)
- Cross-media retrieval with semantics clustering and enhancement (2017) (3)
- Accelerate convolutional neural networks for binary classification via cascading cost-sensitive feature (2016) (3)
- Set-based classification for person re-identification utilizing mutual-information (2013) (3)
- Dist-PU: Positive-Unlabeled Learning from a Label Distribution Perspective (2022) (3)
- Graph Regularized Encoder-Decoder Networks for Image Representation Learning (2021) (3)
- Theoretical analysis of learning local anchors for classification (2012) (3)
- Learning Personalized Attribute Preference via Multi-task AUC Optimization (2019) (3)
- Robust real-time transmission of scalable multimedia for heterogeneous client bandwidths (2005) (3)
- Edge-featured Graph Neural Architecture Search (2021) (3)
- Attention Based Album Slideshow (2010) (3)
- Online dictionary learning for Local Coordinate Coding with Locality Coding Adaptors (2015) (3)
- Deep Partial Rank Aggregation for Personalized Attributes (2021) (2)
- A generic approach to classify sports video shots and its application in event detection (2009) (2)
- Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection (2021) (2)
- Joint learning for side information and correlation model based on linear regression model in distributed video coding (2009) (2)
- Event based news video people classification and ranking using multimodality features (2010) (2)
- Coupling Multiple Alignments and Re-ranking for Low-Latency Online Multi-target Tracking (2014) (2)
- Justify role of Similarity Diffusion Process in cross-media topic ranking: an empirical evaluation (2017) (2)
- Action recognition using trajectories of spatio-tempral feature points (2014) (2)
- Two-stream deep sparse network for accurate and efficient image restoration (2020) (2)
- Undoing the codebook bias by linear transformation with sparsity and F-norm constraints for image classification (2014) (2)
- A System for Automatic Generation of Music Sports-Video (2005) (2)
- Video frame prediction with dual-stream deep network emphasizing motions and content details (2022) (2)
- Metric based on multi-order spaces for cross-modal retrieval (2017) (2)
- iSplit LBI: Individualized Partial Ranking with Ties via Split LBI (2019) (2)
- Learning Unified Embeddings for Recommendation via Meta-path Semantics (2021) (2)
- Edge Guided Generation Network for Video Prediction (2018) (2)
- The Minority Matters: A Diversity-Promoting Collaborative Metric Learning Algorithm (2022) (2)
- Semantic Manifold Alignment in Visual Feature Space for Zero-Shot Learning (2018) (2)
- RGB-D Human Matting: A Real-World Benchmark Dataset and A Baseline Method (2023) (2)
- Monocular Tracking 3 D People with Back Constrained Scaled Gaussian Process Latent Variable Models (2007) (2)
- Fast and Accurately Measuring Crack Width via Cascade Principal Component Analysis (2019) (2)
- When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking (2021) (2)
- Optimum End-to-End Distortion Estimation for Error Resilient Video Coding (2004) (2)
- FNet: Fusion, Feedback and Focus for Salient Object Detection (2020) (2)
- Adaptive Sharing for Image Classification (2015) (2)
- Robust Statistical Ranking: Theory and Algorithms (2014) (2)
- Local Laplacian Coding From Theoretical Analysis of Local Coding Schemes for Locally Linear Classification (2015) (2)
- A Novel Story Unit Segmentation Algorithm Avoiding Voice Cutting (2007) (2)
- Poisoning Attack Against Estimating From Pairwise Comparisons (2021) (2)
- From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks (2016) (2)
- Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer (2019) (2)
- Two Birds With One Stone: A Coupled Poisson Deconvolution for Detecting and Describing Topics From Multimodal Web Data (2019) (2)
- Online Learning Based Face Distortion Recovery for Conversational Video Coding (2013) (2)
- Multi-order visual phrase for scalable partial-duplicate visual search (2015) (2)
- User Attention Analysis Based Video Summarization and Highlight Ranking: User Attention Analysis Based Video Summarization and Highlight Ranking (2009) (2)
- Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding (2017) (2)
- Moving Object Segmentation: A Block-Based Moving Region Detection Approach (2004) (2)
- S2L: Single-Streamline For Complex Video Event Detection (2018) (2)
- C2FNet: A Coarse-to-Fine Network for Multi-View 3D Point Cloud Generation (2022) (2)
- A close-up detection method for movies (2010) (1)
- Span-based Audio-Visual Localization (2022) (1)
- Error-resistance and Low-complexity Integer Inverse Discrete Cosine Transform (2010) (1)
- Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval (2022) (1)
- A Sparse-Motif Ensemble Graph Convolutional Network against Over-smoothing (2022) (1)
- Latent influence propagation on dynamic networks (2015) (1)
- Not All Samples are Trustworthy: Towards Deep Robust SVP Prediction (2020) (1)
- How Functions Evolve in Deep Convolutional Neural Network (2018) (1)
- Semantic Editing On Segmentation Map Via Multi-Expansion Loss (2020) (1)
- @ICT: attention-based virtual content insertion (2012) (1)
- Sports video summarization and adaptation for application in mobile communication (2006) (1)
- Polysemious visual representation based on feature aggregation for large scale image applications (2014) (1)
- MaxMatch: Semi-Supervised Learning With Worst-Case Consistency (2022) (1)
- Story Unit Segmentation with Friendly Acoustic Perception (2007) (1)
- JEREMIE: Joint Semantic Feature Learning via Multi-relational Matrix Completion (2017) (1)
- Efficient lp-norm multiple feature metric learning for image categorization (2011) (1)
- Cross community news event summary generation based on collaborative ranking (2012) (1)
- Neural Collaborative Preference Learning With Pairwise Comparisons (2020) (1)
- Using timing to detect horror shots in horror movies (2007) (1)
- Introduction to the Special Issue on Fine-Grained Visual Recognition and Re-Identification (2022) (1)
- Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data (2021) (1)
- A Fast Approach for Natural Image Matting using Structure Information (2007) (1)
- Learning-to-Share Based on Finding Groups for Large Scale Image Classification (2013) (1)
- Optimizing Two-way Partial AUC with an End-to-end Framework (2022) (1)
- Who Likes What? - SplitLBI in Exploring Preferential Diversity of Ratings (2020) (1)
- Fusing multi-cues description for partial-duplicate image retrieval (2014) (1)
- A Two-Stage Approach to Highlight Extraction in Sports Video by Using AdaBoost and Multi-modal (2008) (1)
- Viewpoint Alignment and Discriminative Parts Enhancement in 3D Space for Vehicle ReID (2022) (1)
- Embedded Packetization Framework for Layered Multiple Description Coding (2004) (1)
- Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation (2021) (1)
- Strategy for aesthetic photography recommendation via collaborative composition model (2015) (1)
- Transfer pedestrian detector towards view-adaptiveness and efficiency (2009) (1)
- What to Select: Pursuing Consistent Motion Segmentation from Multiple Geometric Models (2021) (1)
- Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments (2022) (1)
- Color Maximal-Dissimilarity Pattern for pedestrian detection (2012) (1)
- Drift-compensated coding optimization for fast bit-rate reduction transcoding (2007) (1)
- FEC-based multiple description coding for heterogeneous client bandwidths (2004) (1)
- Real-time interactive multi-target tracking using kernel-based trackers (2010) (1)
- Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification (2023) (1)
- Stochastic boosting for large-scale image classification (2013) (1)
- Web topic detection using a ranked clustering-like pattern across similarity cascades (2014) (1)
- A fast intra 4×4 mode decision algorithm for H.264/AVC down rate transcoding (2010) (1)
- Rethinking Collaborative Metric Learning: Toward an Efficient Alternative Without Negative Sampling (2022) (1)
- Active Sampling for Subjective Video Quality Assessment (2018) (1)
- Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer (2022) (1)
- Online web video topic detection and tracking with semi-supervised learning (2013) (1)
- Transferrable Referring Expression Grounding with Concept Transfer and Context Inheritance (2020) (1)
- A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments (2020) (1)
- Fine-Grained Image Classification Using Color Exemplar Classifiers (2013) (1)
- Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability (2022) (0)
- Interactive event detection in crowd scenes (2012) (0)
- A Study of Neural Collapse Phenomenon: Grassmannian Frame, Symmetry, Generalization (2023) (0)
- HIGHLIGHTRANKINGFORRACQUET SPORTSVIDEOINUSERATTENTION SUBSPACESBASEDON RELEVANCE FEEDBACK (2007) (0)
- Action Category and Phase Consistency Regularization for High-Quality Temporal Action Proposal Generation (2021) (0)
- Content-based intelligent video recorder with its implementation on sports video (2011) (0)
- Self Supervised Progressive Network for High Performance Video Object Segmentation. (2022) (0)
- Online Vicept learning for web-scale image understanding (2011) (0)
- A REAL-TIMESCORE DETECTIONAND RECOGNITIONAPPROACH FOR BROADCAST BASKETBALLVIDEO (2007) (0)
- Online multi-target tracking via depth range segmentation (2017) (0)
- Accurate and efficient cross-domain visual matching leveraging multiple feature representations (2013) (0)
- Language Attention Proposal Attention + Training Inference man in white on the left holding a bat Subject Location Context Input query Input image (2019) (0)
- Spatio-temporal Visual Distortion and Rate Optimization for Video Coding (2012) (0)
- Confederated Learning: Going Beyond Centralization (2022) (0)
- A Unified Framework against Topology and Class Imbalance (2022) (0)
- Accelerating Topic Detection on Web for a Large-Scale Data Set via Stochastic Poisson Deconvolution (2018) (0)
- Active Perception Network for Salient Object Detection (2019) (0)
- Cross Concept Local Fisher Discriminant Analysis for Image Classification (2013) (0)
- Tri-level Combination for Image Representation (2016) (0)
- ZS-SBPRnet: A Zero-Shot Sketch-Based Point Cloud Retrieval Network Based on Feature Projection and Cross-Reconstruction (2022) (0)
- OTKGE: Multi-modal Knowledge Graph Embeddings via Optimal Transport (2022) (0)
- Fixation guided network for salient object detection (2021) (0)
- ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop (2018) (0)
- Web video thumbnail recommendation with content-aware analysis and query-sensitive matching (2013) (0)
- Multi-Projection Fusion and Refinement Network for Salient Object Detection in 360° Omnidirectional Image (2022) (0)
- Sharing model with multi-level feature representations (2014) (0)
- The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline (2019) (0)
- Progressive Multi-resolution Loss for Crowd Counting (2022) (0)
- CSCNet: A Shallow Single Column Network for Crowd Counting (2020) (0)
- Robust latent poisson deconvolution from multiple imperfect features for web topic detection (2016) (0)
- Inferential Visual Question Generation (2022) (0)
- Online learning af fi nity measure with CovBoost for multi-target tracking (2015) (0)
- View Sequence Coding using Warping-based Image Alignment for Multiview Video (2006) (0)
- General Greedy De-bias Learning (2021) (0)
- DMVOS (2020) (0)
- Justify role of Similarity Diffusion Process in cross-media topic ranking: an empirical evaluation (2017) (0)
- Intra- and Inter-modal Multilinear Pooling with Multitask Learning for Video Grounding (2020) (0)
- Rethinking Label Flipping Attack: From Sample Masking to Sample Thresholding (2023) (0)
- Consistency-Aware Anchor Pyramid Network for Crowd Localization (2022) (0)
- Two-Stream Sparse Network for Accurate Image Super-Resolution (2019) (0)
- Topic detection in cross-media: a semi-supervised co-clustering approach (2014) (0)
- Automatic Relation-aware Graph Network Proliferation (2022) (0)
- Recurrent Meta-Learning against Generalized Cold-start Problem in CTR Prediction (2022) (0)
- The Face Object based HEVC System for Video Call (2015) (0)
- Relative image similarity learning with contextual information for Internet cross-media retrieval (2013) (0)
- Modeling Long-Range Dependencies and Epipolar Geometry for Multi-View Stereo (2023) (0)
- THE DEMO:A REAL-TIMESCOREDETECTIONAND RECOGNITIONAPPROACH IN BROADCAST BASKETBALLSPORTSVIDEO (2007) (0)
- Deep neural networks for emerging multimedia computing and applications (2020) (0)
- Asymptotically Unbiased Instance-wise Regularized Partial AUC Optimization: Theory and Algorithm (2022) (0)
- Quaternion Ordinal Embedding (2022) (0)
- DVCFlow: Modeling Information Flow Towards Human-like Video Captioning (2021) (0)
- Learning Linguistic Association Towards Efficient Text-Video Retrieval (2022) (0)
- Descriptive VisualWords: the Visual Correspondences of Text Words (2009) (0)
- Visual Object Tracking using Sparse Representation and Interest Points in a Double Step Approach (2020) (0)
- Proceedings of the 2nd International Conference on Internet Multimedia Computing and Service, ICIMCS'10 (2010) (0)
- Optimizing Partial Area Under the Top-k Curve: Theory and Practice (2022) (0)
- On Discriminability and Diversity in Domain Adaptation (2021) (0)
- Pay Attention to Your Positive Pairs: Positive Pair Aware Contrastive Knowledge Distillation (2022) (0)
- Automatic Shadow Generation via Exposure Fusion (2023) (0)
- Highlight Ranking for Broadcast Tennis Video Based on Multi-modality Analysis and Relevance Feedback (2008) (0)
- Domain Specific and Idiom Adaptive Video Summarization (2019) (0)
- Recurrent Interaction Network for Stereoscopic Image Super-Resolution (2023) (0)
- Regularized topic-aware latent influence propagation in dynamic relational networks (2019) (0)
- Discriminative Spatial Codebook Generation for Image Classification (2013) (0)
- Click data guided query modeling with click propagation and sparse coding (2018) (0)
- Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation (2023) (0)
- Two-Stage Polishing Network for Camouflaged Object Detection (2021) (0)
- Efficient Cross-Modal Retrieval Using Social Tag Information Towards Mobile Applications (2017) (0)
- Spatial-Temporal Graph Network for Video Crowd Counting (2023) (0)
- Bandwidth adaptive quality smoothing for unequal error protected scalable video streaming (2005) (0)
- Correction to: Learning Enriched Hop-Aware Correlation for Robust 3D Human Pose Estimation (2023) (0)
- OpenAUC: Towards AUC-Oriented Open-Set Recognition (2022) (0)
- Cross modal metric learning with multi-level semantic relevance (2014) (0)
- MININGINFORMATION OF ATTACK-DEFENSESTATUSFROM SOCCER VIDEOBASEDON SCENEANALYSIS (2007) (0)
- Video Shrinking by Auditory and Visual Cues (2009) (0)
- Multi-order visual phrase for scalable partial-duplicate visual search (2014) (0)
- Increasing Interpretation of Web Topic Detection via Prototype Learning From Sparse Poisson Deconvolution (2019) (0)
- Estimating the value of θ in the intra frame for ρ-domain rate control algorithms (2009) (0)
- Multi-modal Multi-grained Embedding Learning for Generalized Zero-Shot Video Classification (2023) (0)
- Graph-Based Structural Deep Spectral-Spatial Clustering for Hyperspectral Image (2023) (0)
- The 2nd International Conference on Internet Multimedia Computing and Service, ICIMCS'10: Preface (2010) (0)
- Weakly Supervised Anomaly Detection in Videos Considering the Openness of Events (2022) (0)
- Weakly supervised cross-view action recognition via sequential motion accumulation (2014) (0)
- A Tale of HodgeRank and Spectral Method: Target Attack Against Rank Aggregation is the Fixed Point of Adversarial Game (2022) (0)
- @ICT: attention-based virtual content insertion (2011) (0)
- DBAM: Dense Boundary and Actionness Map for Action Localization in Videos via Sentence Query (2021) (0)
- CRNet: Collaborative Refinement Network for Self-Supervised Video Object Segmentation (2022) (0)
- Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition (2022) (0)
- One-Shot Example Videos Localization Network for Weakly-Supervised Temporal Action Localization (2021) (0)
- Localized Image Matte Evaluation by Gradient Correlation (2010) (0)
- Message from the DIKW 2021 Program Chairs (2021) (0)
- Human tracking by structured body parts (2011) (0)
- Representing dense crowd patterns using bag of trajectory graphs (2014) (0)
- Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection (2022) (0)
- AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems (2022) (0)
- Enhanced Semantic Head for Cascade Instance Segmentation (2022) (0)
This paper list is powered by the following services: