Yugang Jiang
#138,450
Most Influential Person Now
Yugang Jiang's AcademicInfluence.com Rankings
Yugang Jiang Engineering Degrees
- Engineering: World Rank #5072, Historical Rank #6311
- Electrical Engineering: World Rank #1354, Historical Rank #1445

Yugang Jiang Computer Science Degrees
- Computer Science: World Rank #6494, Historical Rank #6847
- Algorithms: World Rank #235, Historical Rank #238
- Machine Learning: World Rank #2129, Historical Rank #2157
- Database: World Rank #3578, Historical Rank #3729

Engineering Computer Science
Yugang Jiang's Degrees
- Bachelor's, Electrical Engineering, Tsinghua University
Why Is Yugang Jiang Influential?
Yugang Jiang's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Supervised hashing with kernels (2012) (1378)
- Evaluating bag-of-visual-words representations in scene classification (2007) (921)
- Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images (2018) (917)
- Towards optimal bag-of-features for object categorization and semantic video retrieval (2007) (721)
- DSOD: Learning Deeply Supervised Object Detectors from Scratch (2017) (530)
- Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification (2015) (412)
- NAIS: Neural Attentive Item Similarity Model for Recommendation (2018) (358)
- The MediaMill TRECVID 2006 Semantic Video Search Engine (2006) (357)
- Pose-Normalized Image Generation for Person Re-identification (2017) (343)
- The THUMOS challenge on action recognition for videos "in the wild" (2016) (331)
- Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks (2015) (320)
- Consumer video understanding: a benchmark database and an evaluation of human and machine performance (2011) (290)
- Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study (2010) (284)
- Learning Fashion Compatibility with Bidirectional LSTMs (2017) (277)
- Trajectory-Based Modeling of Human Actions with Motion Reference Points (2012) (226)
- Multi-scale Deep Learning Architectures for Person Re-identification (2017) (218)
- High-level event recognition in unconstrained videos (2013) (190)
- Recurrent Fusion Network for Image Captioning (2018) (188)
- Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification (2016) (167)
- News Credibility Evaluation on Microblog with a Hierarchical Propagation Model (2014) (163)
- Multi-Level Semantic Feature Augmentation for One-Shot Learning (2018) (151)
- Learning Hash Codes with Listwise Supervision (2013) (142)
- Clean-Label Backdoor Attacks on Video Recognition Models (2020) (136)
- Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification (2014) (134)
- Semantic Proposal for Activity Localization in Videos via Sentence Query (2019) (122)
- CNN-Based Chinese NER with Lexicon Rethinking (2019) (121)
- Weakly Supervised Dense Video Captioning (2017) (115)
- Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content (2018) (115)
- Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching (2010) (111)
- WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection (2020) (105)
- Evaluating Two-Stream CNN for Video Classification (2015) (105)
- Black-box Adversarial Attacks on Video Recognition Models (2019) (104)
- Portfolio Choices with Orthogonal Bandit Learning (2015) (104)
- Hookworm Detection in Wireless Capsule Endoscopy Images With Deep Learning (2018) (103)
- Deep Learning for Video Classification and Captioning (2016) (101)
- Video event detection using motion relativity and visual relatedness (2008) (99)
- Predicting Emotions in User-Generated Videos (2014) (97)
- Domain adaptive semantic diffusion for large scale context-based video annotation (2009) (93)
- Harnessing Object and Scene Semantics for Large-Scale Video Understanding (2016) (90)
- Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search (2008) (86)
- Understanding and Predicting Interestingness of Videos (2013) (83)
- Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification (2017) (83)
- Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization (2015) (82)
- The MediaEval 2013 Affect Task: Violent Scenes Detection (2013) (81)
- Motion Guided Spatial Attention for Video Captioning (2019) (80)
- Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation (2006) (80)
- Query-Adaptive Image Search With Hash Codes (2013) (79)
- BEVT: BERT Pretraining of Video Transformers (2021) (79)
- Noise resistant graph ranking for improved web image search (2011) (78)
- Brain state decoding for rapid image retrieval (2009) (75)
- Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks (2018) (75)
- A Coarse-to-Fine Framework for Resource Efficient Video Recognition (2019) (74)
- Cross-Domain Sentiment Classification with Target Domain Specific Information (2018) (73)
- Hyperbolic Visual Embedding Learning for Zero-Shot Recognition (2020) (70)
- A relative similarity based method for interactive patient risk prediction (2015) (68)
- Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling (2015) (66)
- Image Block Augmentation for One-Shot Learning (2019) (64)
- Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning (2015) (63)
- Semantic context transfer across heterogeneous sources for domain adaptive video search (2009) (63)
- Concept-Driven Multi-Modality Fusion for Video Search (2011) (63)
- Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification (2020) (62)
- M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection (2021) (61)
- Super Fast Event Recognition in Internet Videos (2015) (61)
- Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation (2012) (60)
- Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval (2009) (60)
- Trainable Undersampling for Class-Imbalance Learning (2019) (59)
- Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging (2018) (58)
- Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help? (2006) (58)
- Partial Copy Detection in Videos: A Benchmark and an Evaluation of Popular Methods (2016) (55)
- VCDB: A Large-Scale Database for Partial Copy Detection in Videos (2014) (54)
- Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos (2020) (53)
- Matching User Photos to Online Products with Robust Deep Features (2016) (53)
- Object Detection from Scratch with Deep Supervision (2018) (53)
- Long-Term Cloth-Changing Person Re-identification (2020) (50)
- Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction (2017) (49)
- Image Classification With Tailored Fine-Grained Dictionaries (2018) (49)
- VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection (2009) (48)
- Lost in binarization: query-adaptive ranking for similar image search with compact codes (2011) (46)
- CU-VIREO 374: Fusing Columbia 374 and VIREO 374 for Large Scale Semantic Concept Detection (2008) (46)
- Video Emotion Recognition with Transferred Deep Feature Encodings (2016) (46)
- Learning Hybrid Part Filters for Scene Recognition (2012) (46)
- Adaptively Weighted Multi-task Deep Network for Person Attribute Classification (2017) (45)
- Learning to Score Figure Skating Sport Videos (2020) (45)
- Semantic Feature Augmentation in Few-shot Learning (2018) (45)
- Sampling and Ontologically Pooling Web Images for Visual Concept Learning (2012) (44)
- Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces (2008) (44)
- An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation (2019) (43)
- Discovering joint audio–visual codewords for video event detection (2013) (43)
- Fusing Multi-Stream Deep Networks for Video Classification (2015) (43)
- Video Event Detection Using Motion Relativity and Feature Selection (2014) (42)
- Towards textually describing complex video contents with audio-visual concept classifiers (2011) (40)
- Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition (2016) (39)
- Label diagnosis through self tuning for web image search (2009) (38)
- On the sampling of web images for learning visual concept classifiers (2010) (37)
- Recent Advances in Zero-shot Recognition (2017) (37)
- Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language (2020) (37)
- AdaViT: Adaptive Vision Transformers for Efficient Image Recognition (2021) (36)
- Cross-domain Contrastive Learning for Unsupervised Domain Adaptation (2021) (35)
- Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval (2022) (35)
- Binary Optimized Hashing (2016) (35)
- TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval (2019) (34)
- SUPER: towards real-time event recognition in internet videos (2012) (34)
- Beauty is here: evaluating aesthetics in videos using multimodal features and free training data (2013) (34)
- Joint audio-visual bi-modal codewords for video event detection (2012) (33)
- Which Looks Like Which: Exploring Inter-class Relationships in Fine-Grained Visual Categorization (2014) (33)
- Re-Caption: Saliency-Enhanced Image Captioning Through Two-Phase Learning (2020) (33)
- Modeling Scene and Object Contexts for Human Action Retrieval With Few Examples (2011) (32)
- Non-local NetVLAD Encoding for Video Classification (2018) (32)
- Benchmarking Violent Scenes Detection in movies (2014) (32)
- Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition (2020) (31)
- Dense Dilated Network for Few Shot Action Recognition (2018) (31)
- VSD2014: A dataset for violent scenes detection in hollywood movies and web videos (2015) (29)
- Deep Learning for Video Captioning: A Review (2019) (29)
- Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt (2020) (29)
- Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition (2018) (28)
- OmniVL: One Foundation Model for Image-Language and Video-Language Tasks (2022) (28)
- Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation (2020) (28)
- DB-LSTM: Densely-connected Bi-directional LSTM for human action recognition (2020) (28)
- Sketch Recognition with Deep Visual-Sequential Fusion Model (2017) (27)
- Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better (2021) (26)
- Exploring inter-concept relationship with context space for semantic video indexing (2009) (25)
- Deep Ranking for Image Zero-Shot Multi-Label Classification (2020) (25)
- Towards Transferable Adversarial Attacks on Vision Transformers (2021) (25)
- A Dynamic Frame Selection Framework for Fast Video Recognition (2020) (25)
- Dense Dilated Network for Video Action Recognition (2019) (24)
- A Study of Multi-Task and Region-Wise Deep Learning for Food Ingredient Recognition (2020) (24)
- Video Relation Detection via Multiple Hypothesis Association (2020) (23)
- Bag-of-visual-words expansion using visual relatedness for video indexing (2008) (22)
- Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks (2014) (22)
- Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and search (2007) (21)
- Regional Gating Neural Networks for Multi-label Image Classification (2016) (21)
- Aggregating Frame-level Features for Large-Scale Video Classification (2017) (20)
- Exploiting Objects with LSTMs for Video Categorization (2016) (20)
- Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues (2016) (19)
- Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data (2021) (19)
- Motion Guided Region Message Passing for Video Captioning (2021) (19)
- Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning (2021) (19)
- Imbalanced Gradients: A New Cause of Overestimated Adversarial Robustness (2020) (18)
- Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model (2019) (18)
- A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization (2018) (18)
- DeepProduct: Mobile Product Search With Portable Deep Features (2018) (18)
- Efficient Video Transformers with Spatial-Temporal Token Selection (2021) (18)
- Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation (2019) (18)
- Learning Multiple Relative Attributes With Humans in the Loop (2014) (18)
- Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes (2013) (18)
- Learning to Generate and Edit Hairstyles (2017) (18)
- Real-time summarization of user-generated videos based on semantic recognition (2014) (17)
- Recurrent Memory Reasoning Network for Expert Finding in Community Question Answering (2020) (17)
- Vocabulary-Informed Zero-Shot and Open-Set Learning (2020) (17)
- SVTR: Scene Text Recognition with a Single Visual Model (2022) (17)
- ObjectFormer for Image Manipulation Detection and Localization (2022) (17)
- Flexible multi-task learning with latent task grouping (2016) (17)
- Semi-Supervised Vision Transformers (2021) (16)
- Multi-modal Cooking Workflow Construction for Food Recipes (2020) (15)
- VideoLT: Large-scale Long-tailed Video Recognition (2021) (15)
- Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval (2021) (15)
- Frame-Transformer Emotion Classification Network (2017) (15)
- Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent (2019) (15)
- CHCF: A Cloud-Based Heterogeneous Computing Framework for Large-Scale Image Retrieval (2015) (14)
- Matching Image and Sentence With Multi-Faceted Representations (2020) (14)
- Beyond Semantic Search: What You Observe May Not Be What You Think (2008) (14)
- Visual Relations Augmented Cross-modal Retrieval (2020) (13)
- Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues (2014) (13)
- What Do Deep Nets Learn? Class-wise Patterns Revealed in the Input Space (2021) (13)
- GPU-based MapReduce for large-scale near-duplicate video retrieval (2015) (13)
- Hierarchical Visualization of Video Search Results for Topic-Based Browsing (2016) (12)
- Special issue on Multimedia Event Detection (2013) (12)
- Multiple Task Learning Using Iteratively Reweighted Least Square (2013) (12)
- FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification (2020) (11)
- A Bayesian Hashing approach and its application to face recognition (2016) (11)
- Take Goods from Shelves: A Dataset for Class-Incremental Object Detection (2019) (11)
- Learning Semantic Feature Map for Visual Content Recognition (2017) (11)
- Cross-Modal Transferable Adversarial Attacks from Images to Videos (2021) (11)
- Sparse Temporal Causal Convolution for Efficient Action Modeling (2019) (11)
- Person-level Action Recognition in Complex Events via TSD-TSM Networks (2020) (10)
- Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation (2014) (10)
- Generating Keyword Queries for Natural Language Queries to Alleviate Lexical Chasm Problem (2018) (10)
- Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression (2019) (10)
- Pose-Guided Person Image Synthesis in the Non-Iconic Views (2020) (9)
- Learning to score the figure skating sports videos (2018) (9)
- Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition (2020) (9)
- The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features (2012) (9)
- Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications (2018) (9)
- Feature Deformation Meta-Networks in Image Captioning of Novel Objects (2020) (8)
- On Stochastic Primal-Dual Hybrid Gradient Approach for Compositely Regularized Minimization (2016) (8)
- Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning (2019) (8)
- Dual Skipping Networks (2017) (8)
- BigVid at MediaEval 2016: Predicting Interestingness in Images and Videos (2016) (8)
- Boosting the Transferability of Video Adversarial Examples via Temporal Translation (2021) (8)
- Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective (2018) (7)
- Supplementary of Multi-scale Deep Learning Architectures for Person Re-identification (2017) (7)
- Web video categorization using category-predictive classifiers and category-specific concept classifiers (2016) (7)
- Attacking Video Recognition Models with Bullet-Screen Comments (2021) (7)
- Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning (2022) (7)
- VIREO-374: LSCOM Semantic Concept Detectors Using Local Keypoint Features (2007) (7)
- Strong geometrical consistency in large scale partial-duplicate image search (2013) (7)
- Scene Graph Refinement Network for Visual Question Answering (2022) (7)
- A Framework of Video Coding for Compressing Near-Duplicate Videos (2014) (6)
- Exploring Semantic Concept Using Local Invariant Features (2006) (6)
- FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting (2021) (6)
- Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding (2022) (6)
- Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (2015) (6)
- DeepProduct (2018) (6)
- Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006 (2006) (6)
- Learning part-based mid-level representation for visual recognition (2018) (6)
- CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition (2021) (6)
- FCVID: Fudan-Columbia Video Dataset (2016) (6)
- WildDeepfake (2020) (5)
- TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved (2019) (5)
- Two-stage Visual Cues Enhancement Network for Referring Image Segmentation (2021) (5)
- Video Moment Retrieval from Text Queries via Single Frame Annotation (2022) (5)
- Multiple task learning with flexible structure regularization (2016) (5)
- Large scale semantic concept detection, fusion, and selection for domain adaptive video search (2009) (5)
- MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes (2022) (5)
- Learning Layer-Skippable Inference Network (2020) (4)
- Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation (2021) (4)
- Towards Optimal CNN Descriptors for Large-Scale Image Retrieval (2019) (4)
- HMS: Hierarchical Modality Selection for Efficient Video Recognition (2021) (4)
- LSVC2017: Large-Scale Video Classification Challenge (2017) (4)
- Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition (2022) (4)
- Placing Videos on a Semantic Hierarchy for Search Result Navigation (2014) (4)
- SVFormer: Semi-supervised Video Transformer for Action Recognition (2022) (3)
- Reformulating natural language queries using sequence-to-sequence models (2019) (3)
- Predicting Content Similarity via Multimodal Modeling for Video-In-Video Advertising (2021) (3)
- Categorizing Big Video Data on the Web: Challenges and Opportunities (2015) (3)
- Discovering joint audio–visual codewords for video event detection (2013) (3)
- Ontology-based visual word matching for near-duplicate retrieval (2008) (3)
- Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network (2018) (3)
- Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing (2014) (3)
- Adaptive Proximal Average Approximation for Composite Convex Minimization (2017) (3)
- Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos (2021) (3)
- ResFormer: Scaling ViTs with Multi-Resolution Training (2022) (3)
- Optimal Bayesian Hashing for Efficient Face Recognition (2015) (3)
- Look Before You Match: Instance Understanding Matters in Video Object Segmentation (2022) (3)
- Self-supervised Learning for Semi-supervised Temporal Language Grounding (2021) (3)
- ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning (2022) (3)
- Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors (2022) (3)
- Multi-Prompt Alignment for Multi-source Unsupervised Domain Adaptation (2022) (2)
- SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition (2022) (2)
- ASM'15: The 1st International Workshop on Affect and Sentiment in Multimedia (2015) (2)
- Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications (2021) (2)
- TGDM: Target Guided Dynamic Mixup for Cross-Domain Few-Shot Learning (2022) (2)
- YOLO-based Adaptive Window Two-stream Convolutional Neural Network for Video Classification (2017) (2)
- Fast Summarization of User-Generated Videos Using Semantic, Emotional and Quality Clues (2016) (2)
- Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding (2022) (2)
- Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling (2022) (2)
- Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (2022) (2)
- Adaptive Split-Fusion Transformer (2022) (2)
- A fast video event recognition system and its application to video search (2012) (1)
- Stacked multichannel autoencoder – an efficient way of learning from synthetic data (2018) (1)
- Incorporating Locality of Images to Generate Targeted Transferable Adversarial Examples (2022) (1)
- Deeper Insights into ViTs Robustness towards Common Corruptions (2022) (1)
- Story-driven Video Editing (2021) (1)
- Iterative object and part transfer for fine-grained recognition (2017) (1)
- Smart Advertising in Videos Based on Comprehensive Content Analytics (2019) (1)
- Learning to score and summarize figure skating sport videos (2018) (1)
- Imbalanced gradients: a subtle cause of overestimated adversarial robustness (2020) (1)
- Extreme vocabulary learning (2019) (1)
- High-level event recognition in unconstrained videos (2012) (1)
- Ingredient-enriched Recipe Generation from Cooking Videos (2022) (1)
- VSCC'2017: Visual Analysis for Smart and Connected Communities (2017) (1)
- On the pooling of positive examples with ontology for visual concept learning (2011) (1)
- OmniTracker: Unifying Object Tracking by Tracking-with-Detection (2023) (1)
- Exploring the Consistency of Segment-level and Video-level Predictions for Improved Temporal Concept Localization in Videos (2019) (1)
- Proceedings of the Workshop on Large-Scale Video Classification Challenge (2017) (0)
- Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization (2017) (0)
- Deeper Insights into the Robustness of ViTs towards Common Corruptions (2022) (0)
- HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition (2021) (0)
- Adaptive Temporal Grouping for Black-box Adversarial Attacks on Videos (2022) (0)
- Data-Free Network Debiasing for Long-Tailed Visual Recognition (2022) (0)
- Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities, VSCC@MM 2017, Mountain View, CA, USA, October 23, 2017 (2017) (0)
- Stacked multichannel autoencoder – an efficient way of learning from synthetic data (2018) (0)
- NTT-Fudan Team @ TRECVID 2015: Multimedia Event Detection (2015) (0)
- Session details: Oral Session 2: Content analysis (2015) (0)
- Text-driven Video Prediction (2022) (0)
- Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization (2023) (0)
- Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos (2020) (0)
- Session details: Social-video semantics (2012) (0)
- PromptFusion: Decoupling Stability and Plasticity for Continual Learning (2023) (0)
- FDU Participation in TRECVID 2019 VTT Task (2019) (0)
- Session details: Oral Session 3: Applications (2015) (0)
- Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation (2022) (0)
- VideoLT: Large-scale Long-tailed Video Recognition (Supplementary Material) (2021) (0)
- Locate before Answering: Answer Guided Question Localization for Video Question Answering (2022) (0)
- Joint Audio-Visual Signatures for Web Video Analysis (2010) (0)
- Long-Term Cloth-Changing Person Re-identification (Supplementary Material) (2020) (0)
- Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos (2015) (0)
- Large-scale video semantic recognition based on consistency of segment-level and video-level predictions (2020) (0)
- A Multimodal Framework for Video Ads Understanding (2021) (0)
- Proceedings of the 1st International Workshop on Affect & Sentiment in Multimedia (2015) (0)
- ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System (2023) (0)
- Composite Binary Decomposition Networks (2018) (0)
- Semantic Video Search by Exploiting Large-Scale Visual Concepts (2009) (0)
- Implicit Temporal Modeling with Learnable Alignment for Video Recognition (2023) (0)
What Schools Are Affiliated With Yugang Jiang?
Yugang Jiang is affiliated with the following schools: