Yugang Jiang

Yugang Jiang's AcademicInfluence.com Rankings

Engineering: World Rank #5072, Historical Rank #6311
Electrical Engineering: World Rank #1354, Historical Rank #1445

Computer Science: World Rank #6494, Historical Rank #6847
Algorithms: World Rank #235, Historical Rank #238
Machine Learning: World Rank #2129, Historical Rank #2157
Database: World Rank #3578, Historical Rank #3729

Yugang Jiang's Degrees

Bachelors, Electrical Engineering, Tsinghua University

Yugang Jiang's Published Works

[Figure: number of citations in a given year to any of this author's works]

[Figure: total number of citations to the author's works by year of publication, highlighting the author's most important works]

Published Works (title, year, citation count)

Supervised hashing with kernels (2012) (1378)
Evaluating bag-of-visual-words representations in scene classification (2007) (921)
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images (2018) (917)
Towards optimal bag-of-features for object categorization and semantic video retrieval (2007) (721)
DSOD: Learning Deeply Supervised Object Detectors from Scratch (2017) (530)
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification (2015) (412)
NAIS: Neural Attentive Item Similarity Model for Recommendation (2018) (358)
The MediaMill TRECVID 2006 Semantic Video Search Engine (2006) (357)
Pose-Normalized Image Generation for Person Re-identification (2017) (343)
The THUMOS challenge on action recognition for videos "in the wild" (2016) (331)
Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks (2015) (320)
Consumer video understanding: a benchmark database and an evaluation of human and machine performance (2011) (290)
Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study (2010) (284)
Learning Fashion Compatibility with Bidirectional LSTMs (2017) (277)
Trajectory-Based Modeling of Human Actions with Motion Reference Points (2012) (226)
Multi-scale Deep Learning Architectures for Person Re-identification (2017) (218)
High-level event recognition in unconstrained videos (2013) (190)
Recurrent Fusion Network for Image Captioning (2018) (188)
Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification (2016) (167)
News Credibility Evaluation on Microblog with a Hierarchical Propagation Model (2014) (163)
Multi-Level Semantic Feature Augmentation for One-Shot Learning (2018) (151)
Learning Hash Codes with Listwise Supervision (2013) (142)
Clean-Label Backdoor Attacks on Video Recognition Models (2020) (136)
Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification (2014) (134)
Semantic Proposal for Activity Localization in Videos via Sentence Query (2019) (122)
CNN-Based Chinese NER with Lexicon Rethinking (2019) (121)
Weakly Supervised Dense Video Captioning (2017) (115)
Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content (2018) (115)
Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching (2010) (111)
WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection (2020) (105)
Evaluating Two-Stream CNN for Video Classification (2015) (105)
Black-box Adversarial Attacks on Video Recognition Models (2019) (104)
Portfolio Choices with Orthogonal Bandit Learning (2015) (104)
Hookworm Detection in Wireless Capsule Endoscopy Images With Deep Learning (2018) (103)
Deep Learning for Video Classification and Captioning (2016) (101)
Video event detection using motion relativity and visual relatedness (2008) (99)
Predicting Emotions in User-Generated Videos (2014) (97)
Domain adaptive semantic diffusion for large scale context-based video annotation (2009) (93)
Harnessing Object and Scene Semantics for Large-Scale Video Understanding (2016) (90)
Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search (2008) (86)
Understanding and Predicting Interestingness of Videos (2013) (83)
Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification (2017) (83)
Heterogeneous Knowledge Transfer in Video Emotion Recognition, Attribution and Summarization (2015) (82)
The MediaEval 2013 Affect Task: Violent Scenes Detection (2013) (81)
Motion Guided Spatial Attention for Video Captioning (2019) (80)
Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation (2006) (80)
Query-Adaptive Image Search With Hash Codes (2013) (79)
BEVT: BERT Pretraining of Video Transformers (2021) (79)
Noise resistant graph ranking for improved web image search (2011) (78)
Brain state decoding for rapid image retrieval (2009) (75)
Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks (2018) (75)
A Coarse-to-Fine Framework for Resource Efficient Video Recognition (2019) (74)
Cross-Domain Sentiment Classification with Target Domain Specific Information (2018) (73)
Hyperbolic Visual Embedding Learning for Zero-Shot Recognition (2020) (70)
A relative similarity based method for interactive patient risk prediction (2015) (68)
Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling (2015) (66)
Image Block Augmentation for One-Shot Learning (2019) (64)
Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning (2015) (63)
Semantic context transfer across heterogeneous sources for domain adaptive video search (2009) (63)
Concept-Driven Multi-Modality Fusion for Video Search (2011) (63)
Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification (2020) (62)
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection (2021) (61)
Super Fast Event Recognition in Internet Videos (2015) (61)
Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation (2012) (60)
Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval (2009) (60)
Trainable Undersampling for Class-Imbalance Learning (2019) (59)
Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging (2018) (58)
Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help? (2006) (58)
Partial Copy Detection in Videos: A Benchmark and an Evaluation of Popular Methods (2016) (55)
VCDB: A Large-Scale Database for Partial Copy Detection in Videos (2014) (54)
Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos (2020) (53)
Matching User Photos to Online Products with Robust Deep Features (2016) (53)
Object Detection from Scratch with Deep Supervision (2018) (53)
Long-Term Cloth-Changing Person Re-identification (2020) (50)
Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction (2017) (49)
Image Classification With Tailored Fine-Grained Dictionaries (2018) (49)
VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection (2009) (48)
Lost in binarization: query-adaptive ranking for similar image search with compact codes (2011) (46)
CU-VIREO 374: Fusing Columbia 374 and VIREO 374 for Large Scale Semantic Concept Detection (2008) (46)
Video Emotion Recognition with Transferred Deep Feature Encodings (2016) (46)
Learning Hybrid Part Filters for Scene Recognition (2012) (46)
Adaptively Weighted Multi-task Deep Network for Person Attribute Classification (2017) (45)
Learning to Score Figure Skating Sport Videos (2020) (45)
Semantic Feature Augmentation in Few-shot Learning (2018) (45)
Sampling and Ontologically Pooling Web Images for Visual Concept Learning (2012) (44)
Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces (2008) (44)
An End-to-End Architecture for Class-Incremental Object Detection with Knowledge Distillation (2019) (43)
Discovering joint audio–visual codewords for video event detection (2013) (43)
Fusing Multi-Stream Deep Networks for Video Classification (2015) (43)
Video Event Detection Using Motion Relativity and Feature Selection (2014) (42)
Towards textually describing complex video contents with audio-visual concept classifiers (2011) (40)
Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition (2016) (39)
Label diagnosis through self tuning for web image search (2009) (38)
On the sampling of web images for learning visual concept classifiers (2010) (37)
Recent Advances in Zero-shot Recognition (2017) (37)
Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language (2020) (37)
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition (2021) (36)
Cross-domain Contrastive Learning for Unsupervised Domain Adaptation (2021) (35)
Spatial-Temporal Graphs for Cross-Modal Text2Video Retrieval (2022) (35)
Binary Optimized Hashing (2016) (35)
TC-Net for iSBIR: Triplet Classification Network for Instance-level Sketch Based Image Retrieval (2019) (34)
SUPER: towards real-time event recognition in internet videos (2012) (34)
Beauty is here: evaluating aesthetics in videos using multimodal features and free training data (2013) (34)
Joint audio-visual bi-modal codewords for video event detection (2012) (33)
Which Looks Like Which: Exploring Inter-class Relationships in Fine-Grained Visual Categorization (2014) (33)
Re-Caption: Saliency-Enhanced Image Captioning Through Two-Phase Learning (2020) (33)
Modeling Scene and Object Contexts for Human Action Retrieval With Few Examples (2011) (32)
Non-local NetVLAD Encoding for Video Classification (2018) (32)
Benchmarking Violent Scenes Detection in movies (2014) (32)
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition (2020) (31)
Dense Dilated Network for Few Shot Action Recognition (2018) (31)
VSD2014: A dataset for violent scenes detection in hollywood movies and web videos (2015) (29)
Deep Learning for Video Captioning: A Review (2019) (29)
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt (2020) (29)
Harnessing Synthesized Abstraction Images to Improve Facial Attribute Recognition (2018) (28)
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks (2022) (28)
Pixel2Mesh: 3D Mesh Model Generation via Image Guided Deformation (2020) (28)
DB-LSTM: Densely-connected Bi-directional LSTM for human action recognition (2020) (28)
Sketch Recognition with Deep Visual-Sequential Fusion Model (2017) (27)
Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better (2021) (26)
Exploring inter-concept relationship with context space for semantic video indexing (2009) (25)
Deep Ranking for Image Zero-Shot Multi-Label Classification (2020) (25)
Towards Transferable Adversarial Attacks on Vision Transformers (2021) (25)
A Dynamic Frame Selection Framework for Fast Video Recognition (2020) (25)
Dense Dilated Network for Video Action Recognition (2019) (24)
A Study of Multi-Task and Region-Wise Deep Learning for Food Ingredient Recognition (2020) (24)
Video Relation Detection via Multiple Hypothesis Association (2020) (23)
Bag-of-visual-words expansion using visual relatedness for video indexing (2008) (22)
Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks (2014) (22)
Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and search (2007) (21)
Regional Gating Neural Networks for Multi-label Image Classification (2016) (21)
Aggregating Frame-level Features for Large-Scale Video Classification (2017) (20)
Exploiting Objects with LSTMs for Video Categorization (2016) (20)
Fast Summarization of User-Generated Videos: Exploiting Semantic, Emotional, and Quality Clues (2016) (19)
Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data (2021) (19)
Motion Guided Region Message Passing for Video Captioning (2021) (19)
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning (2021) (19)
Imbalanced Gradients: A New Cause of Overestimated Adversarial Robustness (2020) (18)
Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model (2019) (18)
A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization (2018) (18)
DeepProduct: Mobile Product Search With Portable Deep Features (2018) (18)
Efficient Video Transformers with Spatial-Temporal Token Selection (2021) (18)
Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation (2019) (18)
Learning Multiple Relative Attributes With Humans in the Loop (2014) (18)
Fudan at MediaEval 2013: Violent Scenes Detection Using Motion Features and Part-Level Attributes (2013) (18)
Learning to Generate and Edit Hairstyles (2017) (18)
Real-time summarization of user-generated videos based on semantic recognition (2014) (17)
Recurrent Memory Reasoning Network for Expert Finding in Community Question Answering (2020) (17)
Vocabulary-Informed Zero-Shot and Open-Set Learning (2020) (17)
SVTR: Scene Text Recognition with a Single Visual Model (2022) (17)
ObjectFormer for Image Manipulation Detection and Localization (2022) (17)
Flexible multi-task learning with latent task grouping (2016) (17)
Semi-Supervised Vision Transformers (2021) (16)
Multi-modal Cooking Workflow Construction for Food Recipes (2020) (15)
VideoLT: Large-scale Long-tailed Video Recognition (2021) (15)
Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval (2021) (15)
Frame-Transformer Emotion Classification Network (2017) (15)
Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent (2019) (15)
CHCF: A Cloud-Based Heterogeneous Computing Framework for Large-Scale Image Retrieval (2015) (14)
Matching Image and Sentence With Multi-Faceted Representations (2020) (14)
Beyond Semantic Search: What You Observe May Not Be What You Think (2008) (14)
Visual Relations Augmented Cross-modal Retrieval (2020) (13)
Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues (2014) (13)
What Do Deep Nets Learn? Class-wise Patterns Revealed in the Input Space (2021) (13)
GPU-based MapReduce for large-scale near-duplicate video retrieval (2015) (13)
Hierarchical Visualization of Video Search Results for Topic-Based Browsing (2016) (12)
Special issue on Multimedia Event Detection (2013) (12)
Multiple Task Learning Using Iteratively Reweighted Least Square (2013) (12)
FM2u-Net: Face Morphological Multi-Branch Network for Makeup-Invariant Face Verification (2020) (11)
A Bayesian Hashing approach and its application to face recognition (2016) (11)
Take Goods from Shelves: A Dataset for Class-Incremental Object Detection (2019) (11)
Learning Semantic Feature Map for Visual Content Recognition (2017) (11)
Cross-Modal Transferable Adversarial Attacks from Images to Videos (2021) (11)
Sparse Temporal Causal Convolution for Efficient Action Modeling (2019) (11)
Person-level Action Recognition in Complex Events via TSD-TSM Networks (2020) (10)
Challenge Huawei challenge: Fusing multimodal features with deep neural networks for Mobile Video Annotation (2014) (10)
Generating Keyword Queries for Natural Language Queries to Alleviate Lexical Chasm Problem (2018) (10)
Comp-GAN: Compositional Generative Adversarial Network in Synthesizing and Recognizing Facial Expression (2019) (10)
Pose-Guided Person Image Synthesis in the Non-Iconic Views (2020) (9)
Learning to score the figure skating sports videos (2018) (9)
Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition (2020) (9)
The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features (2012) (9)
Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications (2018) (9)
Feature Deformation Meta-Networks in Image Captioning of Novel Objects (2020) (8)
On Stochastic Primal-Dual Hybrid Gradient Approach for Compositely Regularized Minimization (2016) (8)
Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning (2019) (8)
Dual Skipping Networks (2017) (8)
BigVid at MediaEval 2016: Predicting Interestingness in Images and Videos (2016) (8)
Boosting the Transferability of Video Adversarial Examples via Temporal Translation (2021) (8)
Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective (2018) (7)
Supplementary of Multi-scale Deep Learning Architectures for Person Re-identification (2017) (7)
Web video categorization using category-predictive classifiers and category-specific concept classifiers (2016) (7)
Attacking Video Recognition Models with Bullet-Screen Comments (2021) (7)
Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning (2022) (7)
VIREO-374: LSCOM Semantic Concept Detectors Using Local Keypoint Features (2007) (7)
Strong geometrical consistency in large scale partial-duplicate image search (2013) (7)
Scene Graph Refinement Network for Visual Question Answering (2022) (7)
A Framework of Video Coding for Compressing Near-Duplicate Videos (2014) (6)
Exploring Semantic Concept Using Local Invariant Features (2006) (6)
FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting (2021) (6)
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding (2022) (6)
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (2015) (6)
DeepProduct (2018) (6)
Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006 (2006) (6)
Learning part-based mid-level representation for visual recognition (2018) (6)
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition (2021) (6)
FCVID: Fudan-Columbia Video Dataset (2016) (6)
WildDeepfake (2020) (5)
TC-GAN: Triangle Cycle-Consistent GANs for Face Frontalization with Facial Features Preserved (2019) (5)
Two-stage Visual Cues Enhancement Network for Referring Image Segmentation (2021) (5)
Video Moment Retrieval from Text Queries via Single Frame Annotation (2022) (5)
Multiple task learning with flexible structure regularization (2016) (5)
Large scale semantic concept detection, fusion, and selection for domain adaptive video search (2009) (5)
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes (2022) (5)
Learning Layer-Skippable Inference Network (2020) (4)
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation (2021) (4)
Towards Optimal CNN Descriptors for Large-Scale Image Retrieval (2019) (4)
HMS: Hierarchical Modality Selection for Efficient Video Recognition (2021) (4)
LSVC2017: Large-Scale Video Classification Challenge (2017) (4)
Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition (2022) (4)
Placing Videos on a Semantic Hierarchy for Search Result Navigation (2014) (4)
SVFormer: Semi-supervised Video Transformer for Action Recognition (2022) (3)
Reformulating natural language queries using sequence-to-sequence models (2019) (3)
Predicting Content Similarity via Multimodal Modeling for Video-In-Video Advertising (2021) (3)
Categorizing Big Video Data on the Web: Challenges and Opportunities (2015) (3)
Discovering joint audio–visual codewords for video event detection (2013) (3)
Ontology-based visual word matching for near-duplicate retrieval (2008) (3)
Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network (2018) (3)
Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing (2014) (3)
Adaptive Proximal Average Approximation for Composite Convex Minimization (2017) (3)
Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos (2021) (3)
ResFormer: Scaling ViTs with Multi-Resolution Training (2022) (3)
Optimal Bayesian Hashing for Efficient Face Recognition (2015) (3)
Look Before You Match: Instance Understanding Matters in Video Object Segmentation (2022) (3)
Self-supervised Learning for Semi-supervised Temporal Language Grounding (2021) (3)
ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning (2022) (3)
Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors (2022) (3)
Multi-Prompt Alignment for Multi-source Unsupervised Domain Adaptation (2022) (2)
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition (2022) (2)
ASM'15: The 1st International Workshop on Affect and Sentiment in Multimedia (2015) (2)
Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications (2021) (2)
TGDM: Target Guided Dynamic Mixup for Cross-Domain Few-Shot Learning (2022) (2)
YOLO-based Adaptive Window Two-stream Convolutional Neural Network for Video Classification (2017) (2)
Fast Summarization of User-Generated Videos Using Semantic, Emotional and Quality Clues (2016) (2)
Suspected Object Matters: Rethinking Model's Prediction for One-stage Visual Grounding (2022) (2)
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling (2022) (2)
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (2022) (2)
Adaptive Split-Fusion Transformer (2022) (2)
A fast video event recognition system and its application to video search (2012) (1)
Stacked multichannel autoencoder – an efficient way of learning from synthetic data (2018) (1)
Incorporating Locality of Images to Generate Targeted Transferable Adversarial Examples (2022) (1)
Deeper Insights into ViTs Robustness towards Common Corruptions (2022) (1)
Story-driven Video Editing (2021) (1)
Iterative object and part transfer for fine-grained recognition (2017) (1)
Smart Advertising in Videos Based on Comprehensive Content Analytics (2019) (1)
Learning to score and summarize figure skating sport videos (2018) (1)
Imbalanced gradients: a subtle cause of overestimated adversarial robustness (2020) (1)
Extreme vocabulary learning (2019) (1)
High-level event recognition in unconstrained videos (2012) (1)
Ingredient-enriched Recipe Generation from Cooking Videos (2022) (1)
VSCC'2017: Visual Analysis for Smart and Connected Communities (2017) (1)
On the pooling of positive examples with ontology for visual concept learning (2011) (1)
OmniTracker: Unifying Object Tracking by Tracking-with-Detection (2023) (1)
Exploring the Consistency of Segment-level and Video-level Predictions for Improved Temporal Concept Localization in Videos (2019) (1)
Proceedings of the Workshop on Large-Scale Video Classification Challenge (2017) (0)
Left-Right Skip-DenseNets for Coarse-to-Fine Object Categorization (2017) (0)
Deeper Insights into the Robustness of ViTs towards Common Corruptions (2022) (0)
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition (2021) (0)
Adaptive Temporal Grouping for Black-box Adversarial Attacks on Videos (2022) (0)
Data-Free Network Debiasing for Long-Tailed Visual Recognition (2022) (0)
Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities, VSCC@MM 2017, Mountain View, CA, USA, October 23, 2017 (2017) (0)
Stacked multichannel autoencoder – an efficient way of learning from synthetic data (2018) (0)
NTT-Fudan Team @ TRECVID 2015: Multimedia Event Detection (2015) (0)
Session details: Oral Session 2: Content analysis (2015) (0)
Text-driven Video Prediction (2022) (0)
Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization (2023) (0)
Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos (2020) (0)
Session details: Social-video semantics (2012) (0)
PromptFusion: Decoupling Stability and Plasticity for Continual Learning (2023) (0)
FDU Participation in TRECVID 2019 VTT Task (2019) (0)
Session details: Oral Session 3: Applications (2015) (0)
Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation (2022) (0)
VideoLT: Large-scale Long-tailed Video Recognition (Supplementary Material) (2021) (0)
Locate before Answering: Answer Guided Question Localization for Video Question Answering (2022) (0)
Joint Audio-Visual Signatures for Web Video Analysis (2010) (0)
Long-Term Cloth-Changing Person Re-identification (Supplementary Material) (2020) (0)
Fudan at TRECVID 2015: Adaptive Feature Fusion for Multimedia Event Detection in Videos (2015) (0)
Large-scale video semantic recognition based on consistency of segment-level and video-level predictions (2020) (0)
A Multimodal Framework for Video Ads Understanding (2021) (0)
Proceedings of the 1st International Workshop on Affect & Sentiment in Multimedia (2015) (0)
ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System (2023) (0)
Composite Binary Decomposition Networks (2018) (0)
Semantic Video Search by Exploiting Large-Scale Visual Concepts (2009) (0)
Implicit Temporal Modeling with Learnable Alignment for Video Recognition (2023) (0)

What Schools Are Affiliated With Yugang Jiang?

Yugang Jiang is affiliated with the following schools:

Fudan University
