Caiming Xiong
#136,046
Most Influential Person Now
Caiming Xiong's AcademicInfluence.com Rankings
Caiming Xiongcomputer-science Degrees
Computer Science
#6243
World Rank
#6584
Historical Rank
Algorithms
#222
World Rank
#225
Historical Rank
Computational Linguistics
#1029
World Rank
#1043
Historical Rank
Machine Learning
#1957
World Rank
#1983
Historical Rank

Download Badge
Computer Science
Caiming Xiong's Degrees
- Bachelors Computer Science Peking University
Similar Degrees You Can Earn
Why Is Caiming Xiong Influential?
(Suggest an Edit or Addition)Caiming Xiong's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Pointer Sentinel Mixture Models (2016) (1299)
- A Deep Reinforced Model for Abstractive Summarization (2017) (1249)
- Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning (2016) (1166)
- Learned in Translation: Contextualized Word Vectors (2017) (812)
- Dynamic Memory Networks for Visual and Textual Question Answering (2016) (712)
- Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning (2018) (704)
- CTRL: A Conditional Transformer Language Model for Controllable Generation (2019) (701)
- Dynamic Coattention Networks For Question Answering (2016) (659)
- The Natural Language Decathlon: Multitask Learning as Question Answering (2018) (515)
- A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks (2016) (507)
- Prototypical Contrastive Learning of Unsupervised Representations (2020) (494)
- Non-Autoregressive Neural Machine Translation (2017) (430)
- End-to-End Dense Video Captioning with Masked Transformer (2018) (373)
- Quasi-Recurrent Neural Networks (2016) (346)
- Evaluating the Factual Consistency of Abstractive Text Summarization (2019) (344)
- ERASER: A Benchmark to Evaluate Rationalized NLP Models (2019) (325)
- Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems (2019) (313)
- Explain Yourself! Leveraging Language Models for Commonsense Reasoning (2019) (297)
- Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework (2015) (273)
- Streaming Hierarchical Video Segmentation (2012) (258)
- Neural Text Summarization: A Critical Evaluation (2019) (235)
- Multi-Hop Knowledge Graph Reasoning with Reward Shaping (2018) (233)
- Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting (2019) (229)
- Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering (2019) (210)
- TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue (2020) (199)
- Self-Monitoring Navigation Agent via Auxiliary Progress Estimation (2019) (188)
- Joint action recognition and pose estimation from video (2015) (184)
- Global-Locally Self-Attentive Encoder for Dialogue State Tracking (2018) (172)
- From image parsing to painterly rendering (2009) (166)
- A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation (2018) (156)
- BERTology Meets Biology: Interpreting Attention in Protein Language Models (2020) (153)
- Efficient and Robust Question Answering from Minimal Context over Documents (2018) (148)
- Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking (2019) (132)
- The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation (2019) (130)
- Global-to-local Memory Pointer Networks for Task-Oriented Dialogue (2019) (125)
- AdaFrame: Adaptive Frame Selection for Fast Video Recognition (2018) (125)
- GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing (2020) (124)
- CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases (2019) (113)
- Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning (2017) (110)
- Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning (2020) (106)
- Improving Abstraction in Text Summarization (2018) (104)
- DCN+: Mixed Objective and Deep Residual Coattention for Question Answering (2017) (97)
- SParC: Cross-Domain Semantic Parsing in Context (2019) (96)
- Can humans fly? Action understanding with multiple classes of actors (2015) (96)
- DART: Open-Domain Structured Data Record to Text Generation (2020) (96)
- CoMatch: Semi-supervised Learning with Contrastive Graph Regularization (2020) (94)
- Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions (2019) (94)
- Robustness Gym: Unifying the NLP Evaluation Landscape (2021) (91)
- Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing (2020) (90)
- Actionness Ranking with Lattice Conditional Ordinal Random Fields (2014) (84)
- Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills (2020) (80)
- Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT (2020) (76)
- A Coarse-to-Fine Framework for Resource Efficient Video Recognition (2019) (74)
- Random forests for metric learning with implicit pairwise position dependence (2012) (68)
- VD-BERT: A Unified Vision and Dialog Transformer with BERT (2020) (65)
- XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering (2019) (65)
- Taming MAML: Efficient unbiased meta-reinforcement learning (2019) (64)
- CTRLsum: Towards Generic Controllable Text Summarization (2020) (64)
- Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning (2021) (63)
- Learning From Noisy Anchors for One-Stage Object Detection (2019) (63)
- BERT is Not an Interlingua and the Bias of Tokenization (2019) (60)
- Proposal Learning for Semi-Supervised Object Detection (2020) (58)
- Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start (2020) (55)
- Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering (2019) (55)
- Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference (2020) (54)
- Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards (2019) (49)
- MoPro: Webly Supervised Learning with Momentum Prototypes (2020) (49)
- A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation (2018) (49)
- Interpretable Counting for Visual Question Answering (2017) (49)
- Structured Scene Memory for Vision-Language Navigation (2021) (48)
- Active Clustering with Model-Based Uncertainty Reduction (2014) (47)
- CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers (2020) (45)
- Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation (2018) (44)
- WSLLN:Weakly Supervised Natural Language Localization Networks (2019) (44)
- Accurate Annotation of Remote Sensing Images via Active Spectral Clustering with Little Expert Knowledge (2015) (42)
- Robot learning with a spatial, temporal, and causal and-or graph (2016) (42)
- Identifying Generalization Properties in Neural Networks (2018) (42)
- Deep neural language modeling enables functional protein generation across families (2021) (42)
- Improving End-to-End Speech Recognition with Policy Learning (2017) (40)
- Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models (2019) (40)
- Grounded Semantic Role Labeling (2016) (39)
- Theory-Inspired Path-Regularized Differential Network Architecture Search (2020) (39)
- How Important is the Train-Validation Split in Meta-Learning? (2020) (38)
- FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging (2020) (36)
- Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization (2020) (36)
- Competitive Experience Replay (2019) (36)
- Photon: A Robust Cross-Domain Text-to-SQL System (2020) (36)
- Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading (2020) (35)
- Learning from Noisy Data with Robust Representation Learning (2021) (35)
- StartNet: Online Detection of Action Start in Untrimmed Videos (2019) (32)
- Latent Domains Modeling for Visual Domain Adaptation (2014) (31)
- An Investigation of Phone-Based Subword Units for End-to-End Speech Recognition (2020) (30)
- Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games (2021) (30)
- Marker-less registration based on template tracking for augmented reality (2009) (29)
- Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition (2020) (29)
- Large language models generate functional protein sequences across diverse families. (2023) (28)
- Ensemble of Averages: Improving Model Selection and Boosting Performance in Domain Generalization (2021) (26)
- Unifying Question Answering, Text Classification, and Regression via Span Extraction (2019) (26)
- Spectral active clustering via purification of the K-Nearest neighbor graph (2012) (26)
- A Dynamic Frame Selection Framework for Fast Video Recognition (2020) (25)
- Semi-Supervised Nonlinear Distance Metric Learning via Forests of Max-Margin Cluster Hierarchies (2014) (25)
- Towards Understanding Hierarchical Learning: Benefits of Neural Representations (2020) (25)
- Composed Variational Natural Language Generation for Few-shot Intents (2020) (23)
- Online Structured Meta-learning (2020) (22)
- On the Generalization Gap in Reparameterizable Reinforcement Learning (2019) (21)
- Learning World Graphs to Accelerate Hierarchical Reinforcement Learning (2019) (19)
- MKD: a Multi-Task Knowledge Distillation Approach for Pretrained Language Models (2019) (19)
- Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width (2020) (18)
- Probing Task-Oriented Dialogue Representation from Language Models (2020) (18)
- Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation for Pretrained Models (2019) (18)
- Unsupervised Out-of-Domain Detection via Pre-trained Transformers (2021) (17)
- Unifying Question Answering and Text Classification via Span Extraction (2019) (17)
- Towards Noise-resistant Object Detection with Noisy Annotations (2020) (17)
- Merlion: A Machine Learning Library for Time Series (2021) (17)
- Explaining and Improving Model Behavior with k Nearest Neighbor Representations (2020) (17)
- Using Mode Connectivity for Loss Landscape Analysis (2018) (16)
- Improved Regularization Techniques for End-to-End Speech Recognition (2017) (15)
- Task similarity aware meta learning: theory-inspired improvement on MAML (2021) (15)
- WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos (2020) (15)
- Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading (2020) (15)
- Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification (2021) (15)
- Block-diagonal Hessian-free Optimization for Training Neural Networks (2017) (14)
- Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes (2014) (14)
- Recognizing Car Fluents from Video (2016) (14)
- Dictionary transfer for image denoising via domain adaptation (2012) (14)
- ESPRIT: Explaining Solutions to Physical Reasoning Tasks (2020) (13)
- Seeing is Worse than Believing: Reading People's Minds Better than Computer-Vision Methods Recognize Actions (2014) (13)
- Augmented Cyclic Adversarial Learning for Domain Adaptation (2018) (12)
- Unsupervised Paraphrase Generation via Dynamic Blocking (2020) (12)
- Policy Optimization for Markov Games: Unified Framework and Faster Convergence (2022) (11)
- A High-Quality Multilingual Dataset for Structured Documentation Translation (2020) (11)
- Efficient max-margin metric learning (2012) (11)
- Joint Energy-based Model Training for Better Calibrated Natural Language Understanding Models (2021) (11)
- Maximum Margin Dirichlet Process Mixtures for Clustering (2016) (11)
- Correction Networks: Meta-Learning for Zero-Shot Learning (2018) (10)
- Representation Learning for Sequence Data with Deep Autoencoding Predictive Components (2020) (10)
- A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs (2016) (10)
- AirTouch: Interacting with computer systems at a distance (2011) (10)
- Improving Limited Labeled Dialogue State Tracking with Self-Supervision (2020) (10)
- Neural Abstract Style Transfer for Chinese Traditional Painting (2018) (9)
- Coaction discovery: segmentation of common actions across multiple videos (2012) (8)
- Predicting with High Correlation Features (2019) (7)
- SEQ2SQL: GENERATING STRUCTURED QUERIES (2017) (7)
- Comprehensive Cross-Hierarchy Cluster Agreement Evaluation (2013) (7)
- Efficient and Differentiable Conformal Prediction with General Function Classes (2022) (7)
- A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning (2021) (7)
- A Unified Framework for Human-Robot Knowledge Transfer (2015) (6)
- GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning (2020) (6)
- DIME: An Information-Theoretic Difficulty Measure for AI Datasets (2019) (6)
- Online Active Constraint Selection For Semi-Supervised Clustering (2012) (5)
- Sketch-Fill-A-R: A Persona-Grounded Chit-Chat Generation Framework (2019) (5)
- A model of open source software maintenance activities (2009) (5)
- What’s New? Summarizing Contributions in Scientific Literature (2020) (5)
- Spectral active clustering of remote sensing images (2014) (4)
- Deleter: Leveraging BERT to Perform Unsupervised Successive Text Compression (2019) (4)
- Localized Calibration: Metrics and Recalibration (2021) (4)
- Artificial intelligence for streamlined immunofluorescence-based biomarker discovery in prostate cancer. (2020) (4)
- The Thieves on Sesame Street Are Polyglots — Extracting Multilingual Models from Monolingual APIs (2020) (4)
- Global Capacity Measures for Deep ReLU Networks via Path Sampling (2019) (4)
- Understanding the Under-Coverage Bias in Uncertainty Estimation (2021) (3)
- Towards the ImageNet-CNN of NLP: Pretraining Sentence Encoders with Machine Translation (2017) (3)
- Evaluating State-of-the-Art Classification Models Against Bayes Optimality (2021) (3)
- Action Understanding with Multiple Classes of Actors (2017) (3)
- Recent Progress in Deep Reinforcement Learning for Computer Vision and NLP (2017) (3)
- Compositional Structure Learning for Action Understanding (2014) (3)
- Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act Tagging (2020) (2)
- Private Deep Learning with Teacher Ensembles (2019) (2)
- Differentially Private Deep Learning with Smooth Sensitivity (2020) (2)
- Improved Online Conformal Prediction via Strongly Adaptive Online Learning (2023) (2)
- On the Diversity and Explainability of Recommender Systems: A Practical Framework for Enterprise App Recommendation (2021) (2)
- Sentinel gate for modulating auxiliary information in a long short-term memory (LSTM) neural network (2017) (2)
- Interactive Agent Modeling by Learning to Probe (2018) (2)
- The Compositional Nature of Verb and ArgumentRepresentations in the Human Brain (2013) (2)
- Robustness Evaluation of Transformer-based Form Field Extractors via Form Attacks (2021) (2)
- Active Clustering with Model-Based Uncertainty Reduction. (2017) (1)
- Assessing Local Generalization Capability in Deep Models (2020) (1)
- Improving Tail-Class Representation with Centroid Contrastive Learning (2021) (1)
- SEMI-SUPERVISED NONLINEAR DISTANCE METRIC LEARNING VIA RANDOM FOREST AND RELATIVE SIMILARITY ALGORITHM (2017) (1)
- MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation (2022) (1)
- Towards a parts-based approach to sub-cortical brain structure parsing (2011) (1)
- BINING RECENT INSIGHTS FOR LSTMS (2016) (1)
- Uncertainty Reduction for Active Image Clustering via a Hybrid Global-Local Uncertainty Model (2013) (1)
- SITION IN MULTI-TASK REINFORCEMENT LEARNING (2017) (1)
- Entropy Penalty: Towards Generalization Beyond the IID Assumption (2019) (1)
- NaturalCC: A Toolkit to Naturalize the Source Code Corpus (2020) (1)
- [CASPI] Causal-aware Safe Policy Improvement for Task-oriented Dialogue (2021) (1)
- ERMAS: Becoming Robust to Reward Function Sim-to-Real Gaps in Multi-Agent Simulations (2021) (0)
- Lower Bounds for Learning in Revealing POMDPs (2023) (0)
- Learning Adversarially Robust Policies in Multi-Agent Games (2022) (0)
- Recognizing Car Fluents from Video Supplementary Material (2016) (0)
- ENTROPY PENALTY: TOWARDS GENERALIZATION BE- (2019) (0)
- Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning (2020) (0)
- Identifying Generalization Properties in Neural Networks Identifying Generalization Properties in Neural Networks (2018) (0)
- C OMPETITIVE EXPERIENCE REPLAY (2019) (0)
- Style Recognition and Kinship Understanding (2019) (0)
- Learning to Play General-Sum Games Against Multiple Boundedly Rational Agents (2021) (0)
- The Compositional Nature of Event Representations in the Human Brain (2014) (0)
- Robust Domain Adaptation By Augmented Cyclic Adversarial Learning (2018) (0)
- Continual Learning via Explicit Structure Learning (2018) (0)
- Local Calibration: Metrics and Recalibration (Supplementary Material) (2022) (0)
- RÉSEAU NEURONAL QUASI-RÉCURRENT (2018) (0)
- NaturalCC (2022) (0)
- IARY PROGRESS ESTIMATION (2019) (0)
- RGRecSys (2022) (0)
- ODEL FOR A BSTRACTIVE S UMMARIZATION (2018) (0)
- Near-Zero-Cost Differentially Private Deep Learning with Teacher Ensembles (2019) (0)
- MP28-01 ARTIFICIAL INTELLIGENCE (AI) ACCURATELY AUTOMATE AND SPEED IMMUNOFLUORESCENCE (IF)-BASED DISCOVERY AND VALIDATION OF NOVEL PROGNOSTIC AND PREDICTIVE BIOMARKERS IN PROSTATE CANCER (2019) (0)
- DESC LIMIT 1 LSTM Query Decoder Attention Over Previous Utterances , Column Headers , Previous Query Bi LSTM Query Encoder Table Encoder (2019) (0)
- Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems (2023) (0)
- Building Salesforce Neural Machine Translation System (2020) (0)
- BiLSTM 1 BiLSTM 1 Coattention 1 Coattention 2 BiLSTM 2 BiLSTM 2 Output BiLSTM Question Document (2018) (0)
- Active Constraint Selection For Semi-Supervised Clustering (2012) (0)
- Object category recognition using boosting tree with heterogenous features (2007) (0)
- Learning Rich Nearest Neighbor Representations from Self-supervised Ensembles (2021) (0)
- Learning from and actively selecting pairwise constraints in data science (2014) (0)
- Learning World Graph Decompositions To Accelerate Reinforcement Learning (2019) (0)
- Guided Adaptive Credit Assignment for Sample Efficient Policy Optimization (2019) (0)
This paper list is powered by the following services: