Fei Sha
#165,211
Most Influential Person Now
Fei Sha's AcademicInfluence.com Rankings
Fei Sha's Computer Science Degrees
Computer Science
#9843
World Rank
#10326
Historical Rank
Machine Learning
#4415
World Rank
#4465
Historical Rank
Artificial Intelligence
#4770
World Rank
#4833
Historical Rank
Database
#6795
World Rank
#7034
Historical Rank

Computer Science
Fei Sha's Degrees
- PhD Computer Science University of California, Berkeley
- Masters Computer Science University of California, Berkeley
- Bachelors Computer Science Peking University
Why Is Fei Sha Influential?
Fei Sha's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Geodesic flow kernel for unsupervised domain adaptation (2012) (2052)
- Shallow Parsing with Conditional Random Fields (2003) (1570)
- Marginalized Denoising Autoencoders for Domain Adaptation (2012) (767)
- Synthesized Classifiers for Zero-Shot Learning (2016) (674)
- Learning a kernel matrix for nonlinear dimensionality reduction (2004) (577)
- Video Summarization with Long Short-Term Memory (2016) (522)
- Connecting the Dots with Landmarks: Discriminatively Learning Domain-Invariant Features for Unsupervised Domain Adaptation (2013) (467)
- An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild (2016) (456)
- DiscLDA: Discriminative Learning for Dimensionality Reduction and Classification (2008) (444)
- Actor-Attention-Critic for Multi-Agent Reinforcement Learning (2018) (425)
- Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification (2007) (400)
- Diverse Sequential Subset Selection for Supervised Video Summarization (2014) (383)
- Few-Shot Learning via Embedding Adaptation With Set-to-Set Functions (2018) (373)
- Learning with Whom to Share in Multi-task Feature Learning (2011) (356)
- Spectral Methods for Dimensionality Reduction (2006) (288)
- Deformable Spatial Pyramid Matching for Fast Dense Correspondences (2013) (257)
- Large Margin Hidden Markov Models for Automatic Speech Recognition (2006) (217)
- Information-Theoretical Learning of Discriminative Clusters for Unsupervised Domain Adaptation (2012) (205)
- Attention Correctness in Neural Image Captioning (2016) (203)
- Summary Transfer: Exemplar-Based Subset Selection for Video Summarization (2016) (193)
- Non-linear Metric Learning (2012) (187)
- Multiplicative Updates for Nonnegative Quadratic Programming (2007) (173)
- Multiplicative Updates for Nonnegative Quadratic Programming in Support Vector Machines (2002) (166)
- Large Margin Gaussian Mixture Modeling for Phonetic Classification and Recognition (2006) (164)
- Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning (2016) (156)
- Supervised Word Mover's Distance (2016) (146)
- Decorrelating Semantic Visual Attributes by Resisting the Urge to Share (2014) (144)
- Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts (2017) (136)
- Cross-Modal and Hierarchical Modeling of Video and Text (2018) (136)
- Reshaping Visual Datasets for Domain Adaptation (2013) (134)
- Graph Laplacian Regularization for Large-Scale Semidefinite Programming (2006) (128)
- Marginalized Denoising Auto-encoders for Nonlinear Representations (2014) (125)
- Sharing features between objects and their attributes (2011) (124)
- Analysis and extension of spectral methods for nonlinear dimensionality reduction (2005) (119)
- Regression on manifolds using kernel dimension reduction (2007) (108)
- Aligning where to see and what to tell: image caption with region-based attention and scene factorization (2015) (104)
- Cloud-enabled privacy-preserving collaborative learning for mobile sensing (2012) (103)
- Retrospective Encoders for Video Summarization (2018) (101)
- Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition (2014) (90)
- Sparse Compositional Metric Learning (2014) (83)
- Real-Time Pitch Determination of One or More Voices by Nonnegative Matrix Factorization (2004) (82)
- Robust web extraction: an approach based on a probabilistic tree-edit model (2009) (80)
- Comparison of Large Margin Training to Other Discriminative Methods for Phonetic Recognition by Hidden Markov Models (2007) (80)
- Learning Embedding Adaptation for Few-Shot Learning (2018) (78)
- Learning a Tree of Metrics with Disjoint Visual Features (2011) (74)
- Active site prediction using evolutionary and structural information (2010) (71)
- How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets (2014) (64)
- Multi-Task Learning for Sequence Tagging: An Empirical Study (2018) (58)
- Large-Margin Determinantal Point Processes (2014) (47)
- BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps (2020) (45)
- Discriminative non-negative matrix factorization for single-channel speech separation (2014) (45)
- Unsupervised Kernel Dimension Reduction (2010) (45)
- Predicting Pedestrian Counts in Crowded Scenes With Rich and High-Dimensional Features (2011) (43)
- Semantic Kernel Forests from Multiple Taxonomies (2012) (41)
- Kernel Approximation Methods for Speech Recognition (2017) (41)
- Classifier and Exemplar Synthesis for Zero-Shot Learning (2018) (41)
- A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning (2014) (40)
- Convex Optimizations for Distance Metric Learning and Pattern Classification [Applications Corner] (2010) (38)
- Robust Active Label Correction (2018) (38)
- Designing a socially assistive robot for personalized number concepts learning in preschool children (2015) (38)
- Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop (2011) (37)
- Large margin training of acoustic models for speech recognition (2007) (37)
- Learning Adaptive Classifiers Synthesis for Generalized Few-Shot Learning (2019) (35)
- Analogy-preserving Semantic Embedding for Visual Object Categorization (2013) (35)
- Co-training Transformer with Videos and Images Improves Action Recognition (2021) (33)
- Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning (2019) (33)
- Locally Linear Denoising on Image Manifolds (2010) (33)
- Similarity Learning for High-Dimensional Sparse Data (2014) (32)
- Statistical signal processing with nonnegativity constraints (2003) (31)
- Cross-Dataset Adaptation for Visual Question Answering (2018) (31)
- A fast online algorithm for large margin training of continuous density hidden Markov models (2009) (30)
- Marginalizing stacked linear denoising autoencoders (2015) (30)
- Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets (2017) (30)
- Learning Answer Embeddings for Visual Question Answering (2018) (28)
- Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning (2020) (27)
- Multiplicative Updates for Large Margin Classifiers (2003) (26)
- From sBoW to dCoT marginalized encoders for text representation (2012) (26)
- Multiband statistical learning for f0 estimation in speech (2004) (26)
- Demystifying Information-Theoretic Clustering (2013) (26)
- Improving Compositional Generalization with Latent Structure and Data Augmentation (2021) (26)
- FastMask: Segment Multi-scale Object Candidates in One Shot (2016) (25)
- Metric learning for reinforcement learning agents (2011) (25)
- Similarity Component Analysis (2013) (25)
- Active multi-view object recognition: A unifying view on online feature selection and view planning (2016) (24)
- A Study of Web Services Performance Prediction: A Client's Perspective (2011) (23)
- A Bayesian Theory of Mind Approach to Nonverbal Communication (2019) (23)
- Information Theoretical Clustering via Semidefinite Programming (2011) (22)
- Mention Memory: incorporating textual knowledge into Transformers through entity mention attention (2021) (21)
- Exponential Integration for Hamiltonian Monte Carlo (2015) (20)
- When MAML Can Adapt Fast and How to Assist When It Cannot (2019) (19)
- An alternative text representation to TF-IDF and Bag-of-Words (2013) (19)
- Uncertainty Estimation with Infinitesimal Jackknife, Its Distribution and Mean-Field Approximation (2020) (18)
- Aiming to Know You Better Perhaps Makes Me a More Engaging Dialogue Partner (2018) (18)
- AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization (2020) (18)
- Learning to Represent Image and Text with Denotation Graphs (2020) (17)
- A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus (2020) (17)
- LabelBank: Revisiting Global Perspectives for Semantic Segmentation (2017) (17)
- AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning (2020) (16)
- Predicting Likability of Speakers with Gaussian Processes (2012) (16)
- Matrix updates for perceptron training of continuous density hidden Markov models (2009) (15)
- Hyper-parameter Tuning under a Budget Constraint (2019) (14)
- An Empirical Study on The Properties of Random Bases for Kernel Methods (2017) (13)
- Large Margin Training of Continuous Density Hidden Markov Models (2009) (13)
- Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing (2022) (13)
- Active Multi-view Object Recognition and Online Feature Selection (2015) (13)
- Topic Augmented Generator for Abstractive Summarization (2019) (13)
- ReadTwice: Reading Very Large Documents with Memories (2021) (12)
- Geodesic Flow Kernel and Landmarks: Kernel Methods for Unsupervised Domain Adaptation (2017) (12)
- HyperPINN: Learning parameterized differential equations with physics-informed hypernetworks (2021) (12)
- Learning Classifier Synthesis for Generalized Few-Shot Learning (2019) (12)
- Distributed Frank-Wolfe Algorithm: A Unified Framework for Communication-Efficient Sparse Learning (2014) (11)
- A comparison between deep neural nets and kernel acoustic models for speech recognition (2016) (10)
- Drinking From a Firehose: Continual Learning With Web-Scale Natural Language (2020) (10)
- Two-Stage Metric Learning (2014) (9)
- Systematic Generalization on gSCAN: What is Nearly Solved and What is Next? (2021) (9)
- Synthesize Policies for Transfer and Adaptation across Tasks and Environments (2019) (9)
- Mean-Field Approximation to Gaussian-Softmax Integral with Application to Uncertainty Estimation (2020) (9)
- Online Learning and Acoustic Feature Adaptation in Large-Margin Hidden Markov Models (2010) (9)
- Metric Learning for Ordinal Data (2016) (9)
- Divide, Share, and Conquer: Multi-task Attribute Learning with Selective Sharing (2017) (8)
- Multiplicative Updates for L1-Regularized Linear and Logistic Regression (2007) (8)
- A Probabilistic Model for Joint Learning of Word Embeddings from Texts and Images (2018) (8)
- Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task (2016) (7)
- Towards Interactive Object Recognition (2014) (7)
- Embedding Adaptation is Still Needed for Few-Shot Learning (2021) (7)
- Towards a Personalized Model of Number Concepts Learning in Preschool Children (2015) (6)
- DOCENT: Learning Self-Supervised Entity Representations from Large Document Collections (2021) (6)
- Speech Recognition with Segmental Conditional Random Fields: Final Report from the 2010 JHU Summer Workshop (2010) (6)
- Neural Theorem Provers Do Not Learn Rules Without Exploration (2019) (6)
- Visual Storytelling via Predicting Anchor Word Embeddings in the Stories (2020) (6)
- Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks (2021) (5)
- The More the Merrier: Parameter Learning for Graphical Models with Multiple MAPs (2013) (5)
- Generate-and-Retrieve: Use Your Predictions to Improve Retrieval for Semantic Parsing (2022) (5)
- Decoupling Adaptation from Modeling with Meta-Optimizers for Meta Learning (2019) (5)
- Domain Adaptation in Machine Learning and Speech Processing (2012) (4)
- Supplementary Material: Video Summarization with Long Short-term Memory (2016) (4)
- Visually Grounded Concept Composition (2021) (4)
- Learning the Kernel Matrix with Low-Rank Multiplicative Shaping (2012) (4)
- Large-margin feature adaptation for automatic speech recognition (2009) (4)
- Rapid Feature Learning with Stacked Linear Denoisers (2011) (4)
- Recalling Holistic Information for Semantic Segmentation (2016) (3)
- FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference (2022) (3)
- Online learning of large margin hidden Markov models for automatic speech recognition (2011) (3)
- Learning Discriminative Metrics via Generative Models and Kernel Learning (2011) (2)
- Possibility Before Utility: Learning And Using Hierarchical Affordances (2022) (2)
- FastMask: Segment Object Multi-scale Candidates in One Shot (2016) (2)
- Policy Learning and Evaluation with Randomized Quasi-Monte Carlo (2022) (1)
- Amortized Inference of Variational Bounds for Learning Noisy-OR (2019) (1)
- Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute (2023) (1)
- Classifier and Exemplar Synthesis for Zero-Shot Learning (2019) (0)
- Supplementary Material: Retrospective Encoders for Video Summarization (2018) (0)
- Evolve Smoothly, Fit Consistently: Learning Smooth Latent Dynamics For Advection-Dominated Systems (2023) (0)
- Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition (2014) (0)
- Adversarially robust subspace learning in the spiked covariance model (2022) (0)
- A Computational Approach to Earlier Detection and Intervention for Infants with Developmental Disabilities (0)
- Large Margin Discriminative Learning Methods for Acoustic Modeling (2006) (0)
- Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL (2023) (0)
- Invited Talk Abstracts (2010) (0)
- Robust Active Label Correction (Supplementary Material) (2018) (0)
- Learning to Represent Images and Texts with Denotation Graphs (2020) (0)
- Supplementary Material: Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning (2021) (0)
- Active Multi-View Object Recognition and Change Detection (2015) (0)
- Margin based discriminative training techniques for automatic speech recognition. (2010) (0)
- ALMA: Hierarchical Learning for Composite Multi-Agent Tasks (2022) (0)
- Sharing Features Between Visual Tasks at Different Levels of Granularity (2011) (0)
- Mention Memory: Incorporating Textual Knowledge into Transformers through Entity Mention Attention (2022) (0)
- Spectral Methods for Dimensionality Reduction (2005) (0)