K. Gimpel
#158,581 Most Influential Person Now
K. Gimpel's AcademicInfluence.com Rankings
Computer Science: World Rank #8917, Historical Rank #9373
Computational Linguistics: World Rank #2072, Historical Rank #2094
Database: World Rank #5913, Historical Rank #6132

Computer Science
K. Gimpel's Degrees
- PhD Computer Science Stanford University
- Masters Computer Science Stanford University
- Bachelors Computer Science Stanford University
Why Is K. Gimpel Influential?
K. Gimpel's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations (2019) (3926)
- A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks (2016) (1892)
- Gaussian Error Linear Units (GELUs) (2016) (1826)
- Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments (2010) (1096)
- Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters (2013) (805)
- Towards Universal Paraphrastic Sentence Embeddings (2015) (519)
- Adversarial Example Generation with Syntactically Controlled Paraphrase Networks (2018) (493)
- Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units (2016) (400)
- Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise (2018) (387)
- Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks (2015) (362)
- Tailoring Continuous Word Representations for Dependency Parsing (2014) (310)
- From Paraphrase Database to Compositional Paraphrase Model and Back (2015) (266)
- ParaNMT-50M: Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations (2017) (260)
- Movie Reviews and Revenues: An Experiment in Text Regression (2010) (234)
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models (2022) (218)
- Early Methods for Detecting Adversarial Images (2016) (200)
- Charagram: Embedding Words and Sentences via Character n-grams (2016) (181)
- Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions (2010) (155)
- Commonsense Knowledge Base Completion (2016) (141)
- Deep Multilingual Correlation for Improved Word Embeddings (2015) (138)
- Who did What: A Large-Scale Person-Centered Cloze Dataset (2016) (134)
- Machine Comprehension with Syntax, Frames, and Semantics (2015) (102)
- Learning Paraphrastic Sentence Embeddings from Back-Translated Bitext (2017) (88)
- A Systematic Exploration of Diversity in Machine Translation (2013) (87)
- Part-of-Speech Tagging for Twitter: Word Clusters and Other Advances (2012) (86)
- Controllable Paraphrase Generation with a Syntactic Exemplar (2019) (81)
- Structured Ramp Loss Minimization for Machine Translation (2012) (77)
- Beyond BLEU: Training Neural Machine Translation with Semantic Similarity (2019) (76)
- Revisiting Recurrent Networks for Paraphrastic Sentence Embeddings (2017) (76)
- Logistic Normal Priors for Unsupervised Probabilistic Grammar Induction (2008) (74)
- Rich Source-Side Context for Statistical Machine Translation (2008) (72)
- A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations (2019) (61)
- Visually Grounded Neural Syntax Acquisition (2019) (60)
- Predicting the NFL using Twitter (2013) (60)
- Pay Attention to the Ending: Strong Neural Baselines for the ROC Story Cloze Task (2017) (56)
- Learning to Represent the Evolution of Dynamic Graphs with Recurrent Models (2019) (52)
- Learning Approximate Inference Networks for Structured Prediction (2018) (47)
- SummScreen: A Dataset for Abstractive Screenplay Summarization (2021) (43)
- A Sense-Topic Model for Word Sense Induction with Unsupervised Data Enrichment (2015) (43)
- Distributed Asynchronous Online Learning for Natural Language Processing (2010) (42)
- Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information (2017) (41)
- Simple and Effective Paraphrastic Similarity from Parallel Translations (2019) (39)
- ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation (2020) (36)
- Variational Sequential Labelers for Semi-Supervised Learning (2019) (35)
- Evaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence Representations (2019) (31)
- Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks (2020) (30)
- UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement (2016) (29)
- Feature-Rich Translation by Quasi-Synchronous Lattice Parsing (2009) (27)
- Word Salad: Relating Food Prices and Descriptions (2012) (27)
- Concavity and Initialization for Unsupervised Dependency Parsing (2012) (26)
- End-to-End Neural Segmental Models for Speech Recognition (2017) (25)
- A Cross-Task Analysis of Text Span Representations (2020) (25)
- Broad Context Language Modeling as Reading Comprehension (2016) (25)
- Softmax-Margin Training for Structured Log-Linear Models (2010) (24)
- Unsupervised Evaluation Metrics and Learning Criteria for Non-Parallel Textual Transfer (2018) (23)
- Discriminative segmental cascades for feature-rich phone recognition (2015) (21)
- Visible Progress on Adversarial Images and a New Saliency Map (2016) (20)
- Substructure Substitution: Structured Data Augmentation for NLP (2021) (20)
- Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings (2009) (18)
- Learning to Embed Words in Context for Syntactic Tasks (2017) (17)
- Adjusting for Dropout Variance in Batch Normalization and Weight Initialization (2016) (16)
- Phrase Dependency Machine Translation with Quasi-Synchronous Tree-to-Tree Features (2014) (15)
- Mapping Unseen Words to Task-Trained Embedding Spaces (2015) (14)
- Generative Models of Monolingual and Bilingual Gappy Patterns (2011) (14)
- WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections (2020) (14)
- Unsupervised Label Refinement Improves Dataless Text Classification (2020) (13)
- A comparison of training approaches for discriminative segmental models (2014) (13)
- Benchmarking Approximate Inference Methods for Neural Structured Prediction (2019) (13)
- Learning Structured Classifiers with Dual Coordinate Ascent (2010) (13)
- On the Role of Supervision in Unsupervised Constituency Parsing (2020) (13)
- The CMU-ARK German-English Translation System (2011) (12)
- EntEval: A Holistic Evaluation Benchmark for Entity Representations (2019) (12)
- On Generalization in Coreference Resolution (2021) (12)
- Quality Signals in Generated Stories (2018) (12)
- How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions (2019) (11)
- Emergent Predication Structure in Hidden State Vectors of Neural Readers (2016) (11)
- Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing (2017) (11)
- PoMo: Generating Entity-Specific Post-Modifiers in Context (2019) (10)
- Generating Diverse Story Continuations with Controllable Semantics (2019) (10)
- Quasi-Synchronous Phrase Dependency Grammars for Machine Translation (2011) (9)
- Generalizing and Improving Weight Initialization (2016) (8)
- A Study of All-Convolutional Encoders for Connectionist Temporal Classification (2017) (8)
- Modeling Topics (2006) (8)
- Improving Joint Training of Inference Networks and Structured Prediction Energy Networks (2019) (7)
- End-to-end training approaches for discriminative segmental models (2016) (7)
- Emergent Logical Structure in Vector Representations of Neural Readers (2016) (7)
- Smaller Text Classifiers with Discriminative Cluster Embeddings (2018) (6)
- Paraphrastic Representations at Scale (2021) (6)
- Erratum: “From Paraphrase Database to Compositional Paraphrase Model and Back” (2015) (5)
- NATCAT: Weakly Supervised Text Classification with Naturally Annotated Resources (2020) (5)
- Mining Knowledge for Natural Language Inference from Wikipedia Categories (2020) (5)
- Discriminative Feature-Rich Modeling for Syntax-Based Machine Translation (2012) (5)
- PeTra: A Sparsely Supervised Memory Model for People Tracking (2020) (5)
- Weakly-Supervised Learning with Cost-Augmented Contrastive Estimation (2014) (5)
- Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference (2020) (5)
- Beating the NFL Football Point Spread (2006) (5)
- Learning Chess Blindfolded: Evaluating Language Models on State Tracking (2021) (5)
- Clustering Contextualized Representations of Text for Unsupervised Syntax Induction (2020) (4)
- Discriminative Online Algorithms for Sequence Labeling - A Comparative Study (2007) (4)
- Latent-Variable Generative Models for Data-Efficient Text Classification (2019) (4)
- FlowPrior: Learning Expressive Priors for Latent Variable Sentence Models (2021) (4)
- Natcat: Weakly Supervised Text Classification with Naturally Annotated Datasets (2020) (4)
- Efficient Segmental Cascades for Speech Recognition (2016) (4)
- Controllable Paraphrasing and Translation with a Syntactic Exemplar (2020) (4)
- Distractor Analysis and Selection for Multiple-Choice Cloze Questions for Second-Language Learners (2020) (4)
- Sequence-to-sequence modeling for graph representation learning (2019) (4)
- “What makes a question inquisitive?” A Study on Type-Controlled Inquisitive Question Generation (2022) (4)
- Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing (2021) (3)
- Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size (2020) (3)
- Chess as a Testbed for Language Model State Tracking (2021) (3)
- TVRecap: A Dataset for Generating Stories with Character Descriptions (2021) (3)
- Deep Clustering of Text Representations for Supervision-Free Probing of Syntax (2020) (3)
- Learning Criteria and Evaluation Metrics for Textual Transfer between Non-Parallel Corpora (2018) (3)
- Aggressive Online Learning of Structured Classifiers (2010) (2)
- Generating Wikipedia Article Sections from Diverse Data Sources (2020) (2)
- Statistical Inference in Graphical Models (2008) (2)
- Reconsidering the Past: Optimizing Hidden States in Language Models (2021) (1)
- Emergent Predication Structure in Vector Representations of Neural Readers (2017) (1)
- An Exploration of Arbitrary-Order Sequence Labeling via Energy-Based Inference Networks (2020) (1)
- A note on more efficient architectures for NLP (2021) (1)
- Word Salad: Relating Food Prices and Descriptions Supplementary Material (2012) (1)
- Learning Probabilistic Sentence Representations from Paraphrases (2020) (1)
- Constraints Based Convex Belief Propagation (2016) (0)
- The Benefits of Label-Description Training for Zero-Shot Text Classification (2023) (0)
- Baked-in State Probing (2022) (0)
- End-to-end neural segmental models for speech recognition (2017) (0)
- Improving and Stabilizing Deep Energy-Based Learning (2019) (0)
- Sequence-to-sequence modeling for graph representation learning (2019) (0)
- Exemplar-Controllable Paraphrasing and Translation using Bitext (2020) (0)
- TVStoryGen: A Dataset for Generating Stories with Character Descriptions (2021) (0)