K. Gimpel

K. Gimpel's AcademicInfluence.com Rankings

Computer Science: World Rank #8917, Historical Rank #9373
Computational Linguistics: World Rank #2072, Historical Rank #2094
Database: World Rank #5913, Historical Rank #6132

K. Gimpel's Degrees

PhD Computer Science Stanford University
Masters Computer Science Stanford University
Bachelors Computer Science Stanford University

K. Gimpel's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author.

Published Works

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations (2019) (3926)
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks (2016) (1892)
Gaussian Error Linear Units (GELUs) (2016) (1826)
Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments (2010) (1096)
Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters (2013) (805)
Towards Universal Paraphrastic Sentence Embeddings (2015) (519)
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks (2018) (493)
Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units (2016) (400)
Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise (2018) (387)
Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks (2015) (362)
Tailoring Continuous Word Representations for Dependency Parsing (2014) (310)
From Paraphrase Database to Compositional Paraphrase Model and Back (2015) (266)
ParaNMT-50M: Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations (2017) (260)
Movie Reviews and Revenues: An Experiment in Text Regression (2010) (234)
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models (2022) (218)
Early Methods for Detecting Adversarial Images (2016) (200)
Charagram: Embedding Words and Sentences via Character n-grams (2016) (181)
Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions (2010) (155)
Commonsense Knowledge Base Completion (2016) (141)
Deep Multilingual Correlation for Improved Word Embeddings (2015) (138)
Who did What: A Large-Scale Person-Centered Cloze Dataset (2016) (134)
Machine Comprehension with Syntax, Frames, and Semantics (2015) (102)
Learning Paraphrastic Sentence Embeddings from Back-Translated Bitext (2017) (88)
A Systematic Exploration of Diversity in Machine Translation (2013) (87)
Part-of-Speech Tagging for Twitter: Word Clusters and Other Advances (2012) (86)
Controllable Paraphrase Generation with a Syntactic Exemplar (2019) (81)
Structured Ramp Loss Minimization for Machine Translation (2012) (77)
Beyond BLEU: Training Neural Machine Translation with Semantic Similarity (2019) (76)
Revisiting Recurrent Networks for Paraphrastic Sentence Embeddings (2017) (76)
Logistic Normal Priors for Unsupervised Probabilistic Grammar Induction (2008) (74)
Rich Source-Side Context for Statistical Machine Translation (2008) (72)
A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations (2019) (61)
Visually Grounded Neural Syntax Acquisition (2019) (60)
Predicting the NFL using Twitter (2013) (60)
Pay Attention to the Ending: Strong Neural Baselines for the ROC Story Cloze Task (2017) (56)
Learning to Represent the Evolution of Dynamic Graphs with Recurrent Models (2019) (52)
Learning Approximate Inference Networks for Structured Prediction (2018) (47)
SummScreen: A Dataset for Abstractive Screenplay Summarization (2021) (43)
A Sense-Topic Model for Word Sense Induction with Unsupervised Data Enrichment (2015) (43)
Distributed Asynchronous Online Learning for Natural Language Processing (2010) (42)
Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information (2017) (41)
Simple and Effective Paraphrastic Similarity from Parallel Translations (2019) (39)
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation (2020) (36)
Variational Sequential Labelers for Semi-Supervised Learning (2019) (35)
Evaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence Representations (2019) (31)
Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks (2020) (30)
UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement (2016) (29)
Feature-Rich Translation by Quasi-Synchronous Lattice Parsing (2009) (27)
Word Salad: Relating Food Prices and Descriptions (2012) (27)
Concavity and Initialization for Unsupervised Dependency Parsing (2012) (26)
End-to-End Neural Segmental Models for Speech Recognition (2017) (25)
A Cross-Task Analysis of Text Span Representations (2020) (25)
Broad Context Language Modeling as Reading Comprehension (2016) (25)
Softmax-Margin Training for Structured Log-Linear Models (2010) (24)
Unsupervised Evaluation Metrics and Learning Criteria for Non-Parallel Textual Transfer (2018) (23)
Discriminative segmental cascades for feature-rich phone recognition (2015) (21)
Visible Progress on Adversarial Images and a New Saliency Map (2016) (20)
Substructure Substitution: Structured Data Augmentation for NLP (2021) (20)
Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings (2009) (18)
Learning to Embed Words in Context for Syntactic Tasks (2017) (17)
Adjusting for Dropout Variance in Batch Normalization and Weight Initialization (2016) (16)
Phrase Dependency Machine Translation with Quasi-Synchronous Tree-to-Tree Features (2014) (15)
Mapping Unseen Words to Task-Trained Embedding Spaces (2015) (14)
Generative Models of Monolingual and Bilingual Gappy Patterns (2011) (14)
WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections (2020) (14)
Unsupervised Label Refinement Improves Dataless Text Classification (2020) (13)
A comparison of training approaches for discriminative segmental models (2014) (13)
Benchmarking Approximate Inference Methods for Neural Structured Prediction (2019) (13)
Learning Structured Classifiers with Dual Coordinate Ascent (2010) (13)
On the Role of Supervision in Unsupervised Constituency Parsing (2020) (13)
The CMU-ARK German-English Translation System (2011) (12)
EntEval: A Holistic Evaluation Benchmark for Entity Representations (2019) (12)
On Generalization in Coreference Resolution (2021) (12)
Quality Signals in Generated Stories (2018) (12)
How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed Questions (2019) (11)
Emergent Predication Structure in Hidden State Vectors of Neural Readers (2016) (11)
Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing (2017) (11)
PoMo: Generating Entity-Specific Post-Modifiers in Context (2019) (10)
Generating Diverse Story Continuations with Controllable Semantics (2019) (10)
Quasi-Synchronous Phrase Dependency Grammars for Machine Translation (2011) (9)
Generalizing and Improving Weight Initialization (2016) (8)
A Study of All-Convolutional Encoders for Connectionist Temporal Classification (2017) (8)
Modeling Topics (2006) (8)
Improving Joint Training of Inference Networks and Structured Prediction Energy Networks (2019) (7)
End-to-end training approaches for discriminative segmental models (2016) (7)
Emergent Logical Structure in Vector Representations of Neural Readers (2016) (7)
Smaller Text Classifiers with Discriminative Cluster Embeddings (2018) (6)
Paraphrastic Representations at Scale (2021) (6)
Erratum: “From Paraphrase Database to Compositional Paraphrase Model and Back” (2015) (5)
NATCAT: Weakly Supervised Text Classification with Naturally Annotated Resources (2020) (5)
Mining Knowledge for Natural Language Inference from Wikipedia Categories (2020) (5)
Discriminative Feature-Rich Modeling for Syntax-Based Machine Translation (2012) (5)
PeTra: A Sparsely Supervised Memory Model for People Tracking (2020) (5)
Weakly-Supervised Learning with Cost-Augmented Contrastive Estimation (2014) (5)
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference (2020) (5)
Beating the NFL Football Point Spread (2006) (5)
Learning Chess Blindfolded: Evaluating Language Models on State Tracking (2021) (5)
Clustering Contextualized Representations of Text for Unsupervised Syntax Induction (2020) (4)
Discriminative Online Algorithms for Sequence Labeling - A Comparative Study (2007) (4)
Latent-Variable Generative Models for Data-Efficient Text Classification (2019) (4)
FlowPrior: Learning Expressive Priors for Latent Variable Sentence Models (2021) (4)
Natcat: Weakly Supervised Text Classification with Naturally Annotated Datasets (2020) (4)
Efficient Segmental Cascades for Speech Recognition (2016) (4)
Controllable Paraphrasing and Translation with a Syntactic Exemplar (2020) (4)
Distractor Analysis and Selection for Multiple-Choice Cloze Questions for Second-Language Learners (2020) (4)
Sequence-to-sequence modeling for graph representation learning (2019) (4)
“What makes a question inquisitive?” A Study on Type-Controlled Inquisitive Question Generation (2022) (4)
Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing (2021) (3)
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size (2020) (3)
Chess as a Testbed for Language Model State Tracking (2021) (3)
TVRecap: A Dataset for Generating Stories with Character Descriptions (2021) (3)
Deep Clustering of Text Representations for Supervision-Free Probing of Syntax (2020) (3)
Learning Criteria and Evaluation Metrics for Textual Transfer between Non-Parallel Corpora (2018) (3)
Aggressive Online Learning of Structured Classifiers (2010) (2)
Generating Wikipedia Article Sections from Diverse Data Sources (2020) (2)
Statistical Inference in Graphical Models (2008) (2)
Reconsidering the Past: Optimizing Hidden States in Language Models (2021) (1)
Emergent Predication Structure in Vector Representations of Neural Readers (2017) (1)
An Exploration of Arbitrary-Order Sequence Labeling via Energy-Based Inference Networks (2020) (1)
A note on more efficient architectures for NLP (2021) (1)
Word Salad: Relating Food Prices and Descriptions Supplementary Material (2012) (1)
Learning Probabilistic Sentence Representations from Paraphrases (2020) (1)
Constraints Based Convex Belief Propagation (2016) (0)
The Benefits of Label-Description Training for Zero-Shot Text Classification (2023) (0)
Baked-in State Probing (2022) (0)
End-to-end neural segmental models for speech recognition (2017) (0)
Improving and Stabilizing Deep Energy-Based Learning (2019) (0)
Sequence-to-sequence modeling for graph representation learning (2019) (0)
Exemplar-Controllable Paraphrasing and Translation using Bitext (2020) (0)
TVStoryGen: A Dataset for Generating Stories with Character Descriptions (2021) (0)
