Alexander M. Rush
#153,126
Most Influential Person Now
Researcher, language, Cornell Tech, Hugging Face
Alexander M. Rush's AcademicInfluence.com Rankings
Download Badge
Communications
Why Is Alexander M. Rush Influential?
(Suggest an Edit or Addition)Alexander M. Rush's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- A Neural Attention Model for Abstractive Sentence Summarization (2015) (2379)
- OpenNMT: Open-Source Toolkit for Neural Machine Translation (2017) (1542)
- Character-Aware Neural Language Models (2015) (1514)
- Abstractive Sentence Summarization with Attentive Recurrent Neural Networks (2016) (776)
- Bottom-Up Abstractive Summarization (2018) (552)
- Sequence-to-Sequence Learning as Beam-Search Optimization (2016) (513)
- Sequence-Level Knowledge Distillation (2016) (483)
- Multitask Prompted Training Enables Zero-Shot Task Generalization (2021) (461)
- Challenges in Data-to-Document Generation (2017) (433)
- Structured Attention Networks (2017) (367)
- LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks (2016) (313)
- Adversarially Regularized Autoencoders (2017) (243)
- Commonsense Knowledge Mining from Pretrained Models (2019) (222)
- Movement Pruning: Adaptive Sparsity by Fine-Tuning (2020) (207)
- Datasets: A Community Library for Natural Language Processing (2021) (205)
- Dual Decomposition for Parsing with Non-Projective Head Automata (2010) (192)
- On Dual Decomposition and Linear Programming Relaxations for Natural Language Processing (2010) (187)
- Semi-Amortized Variational Autoencoders (2018) (182)
- Seq2seq-Vis: A Visual Debugging Tool for Sequence-to-Sequence Models (2018) (176)
- Learning Global Features for Coreference Resolution (2016) (175)
- Learning Neural Templates for Text Generation (2018) (173)
- BLOOM: A 176B-Parameter Open-Access Multilingual Language Model (2022) (172)
- How many data points is a prompt worth? (2021) (163)
- Image-to-Markup Generation with Coarse-to-Fine Attention (2016) (157)
- Avoiding Latent Variable Collapse With Generative Skip Models (2018) (143)
- Learning Anaphoricity and Antecedent Ranking Features for Coreference Resolution (2015) (142)
- GLTR: Statistical Detection and Visualization of Generated Text (2019) (139)
- A Neural Attention Model for Sentence Summarization (2015) (123)
- Parameter-Efficient Transfer Learning with Diff Pruning (2020) (114)
- A Tutorial on Dual Decomposition and Lagrangian Relaxation for Inference in Natural Language Processing (2012) (111)
- Compound Probabilistic Context-Free Grammars for Grammar Induction (2019) (108)
- Latent Normalizing Flows for Discrete Sequences (2019) (102)
- Latent Alignment and Variational Attention (2018) (97)
- OpenNMT: Neural Machine Translation Toolkit (2018) (97)
- Unsupervised Recurrent Neural Network Grammars (2019) (95)
- Adversarially Regularized Autoencoders for Generating Discrete Structures (2017) (80)
- Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks (2016) (77)
- Don’t Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference (2019) (69)
- PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts (2022) (68)
- Block Pruning For Faster Transformers (2021) (65)
- End-to-End Content and Plan Selection for Data-to-Text Generation (2018) (65)
- Vine Pruning for Efficient Multi-Pass Dependency Parsing (2012) (60)
- Learning from others' mistakes: Avoiding dataset biases without modeling them (2020) (60)
- Exact Decoding of Syntactic Translation Models through Lagrangian Relaxation (2011) (58)
- Sequence-level Mixed Sample Data Augmentation (2020) (55)
- What You Get Is What You See: A Visual Markup Decompiler (2016) (55)
- Torch-Struct: Deep Structured Prediction Library (2020) (54)
- Improved Parsing and POS Tagging Using Inter-Sentence Consistency Constraints (2012) (54)
- EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference (2020) (49)
- On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference (2019) (49)
- Adapting Sequence Models for Sentence Correction (2017) (48)
- The Annotated Transformer (2018) (47)
- Induction of Probabilistic Synchronous Tree-Insertion Grammars for Machine Translation (2006) (45)
- Training for Diversity in Image Paragraph Captioning (2018) (44)
- Encoder-Agnostic Adaptation for Conditional Language Generation (2019) (44)
- Dilated Convolutions for Modeling Long-Distance Genomic Dependencies (2017) (42)
- Visual Interaction with Deep Learning Models through Collaborative Semantic Inference (2019) (40)
- Cognitive Patterns: Problem-Solving Frameworks for Object Technology (1998) (40)
- Sentence-Level Grammatical Error Identification as Sequence-to-Sequence Correction (2016) (40)
- OpenNMT System Description for WNMT 2018: 800 words/sec on a single-core CPU (2018) (38)
- Neural Linguistic Steganography (2019) (38)
- Word Ordering Without Syntax (2016) (37)
- Coarse-to-Fine Attention Models for Document Summarization (2017) (36)
- A Tutorial on Deep Latent Variable Models of Natural Language (2018) (36)
- Pre-trained Summarization Distillation (2020) (35)
- MASR: A Modular Accelerator for Sparse RNNs (2019) (35)
- Algorithm-Hardware Co-Design of Adaptive Floating-Point Encodings for Resilient Deep Learning Inference (2020) (33)
- GRIT: Generative Role-filler Transformers for Document-level Event Entity Extraction (2021) (31)
- Weightless: Lossy Weight Encoding For Deep Neural Network Compression (2017) (30)
- Conference demographics and footprint changed by virtual platforms (2021) (30)
- An Embedding Model for Predicting Roll-Call Votes (2016) (29)
- Transforming Dependencies into Phrase Structures (2015) (29)
- Induction of Probabilistic Synchronous Tree-Insertion Grammars (2005) (28)
- Policing the Police: The Impact of "Pattern-or-Practice" Investigations on Crime (2020) (28)
- Automating Botnet Detection with Graph Neural Networks (2020) (26)
- Darling or Babygirl? Investigating Stylistic Bias in Sentiment Analysis (2018) (21)
- Posterior Control of Blackbox Generation (2020) (21)
- Simple Unsupervised Summarization by Contextual Matching (2019) (20)
- Low-Complexity Probing via Finding Subnetworks (2021) (19)
- AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference (2019) (19)
- A Fast Variational Approach for Learning Markov Random Field Language Models (2015) (19)
- Interactive and Visual Prompt Engineering for Ad-hoc Task Adaptation with Large Language Models (2022) (19)
- Adversarial Semantic Collisions (2020) (19)
- Generating Abstractive Summaries with Finetuned Language Models (2019) (18)
- Scaling Hidden Markov Language Models (2020) (17)
- Optimal Beam Search for Machine Translation (2013) (16)
- Entity Tracking Improves Cloze-style Reading Comprehension (2018) (15)
- Template Filling with Generative Transformers (2021) (15)
- 9.8 A 25mm2 SoC for IoT Devices with 18ms Noise-Robust Speech-to-Text Latency via Bayesian Speech Denoising and Attention-Based Sequence-to-Sequence DNN Speech Recognition in 16nm FinFET (2021) (13)
- Tensor Variable Elimination for Plated Factor Graphs (2019) (13)
- A Constrained Viterbi Relaxation for Bidirectional Word Alignment (2014) (13)
- Spectral Learning of Refinement HMMs (2013) (13)
- What is Learned in Visually Grounded Neural Syntax Acquisition (2020) (12)
- Rationales for Sequential Predictions (2021) (11)
- Cascaded Text Generation with Markov Transformers (2020) (10)
- Document-level Event-based Extraction Using Generative Template-filling Transformers (2020) (10)
- Latent Template Induction with Gumbel-CRFs (2020) (9)
- Low-Rank Constraints for Fast Inference in Structured Models (2022) (8)
- Improving Event Duration Prediction via Time-aware Pre-training (2020) (8)
- EdgeBERT: Optimizing On-Chip Inference for Multi-Task NLP (2020) (8)
- End-to-end learning of multiple sequence alignments with differentiable Smith–Waterman (2021) (8)
- Named Tensor Notation (2021) (6)
- A Hierarchy of Graph Neural Networks Based on Learnable Local Features (2019) (6)
- Explaining Patterns in Data with Language Models via Interpretable Autoprompting (2022) (5)
- GenNI: Human-AI Collaboration for Data-Backed Text Generation (2021) (5)
- LAN: A Materials Notation for Two-Dimensional Layered Assemblies (2019) (5)
- Deep Latent Variable Models of Natural Language (2018) (4)
- Lie-Access Neural Turing Machines (2016) (4)
- Model Criticism for Long-Form Text Generation (2022) (3)
- Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements (2022) (3)
- Propagation of Gaussian Beams in the Presence of Gain and Loss (2016) (2)
- MiniConf - A Virtual Conference Framework (2020) (2)
- Beyond the carbon footprint: Virtual conferences increase diversity, equity, and inclusion (2020) (2)
- Antecedent Prediction Without a Pipeline (2016) (2)
- A 16-nm SoC for Noise-Robust Speech and NLP Edge AI Inference With Bayesian Sound Source Separation and Attention-Based DNNs (2023) (2)
- ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition (2022) (1)
- Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorial Abstracts (2016) (1)
- Dual Decomposition for Natural Language Processing (2011) (1)
- Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field (2021) (1)
- Sequence-to-Lattice Models for Fast Translation (2021) (0)
- SM6: A 16nm System-on-Chip for Accurate and Noise-Robust Attention-Based NLP Applications : The 33rd Hot Chips Symposium – August 22-24, 2021 (2021) (0)
- 2 Background : Latent Alignment and Neural Attention (2018) (0)
- Lagrangian relaxation for natural language decoding (2014) (0)
- Unsupervised Text Deidentification (2022) (0)
- A Neural Framework for Low-Shot Learning (2017) (0)
- Visual Interactions with Deep Models through Collaborative Semantic Inference (2019) (0)
- Commonsense Reasoning for Question Answering with Explanations (2022) (0)
- Teal: Learning-Accelerated Optimization of WAN Traffic Engineering (2022) (0)
- Word Ordering Without Syntax The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters (2016) (0)
- Xatu: boosting existing DDoS detection systems using auxiliary signals (2022) (0)
- Efficient Lagrangian relaxation algorithms for exact inference in natural language tasks (2011) (0)
- Xatu (2022) (0)
- Teal: Learning-Accelerated Optimization of Traffic Engineering (2022) (0)
- Pretraining Without Attention (2022) (0)
- DATASET BIASES WITHOUT MODELING THEM (2021) (0)
- 22.9 A 12nm 18.1TFLOPs/W Sparse Transformer Processor with Entropy-Based Early Exit, Mixed-Precision Predication and Fine-Grained Power Management (2023) (0)
- Markup-to-Image Diffusion Models with Scheduled Sampling (2022) (0)
- Topological Botnet Detection (2020) (0)
This paper list is powered by the following services:
Other Resources About Alexander M. Rush
What Schools Are Affiliated With Alexander M. Rush?
Alexander M. Rush is affiliated with the following schools:
