David Eric Yarowsky

David Eric Yarowsky's AcademicInfluence.com Rankings

Computer Science: World Rank #7121, Historical Rank #7500
Computational Linguistics: World Rank #1364, Historical Rank #1379
Database: World Rank #4178, Historical Rank #4345

Communications: World Rank #6216, Historical Rank #8567
Linguistics: World Rank #1104, Historical Rank #1399

David Eric Yarowsky's Degrees

PhD Computer Science University of Pennsylvania
Bachelors Linguistics University of Pennsylvania

David Eric Yarowsky's Published Works

[Chart: number of citations in a given year to any of this author's works]

[Chart: total number of citations to the works the author published in a given year, which highlights when the author's most important works appeared]

Unsupervised Word Sense Disambiguation Rivaling Supervised Methods (1995) (2688)
Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora (1992) (845)
Classifying latent user attributes in twitter (2010) (709)
A method for disambiguating word senses in a large corpus (1992) (665)
One Sense Per Discourse (1992) (630)
Inducing Multilingual Text Analysis Tools via Robust Projection across Aligned Corpora (2001) (555)
Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French (1994) (485)
One Sense per Collocation (1993) (485)
Unsupervised Personal Name Disambiguation (2003) (377)
Inducing Multilingual POS Taggers and NP Bracketers via Robust Projection Across Aligned Corpora (2001) (292)
Language Independent Named Entity Recognition Combining Morphological and Contextual Evidence (1999) (272)
Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs (1992) (238)
The SIGMORPHON 2016 Shared Task—Morphological Reinflection (2016) (225)
Minimally Supervised Morphological Analysis by Multimodal Alignment (2000) (205)
Cross-lingual Dependency Parsing Based on Distributed Representations (2015) (178)
Multipath Translation Lexicon Induction via Bridge Languages (2001) (174)
Natural Language Processing Using Very Large Corpora (2014) (167)
Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media (2013) (162)
Using bilingual materials to develop word sense disambiguation methods (1992) (150)
CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages (2017) (149)
Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation (1999) (130)
Hierarchical Decision Lists for Word Sense Disambiguation (2000) (125)
Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter (2013) (119)
Work on Statistical Methods for Word Sense Disambiguation (1992) (119)
Statistical Machine Translation: Final Report (1999) (117)
Inducing Translation Lexicons via Diverse Similarity Measures and Bridge Languages (2002) (117)
Stylometric Analysis of Scientific Articles (2012) (111)
The CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection (2018) (105)
UniMorph 3.0: Universal Morphology (2018) (100)
Modeling Latent Biographic Attributes in Conversational Genres (2009) (98)
Rule Writing or Annotation: Cost-efficient Resource Usage for Base Noun Phrase Chunking (2000) (98)
Homograph Disambiguation in Text-to-Speech Synthesis (1997) (94)
A Representation Learning Framework for Multi-Source Transfer Parsing (2016) (93)
Hierarchical Bayesian Models for Latent Attribute Detection in Social Media (2011) (90)
Combining Classifiers for word sense disambiguation (2002) (89)
New Tools for Web-Scale N-grams (2010) (88)
Toward Statistical Machine Translation without Parallel Corpora (2012) (83)
Modeling and learning multilingual inflectional morphology in a minimally supervised framework (2003) (77)
A Language-Independent Feature Schema for Inflectional Morphology (2015) (75)
A corpus-based synthesizer (1992) (73)
A Comparison of Corpus-Based Techniques for Restoring Accents in Spanish and French Text (1999) (72)
Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences (2009) (71)
Modeling Consensus: Classifier Combination for Word Sense Disambiguation (2002) (69)
Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms (2016) (65)
Exploring Sentiment in Social Media: Bootstrapping Subjectivity Clues from Multilingual Twitter Streams (2013) (64)
UniMorph 2.0: Universal Morphology (2018) (60)
Cross-Document Coreference Resolution: A Key Technology for Learning by Reading (2009) (59)
Bootstrapping a Multilingual Part-of-speech Tagger in One Person-day (2002) (56)
Massively Multilingual Adversarial Speech Recognition (2019) (55)
Inducing Information Extraction Systems for New Languages via Cross-language Projection (2002) (54)
Evaluating sense disambiguation across diverse parameter spaces (2002) (54)
Multi-Field Information Extraction and Cross-Document Fusion (2005) (52)
Taking the load off the conference chairs: towards a digital paper-routing assistant (1999) (52)
Quantifying the value of pronunciation lexicons for keyword search in low-resource languages (2013) (50)
HLTCOE Approaches to Knowledge Base Population at TAC 2009 (2009) (48)
Homograph disambiguation in speech synthesis (1994) (47)
Discrimination decisions for 100,000-dimensional spaces (1995) (46)
Dynamic Nonlocal Language Modeling via Hierarchical Topic-Based Adaptation (1999) (46)
Language Independent NER using a Unified Model of Internal and Contextual Evidence (2002) (45)
NADA: A Robust System for Non-referential Pronoun Detection (2011) (44)
A keyword search system using open source software (2014) (43)
Marrying Universal Dependencies and Universal Morphology (2018) (43)
Ranking and Semi-supervised Classification on Large Scale Graphs Using Map-Reduce (2009) (42)
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (2013) (41)
Language Independent, Minimally Supervised Induction of Lexical Probabilities (2000) (40)
The Johns Hopkins SENSEVAL2 system descriptions (2001) (40)
Minimally Supervised Induction of Grammatical Gender (2003) (38)
Structural, Transitive and Latent Models for Biographic Fact Extraction (2009) (36)
A Universal Feature Schema for Rich Morphological Annotation and Fine-Grained Cross-Lingual Part-of-Speech Tagging (2015) (34)
The Johns Hopkins University Bible Corpus: 1600+ Tongues for Typological Exploration (2020) (31)
Three machine learning algorithms for lexical ambiguity resolution (1996) (29)
Unsupervised Translation Induction for Chinese Abbreviations using Monolingual Corpora (2008) (27)
Multi-document statistical fact extraction and fusion (2006) (26)
Minimally Supervised Morphological Segmentation with Applications to Machine Translation (2006) (23)
Mining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora (2008) (23)
JHU1: An Unsupervised Approach to Person Name Disambiguation using Web Snippets (2007) (21)
Paradigm Completion for Derivational Morphology (2017) (21)
Statistical Machine Translation Using Coercive Two-Level Syntactic Transduction (2003) (20)
Improving Gender Prediction of Social Media Users via Weighted Annotator Rationales (2014) (20)
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages (2021) (18)
Affinity Measures Based on the Graph Laplacian (2008) (17)
Resolving and Generating Definite Anaphora by Modeling Hypernymy using Unlabeled Corpora (2006) (16)
Learning Morphosyntactic Analyzers from the Bible via Iterative Annotation Projection across 26 Languages (2019) (16)
An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages (2020) (15)
Using Large Monolingual and Bilingual Corpora to Improve Coordination Disambiguation (2011) (15)
Translation discovery using diverse similarity measures (2006) (15)
Desperately Seeking Cebuano (2003) (15)
Creating a Translation Matrix of the Bible's Names Across 591 Languages (2018) (15)
Augmented Mixture Models for Lexical Disambiguation (2002) (14)
Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages (2008) (14)
A Distributed Representation-Based Framework for Cross-Lingual Transfer Parsing (2016) (14)
Word Sense Disambiguation (2010) (12)
Creating Large-Scale Multilingual Cognate Tables (2018) (12)
Induction of Fine-Grained Part-of-Speech Taggers via Classifier Combination and Crosslingual Projection (2005) (12)
Computational Etymology and Word Emergence (2020) (12)
The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems (2001) (11)
Fine-grained Morphosyntactic Analysis and Generation Tools for More Than One Thousand Languages (2020) (11)
The Johns Hopkins University 2003 Chinese-English machine translation system (2003) (11)
A Comparative Study of Extremely Low-Resource Transliteration of the World's Languages (2018) (10)
Wiktionary Normalization of Translations and Morphological Information (2020) (9)
Statistical Machine Translation Final Report, JHU Workshop 1999 (1999) (9)
Multilingual Dictionary Based Construction of Core Vocabulary (2020) (9)
Improving Low Resource Machine Translation using Morphological Glosses (Non-archival Extended Abstract) (2018) (8)
Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages (2016) (8)
Social Media Predictive Analytics (2015) (8)
Unsupervised Acquisition of Lexical Knowledge From N-grams: Final Report of the 2009 JHU CLSP Workshop (2010) (7)
Part of Speech Tagging and Shallow Parsing of Indian Languages (2006) (7)
Machine Translation for Languages Lacking Bitext via Multilingual Gloss Transduction (2006) (6)
Improving Bitext Word Alignments via Syntax-based Reordering of English (2004) (6)
The Johns Hopkins SENSEVAL-2 System Descriptions (2001) (6)
Massively Translingual Compound Analysis and Translation Discovery (2018) (5)
A two-level syntax-based approach to Arabic-English statistical machine translation (2003) (5)
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer (2019) (5)
Homograph Disambiguation in Text-to-speech Synthesis (1997) (5)
Deriving Consensus for Multi-Parallel Corpora: an English Bible Study (2017) (4)
Modeling Color Terminology Across Thousands of Languages (2019) (4)
Text normalization and ambiguity resolution in speech synthesis (1993) (4)
Transformation based learning and data-driven lexical disambiguation: syntactic and semantic ambiguity resolution (2003) (4)
Neural Transduction for Multilingual Lexical Translation (2020) (4)
Minimally Supervised Multilingual Taxonomy and Translation Lexicon Induction (2008) (4)
Web N-gram workshop 2010 (2011) (3)
Exploiting Aggregate Properties of Bilingual Dictionaries For Distinguishing Senses of English Words and Inducing English Sense Clusters (2004) (3)
Arabic cross-document coreference detection (2009) (3)
MINDS Workshops Machine Translation Working Group Final Report (3)
Typed graph models for semi-supervised learning of name ethnicity (2011) (3)
Sequence Models for Computational Etymology of Borrowings (2021) (3)
Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions (2020) (3)
The CoNLL–SIGMORPHON 2018 Shared Task (2018) (2)
Arabic Cross-Document Coreference Resolution (2009) (2)
UniMorph 4.0: Universal Morphology (2022) (2)
Induced Inflection-Set Keyword Search in Speech (2019) (2)
Maximizing resources for corpus-based natural language processing (2001) (1)
One Sense per Discourse (1992) (1)
On Pronunciations in Wiktionary: Extraction and Experiments on Multilingual Syllabification and Stress Prediction (2021) (1)
JHU1: An Unsupervised Approach to Person Name Disambiguation using Web Snippets (2007) (1)
Learning Domain-Specific, L1-Specific Measures of Word Readability (2013) (1)
Cmp-lg/9406034 22 Jun 94 (1994) (0)
Discrimination decisions for 100,000-dimensional spaces (0)
Language independent, minimally supervised methods in natural language ambiguity resolution (2004) (0)
Machine Translation Working Group Final Report (2007) (0)
Paradigm Completion for Derivational Morphology (2018) (0)
Exploiting Nonlocal and Syntactic Word Relationships in Language Models (1998) (0)
Cross-Document Coreference Detection (2009) (0)
Random Walks for Text Semantic Similarity; WikiWalk: Random Walks on Wikipedia for Semantic Relatedness; Measuring Semantic Relatedness with Vector Space Models and Random Walks; Ranking and Semi-supervised Classification on Large Scale Graphs Using Map-Reduce; Opinion Graphs for Polarity and Discourse (2009) (0)
Towards a Discourse Relation-aware Approach for Chinese-English Machine Translation; A Mapping-based Approach for General Formal Human Computer Interaction Using Natural Language; Learning Grammar with Explicit Annotations for Subordinating Conjunctions; Multi-document Summarization Using Distortion-ra (0)
Web N-gram Workshop Workshop of the 33rd Annual International (2010) (0)
Typed Graph Models for Learning Latent Attributes from Names (2011) (0)
Identification de sections codables dans des documents médicaux (2015) (0)
Deciphering and Characterizing Out-of-Vocabulary Words for Morphologically Rich Languages (2022) (0)
Known Words Will Do: Unknown Concept Translation via Lexical Relations (2022) (0)
On the Robustness of Cognate Generation Models (2022) (0)
