David Eric Yarowsky
#144,248
Most Influential Person Now
David Eric Yarowsky's AcademicInfluence.com Rankings
David Eric Yarowskycomputer-science Degrees
Computer Science
#7121
World Rank
#7500
Historical Rank
Computational Linguistics
#1364
World Rank
#1379
Historical Rank
Database
#4178
World Rank
#4345
Historical Rank

David Eric Yarowskycommunications Degrees
Communications
#6216
World Rank
#8567
Historical Rank
Linguistics
#1104
World Rank
#1399
Historical Rank

Download Badge
Computer Science Communications
David Eric Yarowsky's Degrees
- PhD Computer Science University of Pennsylvania
- Bachelors Linguistics University of Pennsylvania
Similar Degrees You Can Earn
Why Is David Eric Yarowsky Influential?
(Suggest an Edit or Addition)David Eric Yarowsky's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Unsupervised Word Sense Disambiguation Rivaling Supervised Methods (1995) (2688)
- Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora (1992) (845)
- Classifying latent user attributes in twitter (2010) (709)
- A method for disambiguating word senses in a large corpus (1992) (665)
- One Sense Per Discourse (1992) (630)
- Inducing Multilingual Text Analysis Tools via Robust Projection across Aligned Corpora (2001) (555)
- Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French (1994) (485)
- One Sense per Collocation (1993) (485)
- Unsupervised Personal Name Disambiguation (2003) (377)
- Inducing Multilingual POS Taggers and NP Bracketers via Robust Projection Across Aligned Corpora (2001) (292)
- Language Independent Named Entity Recognition Combining Morphological and Contextual Evidence (1999) (272)
- Estimating Upper and Lower Bounds on the Performance of Word-Sense Disambiguation Programs (1992) (238)
- The SIGMORPHON 2016 Shared Task—Morphological Reinflection (2016) (225)
- Minimally Supervised Morphological Analysis by Multimodal Alignment (2000) (205)
- Cross-lingual Dependency Parsing Based on Distributed Representations (2015) (178)
- Multipath Translation Lexicon Induction via Bridge Languages (2001) (174)
- Natural Language Processing Using Very Large Corpora (2014) (167)
- Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media (2013) (162)
- Using bilingual materials to develop word sense disambiguation methods (1992) (150)
- CoNLL-SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection in 52 Languages (2017) (149)
- Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation (1999) (130)
- Hierarchical Decision Lists for Word Sense Disambiguation (2000) (125)
- Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter (2013) (119)
- Work on Statistical Methods for Word Sense Disambiguation (1992) (119)
- Statistical Machine Translation: Final Report (1999) (117)
- Inducing Translation Lexicons via Diverse Similarity Measures and Bridge Languages (2002) (117)
- Stylometric Analysis of Scientific Articles (2012) (111)
- The CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection (2018) (105)
- UniMorph 3.0: Universal Morphology (2018) (100)
- Modeling Latent Biographic Attributes in Conversational Genres (2009) (98)
- Rule Writing or Annotation: Cost-efficient Resource Usage for Base Noun Phrase Chunking (2000) (98)
- Homograph Disambiguation in Text-to-Speech Synthesis (1997) (94)
- A Representation Learning Framework for Multi-Source Transfer Parsing (2016) (93)
- Hierarchical Bayesian Models for Latent Attribute Detection in Social Media (2011) (90)
- Combining Classifiers for word sense disambiguation (2002) (89)
- New Tools for Web-Scale N-grams (2010) (88)
- Toward Statistical Machine Translation without Parallel Corpora (2012) (83)
- Modeling and learning multilingual inflectional morphology in a minimally supervised framework (2003) (77)
- A Language-Independent Feature Schema for Inflectional Morphology (2015) (75)
- A corpus-based synthesizer (1992) (73)
- A Comparison of Corpus-Based Techniques for Restoring Accents in Spanish and French Text (1999) (72)
- Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences (2009) (71)
- Modeling Consensus: Classifier Combination for Word Sense Disambiguation (2002) (69)
- Very-large Scale Parsing and Normalization of Wiktionary Morphological Paradigms (2016) (65)
- Exploring Sentiment in Social Media: Bootstrapping Subjectivity Clues from Multilingual Twitter Streams (2013) (64)
- UniMorph 2.0: Universal Morphology (2018) (60)
- Cross-Document Coreference Resolution: A Key Technology for Learning by Reading (2009) (59)
- Bootstrapping a Multilingual Part-of-speech Tagger in One Person-day (2002) (56)
- Massively Multilingual Adversarial Speech Recognition (2019) (55)
- Inducing Information Extraction Systems for New Languages via Cross-language Projection (2002) (54)
- Evaluating sense disambiguation across diverse parameter spaces (2002) (54)
- Multi-Field Information Extraction and Cross-Document Fusion (2005) (52)
- Taking the load off the conference chairs-towards a digital paper-routing assistant (1999) (52)
- Quantifying the value of pronunciation lexicons for keyword search in lowresource languages (2013) (50)
- HLTCOE Approaches to Knowledge Base Population at TAC 2009 (2009) (48)
- Homograph disambiguation in speech synthesis (1994) (47)
- Discrimination decisions for 100,000-dimensional spaces (1995) (46)
- Dynamic Nonlocal Language Modeling via Hierarchical Topic-Based Adaptation (1999) (46)
- Language Independent NER using a Unified Model of Internal and Contextual Evidence (2002) (45)
- NADA: A Robust System for Non-referential Pronoun Detection (2011) (44)
- A keyword search system using open source software (2014) (43)
- Marrying Universal Dependencies and Universal Morphology (2018) (43)
- Ranking and Semi-supervised Classification on Large Scale Graphs Using Map-Reduce (2009) (42)
- Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (2013) (41)
- Language Independent, Minimally Supervised Induction of Lexical Probabilities (2000) (40)
- The Johns Hopkins SENSEVAL2 system descriptions (2001) (40)
- Minimally Supervised Induction of Grammatical Gender (2003) (38)
- Structural, Transitive and Latent Models for Biographic Fact Extraction (2009) (36)
- A Universal Feature Schema for Rich Morphological Annotation and Fine-Grained Cross-Lingual Part-of-Speech Tagging (2015) (34)
- The Johns Hopkins University Bible Corpus: 1600+ Tongues for Typological Exploration (2020) (31)
- Three machine learning algorithms for lexical ambiguity resolution (1996) (29)
- Unsupervised Translation Induction for Chinese Abbreviations using Monolingual Corpora (2008) (27)
- Multi-document statistical fact extraction and fusion (2006) (26)
- Minimally Supervised Morphological Segmentation with Applications to Machine Translation (2006) (23)
- Mining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora (2008) (23)
- JHU1 : An Unsupervised Approach to Person Name Disambiguation using Web Snippets (2007) (21)
- Paradigm Completion for Derivational Morphology (2017) (21)
- Statistical Machine Translation Using Coercive Two-Level Syntactic Transduction (2003) (20)
- Improving Gender Prediction of Social Media Users via Weighted Annotator Rationales (2014) (20)
- SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages (2021) (18)
- Affinity Measures Based on the Graph Laplacian (2008) (17)
- Resolving and Generating Definite Anaphora by Modeling Hypernymy using Unlabeled Corpora (2006) (16)
- Learning Morphosyntactic Analyzers from the Bible via Iterative Annotation Projection across 26 Languages (2019) (16)
- An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages (2020) (15)
- Using Large Monolingual and Bilingual Corpora to Improve Coordination Disambiguation (2011) (15)
- Translation discovery using diverse similarity measures (2006) (15)
- Desparately Seeking Cebuano (2003) (15)
- Creating a Translation Matrix of the Bible's Names Across 591 Languages (2018) (15)
- Augmented Mixture Models for Lexical Disambiguation (2002) (14)
- Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages (2008) (14)
- A Distributed Representation-Based Framework for Cross-Lingual Transfer Parsing (2016) (14)
- Word Sense Disambiguation (2010) (12)
- Creating Large-Scale Multilingual Cognate Tables (2018) (12)
- Induction of Fine-Grained Part-of-Speech Taggers via Classifier Combination and Crosslingual Projection (2005) (12)
- Computational Etymology and Word Emergence (2020) (12)
- The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems (2001) (11)
- Fine-grained Morphosyntactic Analysis and Generation Tools for More Than One Thousand Languages (2020) (11)
- The Johns Hopkins University 2003 Chinese-English machine translation system (2003) (11)
- A Comparative Study of Extremely Low-Resource Transliteration of the World's Languages (2018) (10)
- Wiktionary Normalization of Translations and Morphological Information (2020) (9)
- Statistical Machine Translation Final Report, Jhu Workshop 1999 (1999) (9)
- Multilingual Dictionary Based Construction of Core Vocabulary (2020) (9)
- Improving Low Resource Machine Translation using Morphological Glosses (Non-archival Extended Abstract) (2018) (8)
- Remote Elicitation of Inflectional Paradigms to Seed Morphological Analysis in Low-Resource Languages (2016) (8)
- Social Media Predictive Analytics (2015) (8)
- Unsupervised Acquisition of Lexical Knowledge From N-grams : Final Report of the 2009 JHU CLSP Workshop (2010) (7)
- Part of Speech Tagging and Shallow Parsing of Indian Languag es (2006) (7)
- Machine Translation for Languages Lacking Bitext via Multilingual Gloss Transduction (2006) (6)
- Improving Bitext Word Alignments via Syntax-based Reordering of English (2004) (6)
- The John Hopkins SENSEVAL-2 System Descriptions (2001) (6)
- Massively Translingual Compound Analysis and Translation Discovery (2018) (5)
- A two-level syntax-based approach to Arabic-English statistical machine translation (2003) (5)
- Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer (2019) (5)
- Homograph Disambiguation in Text-to-speech Synthesis 1 Problem Description (1997) (5)
- Deriving Consensus for Multi-Parallel Corpora: an English Bible Study (2017) (4)
- Modeling Color Terminology Across Thousands of Languages (2019) (4)
- Text normalization and ambiguity resolution in speech synthesis (1993) (4)
- Transformation based learning and data-driven lexical disambiguation: syntactic and semantic ambiguity resolution (2003) (4)
- Neural Transduction for Multilingual Lexical Translation (2020) (4)
- Minimally Supervised Multilingual Taxonomy and Translation Lexicon Induction (2008) (4)
- Web N-gram workshop 2010 (2011) (3)
- Exploiting Aggregate Properties of Bilingual Dictionaries For Distinguishing Senses of English Words and Inducing English Sense Clusters (2004) (3)
- Arabic cross-document coreference detection (2009) (3)
- MINDS Workshops Machine Translation Working Group Final Report (3)
- Typed graph models for semi-supervised learning of name ethnicity (2011) (3)
- Sequence Models for Computational Etymology of Borrowings (2021) (3)
- Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions (2020) (3)
- The CoNLL--SIGMORPHON 2018 Shared Task (2018) (2)
- Arabic Cross-Document Coreference Resolution (2009) (2)
- UniMorph 4.0: Universal Morphology (2022) (2)
- Induced Inflection-Set Keyword Search in Speech (2019) (2)
- Maximizing resources for corpus-based natural language processing (2001) (1)
- One Sense per Discourse 2. Our Previous Work on Word-sense Disambiguation 2.1. Data Deprivation (1992) (1)
- On Pronunciations in Wiktionary: Extraction and Experiments on Multilingual Syllabification and Stress Prediction (2021) (1)
- JHU1 : An Unsupervised Approach to Person Name Disambiguation using Web Snippets (2007) (1)
- Learning Domain-Specific, L1-Specific Measures of Word Readability (2013) (1)
- Cmp-lg/9406034 22 Jun 94 (1994) (0)
- Discrimination decisions for l O 0 , O 00-dimensional spaces (0)
- Language independent, minimally supervised methods in natural language ambiguity resolution (2004) (0)
- Machine Translation Working Group Final Report (2007) (0)
- C L ] 3 0 A ug 2 01 7 Paradigm Completion for Derivational Morphology (2018) (0)
- Exploiting Nonlo al and Synta ti Word Relationships inLanguage ModelsRadu (1998) (0)
- Cross-Document Coreference Detection (2009) (0)
- Random Walks for Text Semantic Similarity Wikiwalk: Random Walks on Wikipedia for Semantic Relatedness Measuring Semantic Relatedness with Vector Space Models and Random Walks Ranking and Semi-supervised Classification on Large Scale Graphs Using Map-reduce Opinion Graphs for Polarity and Discourse (2009) (0)
- Towards a Discourse Relation-aware Approach for Chinese-english Machine Translation a Mapping-based Approach for General Formal Human Computer Interaction Using Natural Language Learning Grammar with Explicit Annotations for Subordinating Conjunctions Multi-document Summarization Using Distortion-ra (0)
- Web N-gram Workshop Workshop of the 33 rd Annual International (2010) (0)
- Typed Graph Models for Learning Latent Attributes from Names (2011) (0)
- Identification de sections codables dans des documents médicaux (2015) (0)
- Deciphering and Characterizing Out-of-Vocabulary Words for Morphologically Rich Languages (2022) (0)
- Known Words Will Do: Unknown Concept Translation via Lexical Relations (2022) (0)
- On the Robustness of Cognate Generation Models (2022) (0)
This paper list is powered by the following services: