Salim Roukos
#164,645
Most Influential Person Now
Salim Roukos's AcademicInfluence.com Rankings
Salim Roukoscomputer-science Degrees
Computer Science
#9771
World Rank
#10251
Historical Rank
Computational Linguistics
#2401
World Rank
#2425
Historical Rank
Database
#6728
World Rank
#6965
Historical Rank

Download Badge
Computer Science
Salim Roukos's Degrees
- PhD Computer Science Columbia University
- Masters Computer Science Columbia University
- Bachelors Computer Science Columbia University
Similar Degrees You Can Earn
Why Is Salim Roukos Influential?
(Suggest an Edit or Addition)Salim Roukos's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Bleu: a Method for Automatic Evaluation of Machine Translation (2002) (19349)
- A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars (1991) (568)
- Towards History-based Grammars: Using Richer Models for Probabilistic Parsing (1993) (282)
- A Maximum Entropy Model for Prepositional Phrase Attachment (1994) (277)
- Continuous hidden Markov modeling for speaker-independent word spotting (1989) (262)
- A Mention-Synchronous Coreference Resolution Algorithm Based On the Bell Tree (2004) (255)
- Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002 (2003) (243)
- A stochastic segment model for phoneme-based continuous speech recognition (1989) (221)
- Trigger-based language models: a maximum entropy approach (1993) (221)
- A Statistical Model for Multilingual Entity Detection and Tracking (2004) (207)
- Active Learning for Statistical Natural Language Parsing (2002) (192)
- Language Model Based Arabic Word Segmentation (2003) (167)
- A Dynamic Language Model for Speech Recognition (1991) (150)
- IBM's Statistical Question Answering System-TREC 11 (2001) (143)
- A Maximum Entropy Word Aligner for Arabic-English Machine Translation (2005) (125)
- tRuEcasIng (2003) (120)
- Adaptive Language Modeling Using Minimum Discriminant Estimation (1992) (119)
- Decision Tree Parsing using a Hidden Derivation Model (1994) (110)
- Maximum likelihood and discriminative training of direct translation models (1998) (108)
- Learning to Predict Readability using Diverse Linguistic Features (2010) (107)
- Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task (1995) (104)
- DARPA communicator: cross-system results for the 2001 evaluation (2002) (99)
- Adaptive Language Modeling Using the Maximum Entropy Principle (1993) (76)
- Development and Evaluation of a Broad-Coverage Probabilistic Grammar of English-Language Computer Manuals (1992) (74)
- GPT-too: A Language-Model-First Approach for AMR-to-Text Generation (2020) (73)
- Ad hoc and Multilingual Information Retrieval at IBM (1998) (73)
- Feature-based language understanding (1997) (72)
- Decision Tree Models Applied to the Labeling of Text with Parts-of-Speech (1992) (68)
- System Combination for Machine Translation of Spoken and Written Language (2008) (65)
- Direct Translation Model 2 (2007) (65)
- Corpus-based comprehensive and diagnostic MT evaluation: initial Arabic, Chinese, French, and Spanish results (2002) (59)
- A maximum entropy model for parsing (1994) (57)
- Word-based confidence measures as a guide for stack search in speech recognition (1997) (55)
- DARPA communicator evaluation: progress from 2000 to 2001 (2002) (54)
- Leveraging Abstract Meaning Representation for Knowledge Base Question Answering (2020) (52)
- Rewarding Smatch: Transition-Based AMR Parsing with Reinforcement Learning (2019) (50)
- Statistical natural language understanding using hidden clumpings (1996) (44)
- Audio-Indexing For Broadcast News (1998) (42)
- Free-flow dialog management using forms (1999) (41)
- Automatic Derivation of Surface Text Patterns for a Maximum Entropy Based Question Answering System (2003) (40)
- A multistage algorithm for spotting new words in speech (2002) (40)
- The IBM conversational telephony system for financial applications (1999) (40)
- Towards a universal speech recognizer for multiple languages (1997) (40)
- Story Segmentation and Topic Detection in the Broadcast News Domain (1999) (34)
- Improving Mention Detection Robustness to Noisy Input (2010) (33)
- Fertility Models for Statistical Natural Language Understanding (1997) (33)
- Towards History-based Grammars: Using Richer Models for Probabilistic Parsing (1992) (33)
- Phrase splicing and variable substitution using the IBM trainable speech synthesis system (1999) (30)
- Classifying words for improved statistical language models (1990) (30)
- Extracting Social Networks and Biographical Facts From Conversational Speech Transcripts (2007) (29)
- The TechQA Dataset (2019) (27)
- Language model adaptation via minimum discrimination information (1995) (27)
- MDI adaptation of language models across corpora (1997) (25)
- Question Answering over Knowledge Bases by Leveraging Semantic Parsing and Neuro-Symbolic Reasoning (2020) (24)
- Multi-Stage Pretraining for Low-Resource Domain Adaptation (2020) (22)
- Iterative sentence-pair extraction from quasi-parallel corpora for machine translation (2009) (22)
- Towards speech understanding across multiple languages (1998) (22)
- Pushing the Limits of AMR Parsing with Self-Learning (2020) (21)
- Segmentation and detection at IBM: Hybrid statistical models and two-tiered clustering broadcast new (2000) (21)
- A Flexible Framework for Developing Mixed-Initiative Dialog Systems (2002) (20)
- Phone-context specific gender-dependent acoustic-models for continuous speech recognition (1997) (19)
- Leveraging Semantic Parsing for Relation Linking over Knowledge Bases (2020) (19)
- A fast vocabulary independent algorithm for spotting words in speech (1998) (18)
- Identifying and Tracking Entity Mentions in a Maximum Entropy Framework (2003) (18)
- Story segmentation and topic detection for recognized speech (1999) (17)
- Structure-aware Fine-tuning of Sequence-to-sequence Transformers for Transition-based AMR Parsing (2021) (17)
- An Iterative Algorithm to Build Chinese Language Models (1996) (16)
- A Semantic Parsing and Reasoning-Based Approach to Knowledge Base Question Answering (2021) (16)
- Speech understanding using a unification grammar (1989) (16)
- Language representation (1997) (16)
- End-to-End QA on COVID-19: Domain Adaptation with Synthetic Training (2020) (15)
- A Semantics-aware Transformer Model of Relation Linking for Knowledge Base Question Answering (2021) (14)
- Fast document translation for cross-language information retrieval (1998) (14)
- Frustratingly Easy Natural Question Answering (2019) (13)
- TREC-6 Ad-Hoc Retrieval (1997) (13)
- Proceedings of HLT-NAACL 2004: Short Papers (2004) (12)
- Towards building a Robust Industry-scale Question Answering System (2020) (11)
- IBM spoken language translation system evaluation (2004) (11)
- TRANSCRIPTION OF NEW SPEAKING STYLES - VOICEMAIL (1998) (9)
- Bootstrapping Multilingual AMR with Contextual Word Alignments (2021) (9)
- The BBN Spoken Language System (1989) (9)
- Experimental Results in Audio Indexing (1997) (8)
- TREC-5 Ad Hoc Retrieval Using K Nearest-Neighbors Re-Scoring (1996) (8)
- A Correction Model for Word Alignments (2011) (8)
- CFO: A Framework for Building Production NLP Systems (2019) (7)
- Statistical methods for topic segmentation (2000) (7)
- Distilling and exploring nuggets from a corpus (2012) (7)
- Use of recursive mumble models for confidence measuring (1999) (6)
- New word detection in audio-indexing (1997) (6)
- Heuristics for Interpretable Knowledge Graph Contextualization (2019) (5)
- Probabilistic Modeling for Information Retrieval with Unsupervised Training Data (1998) (5)
- Unsupervised adaptation of statistical parsers based on Markov trans-form (1999) (5)
- Maximum Bayes Smatch Ensemble Distillation for AMR Parsing (2021) (5)
- Corpus-based comprehensive and di-agnostic mt evaluation: Initial arabic (2002) (5)
- DocAMR: Multi-Sentence AMR Representation and Evaluation (2021) (4)
- A statistical approach to language modelling for the ATIS task (1995) (4)
- Integrating Speech and Natural Language (1989) (4)
- SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases (2021) (4)
- Natural Language Understanding (2008) (4)
- Synthetic Target Domain Supervision for Open Retrieval QA (2021) (4)
- Multi-lingual Text Leveling (2014) (3)
- Learning to Transpile AMR into SPARQL (2021) (3)
- A Multilingual Reading Comprehension System for more than 100 Languages (2020) (3)
- IBM Chinese-to-English PatentMT System for NTCIR-9 (2011) (3)
- Recent results on MT evaluation in the GALE program (2006) (3)
- Phrase splicing and variable substitution using a trainable speech synthesizer (2002) (3)
- Adaptive HTER Estimation for Document-Specific MT Post-Editing (2014) (3)
- Path-Based Contextualization of Knowledge Graphs for Textual Entailment. (2019) (3)
- Combining Rules and Embeddings via Neuro-Symbolic AI for Knowledge Base Completion (2021) (3)
- Document-Specific Statistical Machine Translation for Improving Human Translation Productivity (2012) (3)
- Infrastructure and Systems for Adaptive Speech and Text Analytics (2)
- A method for scoring correlated features in query expansion (1998) (2)
- A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases (2022) (2)
- Logical Neural Networks for Knowledge Base Completion with Embeddings & Rules (2022) (2)
- TIPS: A Translingual Information Processing System (2003) (2)
- Real-time multilingual HMM training robust to channel variations (2000) (2)
- A novel approach for proper name transliteration verification (2010) (1)
- Adaptation of large vocabulary recognition system parameters (1992) (1)
- ARES: A Reading Comprehension Ensembling Service (2020) (1)
- Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking (2022) (1)
- SYGMA: A System for Generalizable and Modular Question Answering Over Knowledge Bases (2022) (1)
- Active Learning for Mention Detection: A Comparison of Sentence Selection Strategies (2009) (1)
- Rethinking Full-Text Search for Multi-lingual Databases (2007) (1)
- Invited Talk: IBM Cognitive Computing - An NLP Renaissance! (2014) (1)
- UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers (2023) (1)
- Method for forming language modeling system (1994) (0)
- Session 7: Natural Language II (1991) (0)
- AMR Parsing with Instruction Fine-tuned Pre-trained Language Models (2023) (0)
- PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development (2023) (0)
- Improving MT post-editing productivity with adaptive confidence estimation for document-specific translation model (2014) (0)
- Speech recognition models combining gender-dependent and gender-independent phone states and using phonetic-context-dependence (2000) (0)
- Ensembling Strategies for Answering Natural Questions (2019) (0)
- Improving MT post-editing productivity with adaptive confidence estimation for document-specific translation model (2014) (0)
- Rosetta: an analyst’s co-pilot (2006) (0)
- 1 A Hidden Tag Model for LanguageE (1996) (0)
- Efficient Domain-Adaptive Word Segmentation with Larger Context and Co-Training (2013) (0)
- CONTINUOUS SPEECH RECOGN ITION SYSTEM ON THE ARPA WALL STREET JOURNAL TASK (1995) (0)
- Research on Narrowband Communications. (1981) (0)
- TRANSLATE ES EN SPANISH DOCUMENT POOL PAIRING DOCUMENT ENGLISH DOCUMENT POOL SPANISH DOCUMENTDOCUMENT ENGLISH SENTENCE ALIGNMENT PARALLEL SEED DATA ENGLISH SPANISH BUILD SMT (2009) (0)
- Automatic Extraction of Grammars From Annotated Text (1993) (0)
- Automatic Extraction of Grammars From Annotated Text (0)
- A novel use of MT in the development of a text level analytic for language learning (2014) (0)
- Speech Research: Near and Not-so-near Results and What They Might Mean for IUI (Panel). (1998) (0)
- Links with Answers: Query Answering for Customer Support (2019) (0)
- Acquisition of language models from text (1992) (0)
- KAAPA: Knowledge Aware Answers from PDF Analysis (2021) (0)
- Statistical Methods for Translingual Information Retrieval (2000) (0)
- Real Time Translation Services at IBM (2009) (0)
- Zero-shot Entity Linking with Less Data (2022) (0)
- Speech research (panel): near and not-so-near results and what they might mean for IUI (1998) (0)
- A Closer Look at the Calibration of Differentially Private Learners (2022) (0)
This paper list is powered by the following services: