Sanjeev P. Khudanpur

Sanjeev P. Khudanpur's AcademicInfluence.com Rankings

Engineering

#4376

World Rank

#5568

Historical Rank

Electrical Engineering

#1108

World Rank

#1191

Historical Rank

engineering Degrees

Sanjeev P. Khudanpur

Computer Science

#5622

World Rank

#5938

Historical Rank

Computational Linguistics

#832

World Rank

#845

Historical Rank

Database

#2764

World Rank

#2888

Historical Rank

computer-science Degrees

Download Badge

Engineering
Computer Science

Sanjeev P. Khudanpur's Degrees

PhD Electrical Engineering University of Southern California
Masters Electrical Engineering University of Southern California

Why Is Sanjeev P. Khudanpur Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Sanjeev P. Khudanpur's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Recurrent neural network based language model (2010) (5359)
Librispeech: An ASR corpus based on public domain audio books (2015) (3691)
X-Vectors: Robust DNN Embeddings for Speaker Recognition (2018) (1826)
Extensions of recurrent neural network language model (2011) (1517)
A time delay neural network architecture for efficient modeling of long temporal contexts (2015) (913)
Audio augmentation for speech recognition (2015) (885)
Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI (2016) (830)
A study on data augmentation of reverberant speech for robust speech recognition (2017) (622)
Deep Neural Network Embeddings for Text-Independent Speaker Verification (2017) (614)
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks (2018) (368)
A Smorgasbord of Features for Statistical Machine Translation (2004) (328)
Deep neural network-based speaker embeddings for end-to-end speaker verification (2016) (317)
Improving deep neural network acoustic models using generalized maxout networks (2014) (313)
A pitch extraction algorithm tuned for automatic speech recognition (2014) (309)
Highway long short-term memory RNNS for distant speech recognition (2015) (279)
JHU-ISI Gesture and Skill Assessment Working Set ( JIGSAWS ) : A Surgical Activity Dataset for Human Motion Modeling (2014) (272)
Speaker Recognition for Multi-speaker Conversations Using X-vectors (2019) (208)
Transliteration of Proper Names in Cross-Lingual Information Retrieval (2003) (195)
Developments and directions in speech recognition and understanding, Part 1 [DSP Education] (2009) (192)
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge (2018) (183)
Parallel training of DNNs with Natural Gradient and Parameter Averaging (2014) (175)
A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery (2017) (169)
Spoken Language Recognition using X-vectors (2018) (166)
Stochastic pronunciation modelling from hand-labelled phonetic corpora (1999) (163)
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation (2009) (157)
Pronunciation modeling by sharing gaussian densities across phonetic models (1999) (153)
Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs (2018) (150)
Research Developments and Directions in Speech Recognition and Understanding, Part 1 (2009) (146)
End-to-end Speech Recognition Using Lattice-free MMI (2018) (146)
Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging (2014) (141)
Sparse Hidden Markov Models for Surgical Gesture Classification and Skill Evaluation (2012) (138)
Towards language independent acoustic modeling (2000) (131)
A Time-Restricted Self-Attention Layer for ASR (2018) (125)
Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus (2013) (115)
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition (2013) (106)
Maximum entropy techniques for exploiting syntactic, semantic and collocational dependencies in language modeling (2000) (105)
JHU ASpIRE system: Robust LVCSR with TDNNS, iVector adaptation and RNN-LMS (2015) (104)
Unsupervised Learning of Acoustic Sub-word Units (2008) (103)
Using proxies for OOV keywords in the keyword search task (2013) (100)
Hidden Markov models for automatic annotation and content-based retrieval of images and video (2005) (99)
An Exploration of Dropout with LSTMs (2017) (97)
Data-Derived Models for Segmentation with Application to Surgical Assessment and Training (2009) (95)
A Pruned Rnnlm Lattice-Rescoring Algorithm for Automatic Speech Recognition (2018) (94)
Acoustic Modelling from the Signal Domain Using CNNs (2016) (86)
Structure and performance of a dependency language model (1997) (85)
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18 (2019) (84)
Generative Content Models for Structural Analysis of Medical Abstracts (2006) (83)
Reverberation robust acoustic modeling using i-vectors with time delay neural networks (2015) (82)
On large vocabulary continuous speech recognition of highly inflectional language - czech (2001) (81)
Automatic Recognition of Surgical Motions Using Statistical Modeling for Capturing Variability (2008) (81)
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio (2021) (74)
Investigation of transfer learning for ASR using LF-MMI trained neural networks (2017) (72)
Combination of strongly and weakly constrained recognizers for reliable detection of OOVS (2008) (72)
Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI (2018) (71)
Machine Translation System Combination using ITG-based Alignments (2008) (70)
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education] (2009) (69)
Neural Network Language Modeling with Letter-Based Features and Importance Sampling (2018) (68)
Syntax for Statistical Machine Translation (2003) (67)
Variational approximation of long-span language models for lvcsr (2011) (65)
Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval (2004) (62)
Probing the Information Encoded in X-Vectors (2019) (61)
Variational Decoding for Statistical Machine Translation (2009) (61)
A maximum entropy language model integrating N-grams and topic dependencies for conversational speech recognition (1999) (61)
Pronunciation modelling using a hand-labelled corpus for conversational speech recognition (1998) (55)
String Motif-Based Description of Tool Motion for Detecting Skill and Gestures in Robotic Surgery (2013) (53)
Automatic Learning of Word Pronunciation from Data (1996) (53)
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit (2019) (51)
Pronunciation and silence probability modeling for ASR (2015) (51)
Quantifying the value of pronunciation lexicons for keyword search in lowresource languages (2013) (50)
A Scalable Decoder for Parsing-Based Machine Translation with Equivalent Language Model State Maintenance (2008) (49)
Developments and Directions in Speech Recognition and Understanding , Part 1 T (49)
Transliteration of proper names in cross-language applications (2003) (48)
Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition (2018) (46)
Joint visual-text modeling for automatic retrieval of multimedia documents (2005) (46)
Semi-supervised maximum mutual information training of deep neural network acoustic models (2015) (45)
Speaker Diarization with Region Proposal Network (2020) (45)
Far-Field ASR Without Parallel Data (2016) (44)
Combining nonlocal, syntactic and n-gram dependencies in language modeling (1999) (43)
A keyword search system using open source software (2014) (43)
JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning (2017) (41)
Is automatic speech recognition ready for non-native speech? A data collection effort and initial experiments in modelling conversational Hispanic English (1998) (41)
End-to-end Deep Neural Network Age Estimation (2018) (41)
Rapid speech recognizer adaptation to new speakers (1999) (40)
Efficient training methods for maximum entropy language modeling (2000) (39)
x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition (2019) (39)
A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models (2018) (38)
Pronunciation change in conversational speech and its implications for automatic speech recognition (2004) (37)
Self-supervised discriminative training of statistical language models (2009) (35)
Maximum Likelihood Set for Estimating a Probability Mass Function (2005) (35)
The Kaldi OpenKWS System: Improving Low Resource Keyword Search (2017) (35)
Cross-lingual latent semantic analysis for language modeling (2004) (35)
Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR (2018) (34)
Comparing Reordering Constraints for SMT Using Efficient BLEU Oracle Computation (2007) (34)
DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs (2020) (33)
Pronunciation modeling for conversational speech recognition (2001) (33)
Lexical triggers and latent semantic analysis for cross-lingual language model adaptation (2004) (33)
Large-scale Discriminative n-gram Language Models for Statistical Machine Translation (2008) (32)
Stepwise Optimal Subspace Pursuit for Improving Sparse Recovery (2011) (32)
Pronunciation ambiguity vs. pronunciation variability in speech recognition (2000) (31)
Analysis of the Structure of Surgical Activity for a Suturing and Knot-Tying Task (2016) (30)
Learning and inference algorithms for dynamical system models of dextrous motion (2011) (29)
Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network (2019) (29)
Efficient Subsampling for Training Complex Language Models (2011) (28)
Making MIRACLEs: Interactive translingual search for Cebuano and Hindi (2003) (28)
LVCSR rescoring with modified loss functions: a decision theoretic perspective (1998) (27)
Backstitch: Counteracting Finite-Sample Bias via Negative Steps (2017) (27)
Pronunciation modelling for conversational speech recognition: a status report from WS97 (1997) (27)
The JHU Speaker Recognition System for the VOiCES 2019 Challenge (2019) (27)
Adapting ASR for under-resourced languages using mismatched transcriptions (2016) (26)
Joshua 2.0: A Toolkit for Parsing-Based Machine Translation with Syntax, Semirings, Discriminative Training and Other Goodies (2010) (26)
Hallucinated n-best lists for discriminative language modeling (2012) (25)
An investigation of acoustic models for multilingual code-switching (2008) (25)
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR (2020) (25)
WEB-derived pronunciations (2009) (25)
Task-Level vs. Segment-Level Quantitative Metrics for Surgical Skill Assessment. (2016) (23)
Building a topic-dependent maximum entropy model for very large corpora (2002) (23)
A Comparative Study of Word Co-occurrence for Term Clustering in Language Model-based Sentence Retrieval (2010) (23)
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap (2021) (23)
Some insights from translating conversational telephone speech (2014) (22)
Decoding in Joshua: Open Source, Parsing-Based Machine Translation (2009) (22)
Adversarial Attacks and Defenses for Speech Recognition Systems (2021) (21)
Efficient Extraction of Oracle-best Translations from Hypergraphs (2009) (21)
Large-scale random forest language models for speech recognition (2007) (20)
Latent Semantic Information in Maximum Entropy Language Models for Conversational Speech Recognition (2003) (19)
Historical Development and Future Directions in Speech Recognition and Understanding (2007) (19)
Language model adaptation for automatic speech recognition and statistical machine translation (2005) (19)
Maximum entropy language modeling with non-local dependencies (2003) (19)
Investigating Self-Supervised Learning for Speech Enhancement and Separation (2022) (19)
Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods (2018) (19)
Cross-Lingual Lexical Triggers in Statistical Language Modeling (2003) (18)
Acoustic Modeling for Overlapping Speech Recognition: Jhu Chime-5 Challenge System (2019) (18)
Output-Gate Projected Gated Recurrent Unit for Speech Recognition (2018) (18)
An Empirical Study of Transformer-Based Neural Language Model Adaptation (2020) (17)
Topic Identification for Speech Without ASR (2017) (17)
Wake Word Detection with Streaming Transformers (2021) (17)
A diversity-penalizing ensemble training method for deep learning (2015) (17)
Using cross-language cues for story-specific language modeling (2002) (17)
Using ASR Methods for OCR (2019) (17)
Semi-supervised discriminative language modeling for Turkish ASR (2012) (17)
Web derived pronunciations for spoken term detection (2009) (16)
Bayesian Models for Unit Discovery on a Very Low Resource Language (2018) (16)
Multi-Class Spectral Clustering with Overlaps for Speaker Diarization (2020) (16)
A GPU-based WFST Decoder with Exact Lattice Generation (2018) (16)
Large Vocabulary Speech Recognition for Read and Broadcast Czech (1999) (16)
Hill climbing on speech lattices: A new rescoring framework (2011) (16)
Structured variability in acoustic realization: a corpus study of voice onset time in American English stops (2015) (15)
WS96 project report: Automatic learning of word pronunciation from data (1997) (15)
Desparately Seeking Cebuano (2003) (15)
Translations of the Callhome Egyptian Arabic corpus for conversational speech translation (2014) (14)
An empirical evaluation of zero resource acoustic unit discovery (2017) (14)
Unsupervised surgical data alignment with application to automatic activity annotation (2016) (14)
Low-resource open vocabulary keyword search using point process models (2014) (14)
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets (2010) (14)
Forest Reranking for Machine Translation with the Perceptron Algorithm (2009) (13)
Unsupervised classification via decision trees: an information-theoretic perspective (2005) (13)
Acoustic Modeling from Frequency Domain Representations of Speech (2018) (13)
Order estimation for a special class of hidden Markov sources and binary renewal processes (2002) (13)
Estimating document frequencies in a speech corpus (2011) (13)
Wake Word Detection with Alignment-Free Lattice-Free MMI (2020) (13)
A Parallelizable Lattice Rescoring Strategy with Neural Language Models (2021) (12)
Cross-Instance Tuning of Unsupervised Document Clustering Algorithms (2007) (12)
Improving LF-MMI Using Unconstrained Supervisions for ASR (2018) (11)
The Johns Hopkins University 2003 Chinese-English machine translation system (2003) (11)
Unsupervised Arabic Dialect Adaptation with Self-Training (2011) (11)
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop (2015) (11)
Iterative Denoising using Jensen-Renyi Divergences with an Application to Unsupervised Document Categorization (2007) (11)
Topic identification of spoken documents using unsupervised acoustic unit discovery (2017) (10)
Limited resource term detection for effective topic identification of speech (2014) (10)
Language model adaptation using cross-lingual information (2003) (10)
Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains (2019) (10)
Query-by-example surgical activity detection (2016) (10)
Enhancement and Analysis of Conversational Speech: JSALT 2017 (2018) (10)
Smoothing issues in the structured language model (2001) (10)
Mandarin-English Information (MEI) (2000) (10)
Contemporaneous text as side-information in statistical language modeling (2004) (9)
End-to-End Language Diarization for Bilingual Code-Switching Speech (2021) (9)
Mandarin-English Information: Investigating Translingual Speech Retrieval (2001) (9)
Speaker Recognition Benchmark Using the CHiME-5 Corpus (2019) (9)
TRECVID 2005 Experiment at Johns Hopkins University: Using Hidden Markov Models for Video Retrieval (2005) (9)
Acoustic data-driven pronunciation lexicon generation for logographic languages (2016) (9)
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation (2015) (9)
Automatic Speech Recognition and Topic Identification for Almost-Zero-Resource Languages (2018) (9)
Multi-PLDA Diarization on Children's Speech (2019) (9)
Continuous space discriminative language modeling (2012) (9)
Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT - Workshop Notes (2012) (9)
Minimum Imputed-Risk: Unsupervised Discriminative Training for Machine Translation (2011) (8)
Fine-Grained Activity Recognition for Assembly Videos (2020) (8)
Phrasal Cohort Based Unsupervised Discriminative Language Modeling (2012) (8)
Constraints and Development in Children's Block Construction (2018) (8)
Efficient Structured Language Modeling for Speech Recognition (2012) (8)
Getting more from automatic transcripts for semi-supervised language modeling (2016) (8)
Automatically learning speaker-independent acoustic subword units (2008) (8)
Building Corpora for Single-Channel Speech Separation Across Multiple Domains (2018) (8)
Modeling phonetic context with non-random forests for speech recognition (2015) (8)
Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework (2017) (8)
Toward Computer Vision Systems That Understand Real-World Assembly Processes (2019) (7)
Sequential system combination for machine translation of speech (2008) (7)
Typicality of a Good Rate-Distortion Code (2004) (7)
On projections of Gaussian distributions using maximum likelihood criteria (2009) (7)
Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition (1999) (6)
Combining local and broad topic context to improve term detection (2014) (6)
Multilingual Spoken Term Detection: Finding and Testing New Pronunciations (2008) (6)
Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step (2020) (6)
Improving Passage Retrieval Using Interactive Elicition and Statistical Modeling (2004) (6)
Online Learning in Tensor Space (2014) (6)
Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining (2012) (6)
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge (2020) (5)
Hypothesis ranking and two-pass approaches for machine translation system combination (2010) (5)
Sample Selection for Large-scale MT Discriminative Training (2012) (5)
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer (2019) (5)
Adapting n-gram maximum entropy language models with conditional entropy regularization (2011) (5)
The JHU ASR System for VOiCES from a Distance Challenge 2019 (2019) (5)
Efficient discriminative training of long-span language models (2011) (5)
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings (2018) (5)
Sample selection for automatic language identification (2008) (5)
Characterizing spatial construction processes: Toward computational tools to understand cognition (2017) (5)
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification (2016) (5)
Dirichlet Mixture Models of neural net posteriors for HMM-based speech recognition (2011) (5)
Randomized maximum entropy language models (2011) (5)
Context-dependent point process models for keyword search and detection-based ASR (2016) (4)
Lhotse: a speech data representation library for the modern deep learning ecosystem (2021) (4)
Joint Visual-Text Modeling for Multimedia Retrieval (2004) (4)
Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection (2014) (4)
Revisiting the Case for Explicit Syntactic Information in Language Models (2012) (4)
Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models (2021) (4)
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser (2022) (3)
International Workshop on Spoken Language Translation (IWSLT 2013) (2013) (3)
Phone Duration Modeling for LVCSR Using Neural Networks (2017) (3)
Multilingual Language Modeling (2006) (3)
Semi-Supervised Methods for Improving Keyword Search of Unseen Terms (2012) (3)
Enhance Language Identification using Dual-mode Model with Knowledge Distillation (2022) (3)
Language Modeling with the Maximum Likelihood Set: Complexity Issues and the Back-off Formula (2006) (3)
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition (2021) (3)
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification (2022) (3)
Source Adaptation for Improved Content-Based Video Retrieval (2006) (3)
Frustratingly Easy Noise-aware Training of Acoustic Models (2020) (3)
Impact of novel sources on content-based image and video retrieval (2009) (3)
on Speech Recognition and Understanding , Part 2 (2009) (3)
JHU IWSLT 2022 Dialect Speech Translation System Description (2022) (2)
Syntactic heads in statistical language modeling (2000) (2)
Building Speech Recognition System from Untranscribed Data Report from JHU workshop 2016 (2016) (2)
OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR (2020) (2)
Neural Language Modeling with Implicit Cache Pointers (2020) (2)
Error Bounds and Improved Probability Estimation using the Maximum Likelihood Set (2007) (2)
Mixture of Speaker-type PLDAs for Children's Speech Diarization (2020) (2)
Recovery from Model Inconsistency in Multilingual Speech Recognition Report from JHU workshop 2007 (2)
TOWARDS LANGUAGE INDEPENDENT ACOUSTIC (1999) (2)
Efficient Self-Supervised Learning Representations for Spoken Language Identification (2022) (2)
Imperial College and Johns Hopkins University at TRECVID (2006) (2)
Efficient MDI Adaptation for n-gram Language Models (2020) (1)
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation (2021) (1)
Joint speaker diarization and speech recognition based on region proposal networks (2021) (1)
Learning Policies for Multilingual Training of Neural Machine Translation Systems (2021) (1)
Estimating Probabilities from Small Samples (1)
Open Source, Parsing-Based Machine Translation (2009) (1)
Unsupervised estimation of the language model scaling factor (2009) (1)
Computation of Csiszár’s mutual Information of order α (2008) (1)
Discriminative training and variational decoding in machine translation via novel algorithms for weighted hypergraphs (2010) (1)
Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem (2021) (1)
Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization (2022) (1)
Robust Knowledge Discovery from Parallel Speech and Text Sources (2001) (1)
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser (2022) (1)
An Alternative to MFCCs for ASR (2020) (1)
Speaker Verification-Based Evaluation of Single-Channel Speech Separation (2021) (1)
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora (2021) (1)
Learning and inference algorithms for partially observed structured switching vector autoregressive models (2011) (1)
Using of heterogeneous corpora for training of an ASR system (2017) (1)
Low-Resource Contextual Topic Identification on Speech (2018) (1)
GPU-accelerated Guided Source Separation for Meeting Transcription (2022) (1)
Maximum Entropy Language Modeling with Non-local and Syntactic Dependencies (2002) (1)
Confusion Network Decoding for MT System Combination (2012) (1)
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition (2022) (1)
Estimating Conditional Densities from Sparse Data for Statistical Language Modeling (2006) (0)
Typicality of a Good Rate-Distortion Code Angelos Kanlis (1996) (0)
Estimation of Probability Mass Functions from Small Samples (0)
Hidden Markov Models for Image and Video Retrieval Using Textual Queries (0)
Likelihood-Based Semi-Supervised Model Selection With Applications to Speech Processing (2009) (0)
EURO: ESPnet Unsupervised ASR Open-source Toolkit (2022) (0)
Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation (2013) (0)
Bottom-Up Unsupervised Word Discovery via Acoustic Units (2019) (0)
A greedy algorithm for sparse recovery using precise metrics (2010) (0)
Using Word Repetition to Improve Spoken Term Detection (2014) (0)
Data-Driven Statistical Models for Computer Integrated Surgery (2011) (0)
Learning Curricula for Multilingual Neural Machine Translation Training (2021) (0)
RESCORING A DECISION (1998) (0)
Practical and efficient incorporation of syntactic features into statistical language models (2012) (0)
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition (2021) (0)
Two Self-supervised Learning Techniques for Speech Recognition (2010) (0)
Modeling data-source variability for content-based video retrieval using hidden markov models (2009) (0)
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings (2022) (0)
Optical Character Recognition with Chinese and Korean Character Decomposition (2019) (0)
INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012 (2012) (0)
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages (2018) (0)
Hallucinating system outputs for discriminative language modeling (2012) (0)
Characterizing the Details of Spatial Construction: Cognitive Constraints and Variability (2022) (0)
Deriving conversation-based features from unlabeled speech for discriminative language modeling (2012) (0)
Incremental Lattice Determinization for WFST Decoders (2019) (0)
Chunking Defense for Adversarial Attacks on ASR (2022) (0)
A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data (2022) (0)
On the minimization of concave information functionals for unsupervised classification via decision trees (2008) (0)
Explorer Continuous space discriminative language modeling (0)
FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS (2018) (0)
LATTICE-RESCORING ALGORITHM FOR AUTOMATIC SPEECH RECOGNITION (2017) (0)
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection (2018) (0)
Building Keyword Search System from End-To-End Asr Systems (2023) (0)

This paper list is powered by the following services:

What Schools Are Affiliated With Sanjeev P. Khudanpur?

Sanjeev P. Khudanpur is affiliated with the following schools:

Johns Hopkins University

Sanjeev P. Khudanpur's Academic­Influence.com Rankings

Sanjeev P. Khudanpur's Degrees

Why Is Sanjeev P. Khudanpur Influential?

Sanjeev P. Khudanpur's Published Works

Published Works

What Schools Are Affiliated With Sanjeev P. Khudanpur?

Sanjeev P. Khudanpur's AcademicInfluence.com Rankings