Sanjeev P. Khudanpur
#129,391
Most Influential Person Now
Sanjeev P. Khudanpur's AcademicInfluence.com Rankings
Sanjeev P. Khudanpurengineering Degrees
Engineering
#4376
World Rank
#5568
Historical Rank
Electrical Engineering
#1108
World Rank
#1191
Historical Rank

Sanjeev P. Khudanpurcomputer-science Degrees
Computer Science
#5622
World Rank
#5938
Historical Rank
Computational Linguistics
#832
World Rank
#845
Historical Rank
Database
#2764
World Rank
#2888
Historical Rank

Download Badge
Engineering Computer Science
Sanjeev P. Khudanpur's Degrees
- PhD Electrical Engineering University of Southern California
- Masters Electrical Engineering University of Southern California
Why Is Sanjeev P. Khudanpur Influential?
(Suggest an Edit or Addition)Sanjeev P. Khudanpur's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Recurrent neural network based language model (2010) (5359)
- Librispeech: An ASR corpus based on public domain audio books (2015) (3691)
- X-Vectors: Robust DNN Embeddings for Speaker Recognition (2018) (1826)
- Extensions of recurrent neural network language model (2011) (1517)
- A time delay neural network architecture for efficient modeling of long temporal contexts (2015) (913)
- Audio augmentation for speech recognition (2015) (885)
- Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI (2016) (830)
- A study on data augmentation of reverberant speech for robust speech recognition (2017) (622)
- Deep Neural Network Embeddings for Text-Independent Speaker Verification (2017) (614)
- Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks (2018) (368)
- A Smorgasbord of Features for Statistical Machine Translation (2004) (328)
- Deep neural network-based speaker embeddings for end-to-end speaker verification (2016) (317)
- Improving deep neural network acoustic models using generalized maxout networks (2014) (313)
- A pitch extraction algorithm tuned for automatic speech recognition (2014) (309)
- Highway long short-term memory RNNS for distant speech recognition (2015) (279)
- JHU-ISI Gesture and Skill Assessment Working Set ( JIGSAWS ) : A Surgical Activity Dataset for Human Motion Modeling (2014) (272)
- Speaker Recognition for Multi-speaker Conversations Using X-vectors (2019) (208)
- Transliteration of Proper Names in Cross-Lingual Information Retrieval (2003) (195)
- Developments and directions in speech recognition and understanding, Part 1 [DSP Education] (2009) (192)
- Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge (2018) (183)
- Parallel training of DNNs with Natural Gradient and Parameter Averaging (2014) (175)
- A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery (2017) (169)
- Spoken Language Recognition using X-vectors (2018) (166)
- Stochastic pronunciation modelling from hand-labelled phonetic corpora (1999) (163)
- Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation (2009) (157)
- Pronunciation modeling by sharing gaussian densities across phonetic models (1999) (153)
- Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs (2018) (150)
- Research Developments and Directions in Speech Recognition and Understanding, Part 1 (2009) (146)
- End-to-end Speech Recognition Using Lattice-free MMI (2018) (146)
- Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging (2014) (141)
- Sparse Hidden Markov Models for Surgical Gesture Classification and Skill Evaluation (2012) (138)
- Towards language independent acoustic modeling (2000) (131)
- A Time-Restricted Self-Attention Layer for ASR (2018) (125)
- Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus (2013) (115)
- A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition (2013) (106)
- Maximum entropy techniques for exploiting syntactic, semantic and collocational dependencies in language modeling (2000) (105)
- JHU ASpIRE system: Robust LVCSR with TDNNS, iVector adaptation and RNN-LMS (2015) (104)
- Unsupervised Learning of Acoustic Sub-word Units (2008) (103)
- Using proxies for OOV keywords in the keyword search task (2013) (100)
- Hidden Markov models for automatic annotation and content-based retrieval of images and video (2005) (99)
- An Exploration of Dropout with LSTMs (2017) (97)
- Data-Derived Models for Segmentation with Application to Surgical Assessment and Training (2009) (95)
- A Pruned Rnnlm Lattice-Rescoring Algorithm for Automatic Speech Recognition (2018) (94)
- Acoustic Modelling from the Signal Domain Using CNNs (2016) (86)
- Structure and performance of a dependency language model (1997) (85)
- State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18 (2019) (84)
- Generative Content Models for Structural Analysis of Medical Abstracts (2006) (83)
- Reverberation robust acoustic modeling using i-vectors with time delay neural networks (2015) (82)
- On large vocabulary continuous speech recognition of highly inflectional language - czech (2001) (81)
- Automatic Recognition of Surgical Motions Using Statistical Modeling for Capturing Variability (2008) (81)
- GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio (2021) (74)
- Investigation of transfer learning for ASR using LF-MMI trained neural networks (2017) (72)
- Combination of strongly and weakly constrained recognizers for reliable detection of OOVS (2008) (72)
- Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI (2018) (71)
- Machine Translation System Combination using ITG-based Alignments (2008) (70)
- Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education] (2009) (69)
- Neural Network Language Modeling with Letter-Based Features and Importance Sampling (2018) (68)
- Syntax for Statistical Machine Translation (2003) (67)
- Variational approximation of long-span language models for lvcsr (2011) (65)
- Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval (2004) (62)
- Probing the Information Encoded in X-Vectors (2019) (61)
- Variational Decoding for Statistical Machine Translation (2009) (61)
- A maximum entropy language model integrating N-grams and topic dependencies for conversational speech recognition (1999) (61)
- Pronunciation modelling using a hand-labelled corpus for conversational speech recognition (1998) (55)
- String Motif-Based Description of Tool Motion for Detecting Skill and Gestures in Robotic Surgery (2013) (53)
- Automatic Learning of Word Pronunciation from Data (1996) (53)
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit (2019) (51)
- Pronunciation and silence probability modeling for ASR (2015) (51)
- Quantifying the value of pronunciation lexicons for keyword search in lowresource languages (2013) (50)
- A Scalable Decoder for Parsing-Based Machine Translation with Equivalent Language Model State Maintenance (2008) (49)
- Developments and Directions in Speech Recognition and Understanding , Part 1 T (49)
- Transliteration of proper names in cross-language applications (2003) (48)
- Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition (2018) (46)
- Joint visual-text modeling for automatic retrieval of multimedia documents (2005) (46)
- Semi-supervised maximum mutual information training of deep neural network acoustic models (2015) (45)
- Speaker Diarization with Region Proposal Network (2020) (45)
- Far-Field ASR Without Parallel Data (2016) (44)
- Combining nonlocal, syntactic and n-gram dependencies in language modeling (1999) (43)
- A keyword search system using open source software (2014) (43)
- JHU Kaldi system for Arabic MGB-3 ASR challenge using diarization, audio-transcript alignment and transfer learning (2017) (41)
- Is automatic speech recognition ready for non-native speech? A data collection effort and initial experiments in modelling conversational Hispanic English (1998) (41)
- End-to-end Deep Neural Network Age Estimation (2018) (41)
- Rapid speech recognizer adaptation to new speakers (1999) (40)
- Efficient training methods for maximum entropy language modeling (2000) (39)
- x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition (2019) (39)
- A Teacher-Student Learning Approach for Unsupervised Domain Adaptation of Sequence-Trained ASR Models (2018) (38)
- Pronunciation change in conversational speech and its implications for automatic speech recognition (2004) (37)
- Self-supervised discriminative training of statistical language models (2009) (35)
- Maximum Likelihood Set for Estimating a Probability Mass Function (2005) (35)
- The Kaldi OpenKWS System: Improving Low Resource Keyword Search (2017) (35)
- Cross-lingual latent semantic analysis for language modeling (2004) (35)
- Flat-Start Single-Stage Discriminatively Trained HMM-Based Models for ASR (2018) (34)
- Comparing Reordering Constraints for SMT Using Efficient BLEU Oracle Computation (2007) (34)
- DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs (2020) (33)
- Pronunciation modeling for conversational speech recognition (2001) (33)
- Lexical triggers and latent semantic analysis for cross-lingual language model adaptation (2004) (33)
- Large-scale Discriminative n-gram Language Models for Statistical Machine Translation (2008) (32)
- Stepwise Optimal Subspace Pursuit for Improving Sparse Recovery (2011) (32)
- Pronunciation ambiguity vs. pronunciation variability in speech recognition (2000) (31)
- Analysis of the Structure of Surgical Activity for a Suturing and Knot-Tying Task (2016) (30)
- Learning and inference algorithms for dynamical system models of dextrous motion (2011) (29)
- Advances in Automatic Speech Recognition for Child Speech Using Factored Time Delay Neural Network (2019) (29)
- Efficient Subsampling for Training Complex Language Models (2011) (28)
- Making MIRACLEs: Interactive translingual search for Cebuano and Hindi (2003) (28)
- LVCSR rescoring with modified loss functions: a decision theoretic perspective (1998) (27)
- Backstitch: Counteracting Finite-Sample Bias via Negative Steps (2017) (27)
- Pronunciation modelling for conversational speech recognition: a status report from WS97 (1997) (27)
- The JHU Speaker Recognition System for the VOiCES 2019 Challenge (2019) (27)
- Adapting ASR for under-resourced languages using mismatched transcriptions (2016) (26)
- Joshua 2.0: A Toolkit for Parsing-Based Machine Translation with Syntax, Semirings, Discriminative Training and Other Goodies (2010) (26)
- Hallucinated n-best lists for discriminative language modeling (2012) (25)
- An investigation of acoustic models for multilingual code-switching (2008) (25)
- PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR (2020) (25)
- WEB-derived pronunciations (2009) (25)
- Task-Level vs. Segment-Level Quantitative Metrics for Surgical Skill Assessment. (2016) (23)
- Building a topic-dependent maximum entropy model for very large corpora (2002) (23)
- A Comparative Study of Word Co-occurrence for Term Clustering in Language Model-based Sentence Retrieval (2010) (23)
- The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap (2021) (23)
- Some insights from translating conversational telephone speech (2014) (22)
- Decoding in Joshua: Open Source, Parsing-Based Machine Translation (2009) (22)
- Adversarial Attacks and Defenses for Speech Recognition Systems (2021) (21)
- Efficient Extraction of Oracle-best Translations from Hypergraphs (2009) (21)
- Large-scale random forest language models for speech recognition (2007) (20)
- Latent Semantic Information in Maximum Entropy Language Models for Conversational Speech Recognition (2003) (19)
- Historical Development and Future Directions in Speech Recognition and Understanding (2007) (19)
- Language model adaptation for automatic speech recognition and statistical machine translation (2005) (19)
- Maximum entropy language modeling with non-local dependencies (2003) (19)
- Investigating Self-Supervised Learning for Speech Enhancement and Separation (2022) (19)
- Characterizing Performance of Speaker Diarization Systems on Far-Field Speech Using Standard Methods (2018) (19)
- Cross-Lingual Lexical Triggers in Statistical Language Modeling (2003) (18)
- Acoustic Modeling for Overlapping Speech Recognition: Jhu Chime-5 Challenge System (2019) (18)
- Output-Gate Projected Gated Recurrent Unit for Speech Recognition (2018) (18)
- An Empirical Study of Transformer-Based Neural Language Model Adaptation (2020) (17)
- Topic Identification for Speech Without ASR (2017) (17)
- Wake Word Detection with Streaming Transformers (2021) (17)
- A diversity-penalizing ensemble training method for deep learning (2015) (17)
- Using cross-language cues for story-specific language modeling (2002) (17)
- Using ASR Methods for OCR (2019) (17)
- Semi-supervised discriminative language modeling for Turkish ASR (2012) (17)
- Web derived pronunciations for spoken term detection (2009) (16)
- Bayesian Models for Unit Discovery on a Very Low Resource Language (2018) (16)
- Multi-Class Spectral Clustering with Overlaps for Speaker Diarization (2020) (16)
- A GPU-based WFST Decoder with Exact Lattice Generation (2018) (16)
- Large Vocabulary Speech Recognition for Read and Broadcast Czech (1999) (16)
- Hill climbing on speech lattices: A new rescoring framework (2011) (16)
- Structured variability in acoustic realization: a corpus study of voice onset time in American English stops (2015) (15)
- WS96 project report: Automatic learning of word pronunciation from data (1997) (15)
- Desparately Seeking Cebuano (2003) (15)
- Translations of the Callhome Egyptian Arabic corpus for conversational speech translation (2014) (14)
- An empirical evaluation of zero resource acoustic unit discovery (2017) (14)
- Unsupervised surgical data alignment with application to automatic activity annotation (2016) (14)
- Low-resource open vocabulary keyword search using point process models (2014) (14)
- Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets (2010) (14)
- Forest Reranking for Machine Translation with the Perceptron Algorithm (2009) (13)
- Unsupervised classification via decision trees: an information-theoretic perspective (2005) (13)
- Acoustic Modeling from Frequency Domain Representations of Speech (2018) (13)
- Order estimation for a special class of hidden Markov sources and binary renewal processes (2002) (13)
- Estimating document frequencies in a speech corpus (2011) (13)
- Wake Word Detection with Alignment-Free Lattice-Free MMI (2020) (13)
- A Parallelizable Lattice Rescoring Strategy with Neural Language Models (2021) (12)
- Cross-Instance Tuning of Unsupervised Document Clustering Algorithms (2007) (12)
- Improving LF-MMI Using Unconstrained Supervisions for ASR (2018) (11)
- The Johns Hopkins University 2003 Chinese-English machine translation system (2003) (11)
- Unsupervised Arabic Dialect Adaptation with Self-Training (2011) (11)
- Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop (2015) (11)
- Iterative Denoising using Jensen-Renyi Divergences with an Application to Unsupervised Document Categorization (2007) (11)
- Topic identification of spoken documents using unsupervised acoustic unit discovery (2017) (10)
- Limited resource term detection for effective topic identification of speech (2014) (10)
- Language model adaptation using cross-lingual information (2003) (10)
- Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains (2019) (10)
- Query-by-example surgical activity detection (2016) (10)
- Enhancement and Analysis of Conversational Speech: JSALT 2017 (2018) (10)
- Smoothing issues in the structured language model (2001) (10)
- Mandarin-English Information (MEI) (2000) (10)
- Contemporaneous text as side-information in statistical language modeling (2004) (9)
- End-to-End Language Diarization for Bilingual Code-Switching Speech (2021) (9)
- Mandarin-English Information: Investigating Translingual Speech Retrieval (2001) (9)
- Speaker Recognition Benchmark Using the CHiME-5 Corpus (2019) (9)
- TRECVID 2005 Experiment at Johns Hopkins University: Using Hidden Markov Models for Video Retrieval (2005) (9)
- Acoustic data-driven pronunciation lexicon generation for logographic languages (2016) (9)
- A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation (2015) (9)
- Automatic Speech Recognition and Topic Identification for Almost-Zero-Resource Languages (2018) (9)
- Multi-PLDA Diarization on Children's Speech (2019) (9)
- Continuous space discriminative language modeling (2012) (9)
- Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT - Workshop Notes (2012) (9)
- Minimum Imputed-Risk: Unsupervised Discriminative Training for Machine Translation (2011) (8)
- Fine-Grained Activity Recognition for Assembly Videos (2020) (8)
- Phrasal Cohort Based Unsupervised Discriminative Language Modeling (2012) (8)
- Constraints and Development in Children's Block Construction (2018) (8)
- Efficient Structured Language Modeling for Speech Recognition (2012) (8)
- Getting more from automatic transcripts for semi-supervised language modeling (2016) (8)
- Automatically learning speaker-independent acoustic subword units (2008) (8)
- Building Corpora for Single-Channel Speech Separation Across Multiple Domains (2018) (8)
- Modeling phonetic context with non-random forests for speech recognition (2015) (8)
- Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework (2017) (8)
- Toward Computer Vision Systems That Understand Real-World Assembly Processes (2019) (7)
- Sequential system combination for machine translation of speech (2008) (7)
- Typicality of a Good Rate-Distortion Code (2004) (7)
- On projections of Gaussian distributions using maximum likelihood criteria (2009) (7)
- Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition (1999) (6)
- Combining local and broad topic context to improve term detection (2014) (6)
- Multilingual Spoken Term Detection: Finding and Testing New Pronunciations (2008) (6)
- Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step (2020) (6)
- Improving Passage Retrieval Using Interactive Elicition and Statistical Modeling (2004) (6)
- Online Learning in Tensor Space (2014) (6)
- Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining (2012) (6)
- The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge (2020) (5)
- Hypothesis ranking and two-pass approaches for machine translation system combination (2010) (5)
- Sample Selection for Large-scale MT Discriminative Training (2012) (5)
- Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer (2019) (5)
- Adapting n-gram maximum entropy language models with conditional entropy regularization (2011) (5)
- The JHU ASR System for VOiCES from a Distance Challenge 2019 (2019) (5)
- Efficient discriminative training of long-span language models (2011) (5)
- Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings (2018) (5)
- Sample selection for automatic language identification (2008) (5)
- Characterizing spatial construction processes: Toward computational tools to understand cognition (2017) (5)
- New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification (2016) (5)
- Dirichlet Mixture Models of neural net posteriors for HMM-based speech recognition (2011) (5)
- Randomized maximum entropy language models (2011) (5)
- Context-dependent point process models for keyword search and detection-based ASR (2016) (4)
- Lhotse: a speech data representation library for the modern deep learning ecosystem (2021) (4)
- Joint Visual-Text Modeling for Multimedia Retrieval (2004) (4)
- Can You Repeat That? Using Word Repetition to Improve Spoken Term Detection (2014) (4)
- Revisiting the Case for Explicit Syntactic Information in Language Models (2012) (4)
- Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models (2021) (4)
- Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser (2022) (3)
- International Workshop on Spoken Language Translation (IWSLT 2013) (2013) (3)
- Phone Duration Modeling for LVCSR Using Neural Networks (2017) (3)
- Multilingual Language Modeling (2006) (3)
- Semi-Supervised Methods for Improving Keyword Search of Unseen Terms (2012) (3)
- Enhance Language Identification using Dual-mode Model with Knowledge Distillation (2022) (3)
- Language Modeling with the Maximum Likelihood Set: Complexity Issues and the Back-off Formula (2006) (3)
- Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition (2021) (3)
- PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification (2022) (3)
- Source Adaptation for Improved Content-Based Video Retrieval (2006) (3)
- Frustratingly Easy Noise-aware Training of Acoustic Models (2020) (3)
- Impact of novel sources on content-based image and video retrieval (2009) (3)
- on Speech Recognition and Understanding , Part 2 (2009) (3)
- JHU IWSLT 2022 Dialect Speech Translation System Description (2022) (2)
- Syntactic heads in statistical language modeling (2000) (2)
- Building Speech Recognition System from Untranscribed Data Report from JHU workshop 2016 (2016) (2)
- OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR (2020) (2)
- Neural Language Modeling with Implicit Cache Pointers (2020) (2)
- Error Bounds and Improved Probability Estimation using the Maximum Likelihood Set (2007) (2)
- Mixture of Speaker-type PLDAs for Children's Speech Diarization (2020) (2)
- Recovery from Model Inconsistency in Multilingual Speech Recognition Report from JHU workshop 2007 (2)
- TOWARDS LANGUAGE INDEPENDENT ACOUSTIC (1999) (2)
- Efficient Self-Supervised Learning Representations for Spoken Language Identification (2022) (2)
- Imperial College and Johns Hopkins University at TRECVID (2006) (2)
- Efficient MDI Adaptation for n-gram Language Models (2020) (1)
- LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation (2021) (1)
- Joint speaker diarization and speech recognition based on region proposal networks (2021) (1)
- Learning Policies for Multilingual Training of Neural Machine Translation Systems (2021) (1)
- Estimating Probabilities from Small Samples (1)
- Open Source, Parsing-Based Machine Translation (2009) (1)
- Unsupervised estimation of the language model scaling factor (2009) (1)
- Computation of Csiszár’s mutual Information of order α (2008) (1)
- Discriminative training and variational decoding in machine translation via novel algorithms for weighted hypergraphs (2010) (1)
- Reformulating DOVER-Lap Label Mapping as a Graph Partitioning Problem (2021) (1)
- Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization (2022) (1)
- Robust Knowledge Discovery from Parallel Speech and Text Sources (2001) (1)
- Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser (2022) (1)
- An Alternative to MFCCs for ASR (2020) (1)
- Speaker Verification-Based Evaluation of Single-Channel Speech Separation (2021) (1)
- Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora (2021) (1)
- Learning and inference algorithms for partially observed structured switching vector autoregressive models (2011) (1)
- Using of heterogeneous corpora for training of an ASR system (2017) (1)
- Low-Resource Contextual Topic Identification on Speech (2018) (1)
- GPU-accelerated Guided Source Separation for Meeting Transcription (2022) (1)
- Maximum Entropy Language Modeling with Non-local and Syntactic Dependencies (2002) (1)
- Confusion Network Decoding for MT System Combination (2012) (1)
- Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition (2022) (1)
- Estimating Conditional Densities from Sparse Data for Statistical Language Modeling (2006) (0)
- Typicality of a Good Rate-Distortion Code Angelos Kanlis (1996) (0)
- Estimation of Probability Mass Functions from Small Samples (0)
- Hidden Markov Models for Image and Video Retrieval Using Textual Queries (0)
- Likelihood-Based Semi-Supervised Model Selection With Applications to Speech Processing (2009) (0)
- EURO: ESPnet Unsupervised ASR Open-source Toolkit (2022) (0)
- Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation (2013) (0)
- Bottom-Up Unsupervised Word Discovery via Acoustic Units (2019) (0)
- A greedy algorithm for sparse recovery using precise metrics (2010) (0)
- Using Word Repetition to Improve Spoken Term Detection (2014) (0)
- Data-Driven Statistical Models for Computer Integrated Surgery (2011) (0)
- Learning Curricula for Multilingual Neural Machine Translation Training (2021) (0)
- RESCORING A DECISION (1998) (0)
- Practical and efficient incorporation of syntactic features into statistical language models (2012) (0)
- An Asynchronous WFST-Based Decoder for Automatic Speech Recognition (2021) (0)
- Two Self-supervised Learning Techniques for Speech Recognition (2010) (0)
- Modeling data-source variability for content-based video retrieval using hidden markov models (2009) (0)
- Adapting self-supervised models to multi-talker speech recognition using speaker embeddings (2022) (0)
- Optical Character Recognition with Chinese and Korean Character Decomposition (2019) (0)
- INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012 (2012) (0)
- Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages (2018) (0)
- Hallucinating system outputs for discriminative language modeling (2012) (0)
- Characterizing the Details of Spatial Construction: Cognitive Constraints and Variability (2022) (0)
- Deriving conversation-based features from unlabeled speech for discriminative language modeling (2012) (0)
- Incremental Lattice Determinization for WFST Decoders (2019) (0)
- Chunking Defense for Adversarial Attacks on ASR (2022) (0)
- A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data (2022) (0)
- On the minimization of concave information functionals for unsupervised classification via decision trees (2008) (0)
- Explorer Continuous space discriminative language modeling (0)
- FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS (2018) (0)
- LATTICE-RESCORING ALGORITHM FOR AUTOMATIC SPEECH RECOGNITION (2017) (0)
- The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection (2018) (0)
- Building Keyword Search System from End-To-End Asr Systems (2023) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Sanjeev P. Khudanpur?
Sanjeev P. Khudanpur is affiliated with the following schools: