Chin‐hui Lee

Chin‐hui Lee's AcademicInfluence.com Rankings

Chin‐hui Lee

Engineering

#2706

World Rank

#3676

Historical Rank

Applied Physics

#380

World Rank

#396

Historical Rank

Electrical Engineering

#522

World Rank

#576

Historical Rank

engineering Degrees

Download Badge

Engineering

Chin‐hui Lee's Degrees

PhD Electrical Engineering University of Southern California
Masters Electrical Engineering University of Southern California
Bachelors Electrical Engineering National Taiwan University

Why Is Chin‐hui Lee Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Chin‐hui Lee's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains (1994) (2573)
A Regression Approach to Speech Enhancement Based on Deep Neural Networks (2015) (1039)
An Experimental Study on Speech Enhancement Based on Deep Neural Networks (2014) (768)
Minimum classification error rate methods for speech recognition (1997) (738)
Automatic recognition of keywords in unconstrained speech using hidden Markov models (1990) (451)
A maximum-likelihood approach to stochastic matching for robust speech recognition (1996) (417)
A study on speaker adaptation of the parameters of continuous density hidden Markov models (1991) (320)
Automatic Speech and Speaker Recognition: Advanced Topics (1999) (278)
A Vector Space Modeling Approach to Spoken Language Identification (2007) (255)
The use of cohort normalized scores for speaker verification (1992) (253)
Evaluation of sliding window correlation performance for characterizing dynamic functional connectivity and brain states (2016) (205)
Developments and directions in speech recognition and understanding, Part 1 [DSP Education] (2009) (192)
A structural Bayes approach to speaker adaptation (2001) (181)
Acoustic modeling for large vocabulary speech recognition (1990) (180)
Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition (1996) (179)
Segmental GPD training of HMM based speech recognizer (1992) (177)
Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method (1998) (158)
Discriminative utterance verification for connected digits recognition (1995) (157)
Multiple-target deep learning for LSTM-RNN based speech enhancement (2017) (154)
On stochastic feature and model compensation approaches to robust speech recognition (1998) (152)
Cepstral channel normalization techniques for HMM-based speaker verification (1994) (149)
Maximum a posteriori linear regression for hidden Markov model adaptation (1999) (148)
Structural maximum a posteriori linear regression for fast HMM adaptation (2002) (144)
On adaptive decision rules and decision parameter adaptation for automatic speech recognition (2000) (143)
A segment model based approach to speech recognition (1988) (136)
On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate (1997) (135)
A MFoM learning approach to robust multiclass multi-label text categorization (2004) (131)
A frame-synchronous network search algorithm for connected word recognition (1989) (127)
Convolutional-Recurrent Neural Networks for Speech Enhancement (2018) (124)
On robust linear prediction of speech (1988) (123)
HMM clustering for connected word recognition (1989) (119)
A study on multilingual acoustic modeling for large vocabulary ASR (2009) (119)
Automatic Speech and Speaker Recognition (1996) (119)
Exploiting deep neural networks for detection-based speech recognition (2013) (118)
Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition (1995) (104)
A deep neural network approach to speech bandwidth expansion (2015) (104)
Robust speech recognition with speech enhanced deep neural networks (2014) (102)
A speech understanding system based on statistical representation of semantics (1992) (100)
Discriminative training of language models for speech recognition (2002) (99)
Minimum error rate training based on N-best string models (1993) (98)
Bayesian learning for hidden Markov model with Gaussian mixture state observation densities (1991) (94)
Connected word talker verification using whole word hidden Markov models (1991) (92)
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement (2017) (92)
A Bayesian predictive classification approach to robust speech recognition (1997) (88)
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data (2012) (86)
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation (2017) (84)
An Overview of Automatic Speech Recognition (1996) (82)
Flexible speech understanding based on combined key-phrase detection and verification (1998) (82)
MAP Estimation of Continuous Density HMM : Theory and Applications (1992) (82)
Sub-word unit talker verification using hidden Markov models (1990) (82)
An Adaptive Image Content Representation and Segmentation Approach to Automatic Image Annotation (2004) (81)
Bayesian Learning of Gaussian Mixture Densities for Hidden Markov Models (1991) (81)
Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems (2013) (81)
Dynamic noise aware training for speech enhancement based on deep neural networks (2014) (80)
A training procedure for verifying string hypotheses in continuous speech recognition (1995) (80)
Improved acoustic modeling for large vocabulary continuous speech recognition (1992) (80)
Structural MAP speaker adaptation using hierarchical priors (1997) (77)
Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition (2012) (77)
On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression (2020) (76)
Robust speech recognition based on stochastic matching (1995) (73)
A Minimax Classification Approach With Application To Robust Speech Recognition (1991) (73)
Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling (2016) (72)
Speaker verification using normalized log-likelihood score (1996) (71)
A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition (2009) (71)
A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks (2017) (71)
A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks (2016) (71)
An overview on automatic speech attribute transcription (ASAT) (2007) (71)
Joint training of front-end and back-end deep neural networks for robust speech recognition (2015) (70)
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education] (2009) (69)
The USTC-iFlytek System for CHiME-4 Challenge (2016) (69)
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement (2016) (68)
Speech recognition using weighted HMM and subspace projection approaches (1994) (67)
An artificial neural network approach to automatic speech processing (2014) (66)
Rapid adaptation for deep neural networks through multi-task learning (2015) (66)
Densely Connected Progressive Learning for LSTM-Based Speech Enhancement (2018) (65)
Speech separation of a target speaker based on deep neural networks (2014) (64)
Improvements in connected digit recognition using higher order spectral and energy features (1991) (64)
Discriminative training of natural language call routers (2003) (64)
Stochastic Representation of Conceptual Structure in the ATIS Task (1991) (64)
Utterance verification of keyword strings using word-based minimum verification error (WB-MVE) training (1996) (63)
Automatic verbal information verification for user authentication (2000) (61)
A study on minimum error discriminative training for speaker recognition (1995) (61)
Joint maximum a posteriori adaptation of transformation and HMM parameters (2001) (61)
Toward a detector-based universal phone recognizer (2008) (60)
A new hybrid algorithm for speech recognition based on HMM segmentation and learning vector quantization (1993) (58)
An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition (2013) (58)
A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition (2016) (58)
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers (2014) (57)
Universal attribute characterization of spoken languages for automatic spoken language recognition (2013) (57)
Approximate Test Risk Bound Minimization Through Soft Margin Estimation (2007) (56)
The segmentation of news video into story units (2002) (56)
Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition (1998) (55)
Word recognition using whole word and subword models (1989) (55)
Deep Learning–Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients (2018) (54)
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition (2020) (54)
Sign Transition Modeling and a Scalable Solution to Continuous Sign Language Recognition for Real-World Applications (2016) (54)
On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition (1996) (54)
A study on knowledge source integration for candidate rescoring in automatic speech recognition (2005) (54)
A Study on Music Genre Classification Based on Universal Acoustic Models (2006) (54)
Application of hidden Markov models for recognition of a limited set of words in unconstrained speech (1989) (53)
A maximal figure-of-merit learning approach to text categorization (2003) (53)
Speech recognition and utterance verification based on a generalized confidence score (2001) (51)
A Multi-Modal Approach to Story Segmentation for News Video (2003) (51)
Word juncture modeling using phonological rules for HMM-based continuous speech recognition (1990) (51)
DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech (2015) (51)
An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition (2017) (50)
Maximum a posteriori adaptation of network parameters in deep models (2015) (50)
Developments and Directions in Speech Recognition and Understanding , Part 1 T (49)
An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition (2009) (49)
Automatic Image Annotation through Multi-Topic Text Categorization (2006) (48)
A maximal figure-of-merit (MFoM)-learning approach to robust classifier design for text categorization (2006) (48)
Bayesian Adaptive Learning and Map Estimation of HMM (1996) (46)
Soft margin estimation of hidden Markov model parameters (2006) (45)
On the asymptotic statistical behavior of empirical cepstral coefficients (1993) (44)
A study on speaker adaptation of continuous density HMM parameters (1990) (44)
Towards knowledge-based features for HMM based large vocabulary automatic speech recognition (2002) (44)
Multilingual speech recognition with language identification (2002) (42)
Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement (2020) (42)
On designing and evaluating speech event detectors (2005) (42)
A Minimum Error Rate Pattern Recognition Approach to Speech Recognition (1994) (42)
Towards bottom-up continuous phone recognition (2007) (41)
Robust utterance verification for connected digits recognition (1995) (40)
Low-resource keyword search strategies for tamil (2015) (39)
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge (2018) (39)
Speech Enhancement Based on Teacher–Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition (2019) (38)
A universal VAD based on jointly trained deep neural networks (2015) (37)
Key-phrase detection and verification for flexible speech understanding (1996) (37)
Bayesian adaptation in speech recognition (1983) (37)
Nonlinear compensation for stochastic matching (1999) (36)
A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers (2014) (36)
A blind segmentation approach to acoustic event detection based on i-vector (2013) (36)
Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation (2020) (36)
Global variance equalization for improving deep neural network based speech enhancement (2014) (35)
Exploring universal attribute characterization of spoken languages for spoken language recognition (2009) (35)
Verbal information verification (1997) (35)
Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model (2013) (35)
An acoustic segment modeling approach to automatic language identification (2005) (35)
HIDDEN MARKOV MODEL ADAPTATION USING MAXIMUM A POSTERIORI LINEAR REGRESSION (1999) (34)
i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition (2016) (34)
A study on lattice rescoring with knowledge scores for automatic speech recognition (2006) (33)
A study on task-independent subword selection and modeling for speech recognition (1996) (33)
Deep neural network based speech separation for robust speech recognition (2014) (33)
High-Accuracy Phone Recognition By Combining High-Performance Lattice Generation and Knowledge Based Rescoring (2007) (33)
Verifying and correcting recognition string hypotheses using discriminative utterance verification (1997) (32)
Statistical and Discriminative Methods for Speech Recognition (1996) (31)
Robustness and discrimination oriented speech recognition using weighted HMM and subspace projection approaches (1991) (31)
Unsupervised adaptation using structural Bayes approach (1998) (31)
Improved Acoustic Modeling for Continuous Speech Recognition (1990) (30)
Discriminative utterance verification using minimum string verification error (MSVE) training (1996) (30)
Discriminative training in natural language call routing (2000) (30)
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition (2010) (29)
A Kernel Framework for Content-Based Artist Recommendation System in Music (2011) (29)
Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training (1996) (28)
Speaker recognition based on minimum error discriminative training (1994) (28)
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures (2018) (28)
Robust, real-time endpoint detector with energy normalization for ASR in adverse environments (2001) (28)
A new approach to utterance verification based on neighborhood information in model space (2003) (28)
Improved acoustic modeling for speaker independent large vocabulary continuous speech recognition (1991) (28)
On natural language call routing (2000) (28)
Acoustic modeling of subword units for speech recognition (1990) (28)
International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction (2010) (27)
A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks (2017) (27)
Boosting and combination of classifiers for natural language call routing systems (2003) (27)
Auto-induced semantic classes (2004) (27)
An automatic dialogue generation platform for personalized dialogue applications (2004) (26)
Minimax i-vector extractor for short duration speaker verification (2013) (26)
Large vocabulary speech recognition using subword units (1993) (26)
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection (2021) (25)
Progress Report on the Chronus System: ATIS Benchmark Results (1992) (25)
A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions (2018) (25)
String-based minimum verification error (SB-MVE) training for speech recognition (1997) (25)
The USTC-iFlytek systems for CHiME-5 Challenge (2018) (25)
Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation (2006) (25)
An ensemble classifier learning approach to ROC optimization (2006) (25)
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement (2019) (24)
Minimum Classification Error Training to Improve Isolated Chord Recognition (2009) (24)
Stochastic matching for robust speech recognition (1994) (24)
OASIS natural language call steering trial (2001) (24)
A hybrid HMM/DNN approach to keyword spotting of short words (2013) (24)
On Automatic Speech Recognition at the Dawn of the 21st Century (2003) (24)
Context dependent anti subword modeling for utterance verification (1998) (24)
Improved Bayesian learning of hidden Markov models for speaker adaptation (1997) (23)
A Hybrid Approach to Combining Conventional and Deep Learning Techniques for Single-Channel Speech Enhancement and Recognition (2018) (23)
Enhancing image annotation by integrating concept ontology and text-based bayesian learning model (2007) (23)
DESIGN PRINCIPLES AND TOOLS FOR MULTIMODAL DIALOG SYSTEMS (2000) (23)
Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models (2017) (23)
On the importance of modeling temporal information in music tag annotation (2009) (23)
Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression (2020) (23)
On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones (2017) (23)
Hermitian based Hidden Activation Functions for Adaptation of Hybrid HMM/ANN Models (2012) (22)
THE USTC-IFLYTEK SYSTEM FOR SOUND EVENT LOCALIZATION AND DETECTION OF DCASE2020 CHALLENGE Technical Report (2020) (22)
A hybrid algorithm for speaker adaptation using MAP transformation and adaptation (1997) (22)
Improved acoustic modeling with Bayesian learning (1992) (22)
A comparative study on system combination schemes for LVCSR (2010) (22)
Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments (2015) (21)
A detection-based approach to broadcast news video story segmentation (2009) (21)
Background model design for flexible and portable speaker verification systems (1999) (21)
Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees (2016) (21)
Statistical segmentation and word modeling techniques in isolated word recognition (1990) (21)
Combining key-phrase detection and subword-based verification for flexible speech understanding (1997) (20)
Feature space maximum a posteriori linear regression for adaptation of deep neural networks (2014) (20)
A Ridge Ensemble Empirical Mode Decomposition Approach to Clutter Rejection for Ultrasound Color Flow Imaging (2013) (20)
Cross-language transfer learning for deep neural network based speech enhancement (2014) (20)
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition (2013) (20)
A Two-Stage Approach to Device-Robust Acoustic Scene Classification (2020) (20)
An adaptive learning approach to music tempo and beat analysis (2004) (20)
An iterative mask estimation approach to deep learning based multi-channel speech recognition (2019) (20)
Unsupervised anchor shot detection using multi-modal spectral clustering (2008) (20)
A unified DNN approach to speaker-dependent simultaneous speech enhancement and speech separation in low SNR environments (2017) (20)
Discriminative training for call classification and routing (2002) (20)
Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition (2015) (20)
Recent advancements in automatic speaker authentication (1999) (19)
Enhanced Adversarial Strategically-Timed Attacks Against Deep Reinforcement Learning (2020) (19)
Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features (2017) (19)
Historical Development and Future Directions in Speech Recognition and Understanding (2007) (19)
Video segmentation using spatial and temporal statistical analysis method (2000) (19)
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks (2018) (19)
USTC-NELSLIP System Description for DIHARD-III Challenge (2021) (18)
Selective feature extraction via signal decomposition (1997) (18)
A network-based frame-synchronous level building algorithm for connected word recognition (1988) (18)
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation (2017) (18)
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili (2016) (18)
A new decoder based on a generalized confidence score (1998) (18)
A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification (2005) (18)
A study of on-line Bayesian adaptation for HMM-based speech recognition (1993) (18)
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines (2011) (17)
A Probabilistic Framework for Representing Dialog Systems and Entropy-Based Dialog Management Through Dynamic Stochastic State Evolution (2015) (17)
Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition (2010) (17)
Automatic dialogue generator creates user defined applications (1999) (17)
Optimizing the Performance of Spoken Language Recognition With Discriminative Training (2008) (17)
A study on detection based automatic speech recognition (2006) (17)
On frequency dependencies of sliding window correlation (2015) (17)
A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search (2014) (17)
A Cross-Entropy-Guided Measure (CEGM) for Assessing Speech Recognition Performance and Optimizing DNN-Based Speech Enhancement (2021) (17)
Speech technology integration and research platform: a system study (1997) (17)
Approximate Test Risk Minimization Through Soft Margin Estimation (2007) (17)
A new connected word recognition algorithm based on HMM/LVQ segmentation and LVQ classification (1991) (17)
Explicit Performance Metric Optimization for Fusion-Based Video Retrieval (2012) (17)
Metrics for measuring domain independence of semantic classes (2001) (17)
A comparison of four metrics for auto-inducing semantic classes (2001) (17)
An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition (2010) (16)
A hidden Markov model based approach to music segmentation and identification (2003) (16)
Robust speech recognition based on adaptive classification and decision strategies (2000) (16)
Decision tree based tone modeling with corrective feedbacks for automatic Mandarin tone assessment (2010) (16)
A study on model-based error rate estimation for automatic speech recognition (2003) (16)
Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network (2020) (16)
A study on word detector design and knowledge-based pruning and rescoring (2007) (16)
Hierarchical class n-gram language models: towards better estimation of unseen events in speech recognition (2003) (15)
Robust linear prediction for speech analysis (1987) (15)
L-Vector: Neural Label Embedding for Domain Adaptation (2020) (15)
Transformation-based Bayesian prediction for adaptation of HMMs (2000) (15)
Hierarchical stochastic feature matching for robust speech recognition (2001) (15)
Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement (2019) (15)
A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models (2019) (15)
Combined on-line model adaptation and Bayesian predictive classification for robust speech recognition (1997) (15)
A Survey on Automatic Speech Recognition with an Illustrative Example on Continuous Speech Recognition of Mandarin (1996) (15)
Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition (2014) (15)
Parametric Dependencies of Sliding Window Correlation (2018) (15)
Structural maximum a-posteriori linear regression for unsupervised speaker adaptation (2000) (15)
A study on soft margin estimation for LVCSR (2007) (15)
Tweet Normalization with Syllables (2015) (15)
Consumer-level multimedia event detection through unsupervised audio signal modeling (2012) (14)
Experiments in automatic talker verification using sub-word unit hidden Markov models (1990) (14)
An SNR-incremental stochastic matching algorithm for noisy speech recognition (2001) (14)
On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition (1996) (14)
Speech Science and Technology. (1993) (14)
Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition (1996) (14)
A study on separation between acoustic models and its applications (2005) (14)
An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech (2017) (14)
A study of prior sensitivity for Bayesian predictive classification based robust speech recognition (1998) (14)
Applying a Speaker-Dependent Speech Compression Technique to Concatenative TTS Synthesizers (2007) (14)
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification (2021) (14)
Image region annotation based on segmentation and semantic correlation analysis (2018) (14)
An algorithm of high resolution and efficient multiple string hypothesization for continuous speech recognition using inter-word models (1994) (14)
An integrated approach to feature compensation combining particle filters and hidden Markov models for robust speech recognition (2012) (14)
A study on target feature activation and normalization and their impacts on the performance of DNN based speech dereverberation systems (2016) (14)
Principles of Spoken Language Recognition (2008) (13)
A Speaker-Dependent Approach to Separation of Far-Field Multi-Talker Microphone Array Speech for Front-End Processing in the CHiME-5 Challenge (2019) (13)
An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework (2015) (13)
Detection-based accented speech recognition using articulatory features (2011) (13)
An ensemble modeling approach to joint characterization of speaker and speaking environments (2007) (13)
Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition (2017) (13)
Unsupervised single-channel speech separation via deep neural network for different gender mixtures (2016) (13)
Introducing attribute features to foreign accent recognition (2014) (13)
A priori threshold selection for fixed vocabulary speaker verification systems (2000) (12)
Combination of boosting and discriminative training for natural language call steering systems (2002) (12)
Ensemble speaker and speaking environment modeling approach with advanced online estimation process (2009) (12)
A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers (2020) (12)
The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results (2022) (12)
Boosting of Maximal Figure of Merit Classifiers for Automatic Image Annotation (2007) (12)
A phonetic feature based lattice rescoring approach to LVCSR (2009) (12)
Joint maximum a posteriori estimation of transformation and hidden Markov model parameters (2000) (12)
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models (2019) (12)
A keyword-aware grammar framework for LVCSR-based spoken keyword search (2015) (12)
A portability study on natural language call steering (2001) (12)
Soft margin feature extraction for automatic speech recognition (2007) (11)
LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement (2017) (11)
Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training (2019) (11)
A Study on Attribute-Based Taxonomy for Music Information Retrieval (2007) (11)
Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion (2006) (11)
A study on robust utterance verification for connected digits recognition (1997) (11)
Implementation Aspects of Large Vocabulary Recognition Based on Intraword and Interword Phonetic Units (1990) (11)
Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition (2013) (11)
Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition (2021) (11)
Minimum error rate training for PHMM-based text recognition (1999) (11)
Evaluating the Aurora connected digit recognition task - a bell labs approach (2001) (11)
A transfer learning and progressive stacking approach to reducing deep model sizes with an application to speech enhancement (2017) (11)
Progressive Multi-Target Network Based Speech Enhancement with Snr-Preselection for Robust Speaker Diarization (2020) (11)
Deep learning vector quantization for acoustic information retrieval (2014) (11)
A data selection strategy for utterance verification in continuous speech recognition (2001) (11)
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification (2020) (10)
Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition (2007) (10)
A hierarchical approach to story segmentation of large broadcast news video corpus (2004) (10)
A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling (2014) (10)
Backoff hierarchical class n-gram language modelling for automatic speech recognition systems (2002) (10)
Predictive adaptation and compensation for robust speech recognition (1998) (10)
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement (2020) (10)
GENIE TRECVID 2011 Multimedia Event Detection: Late-Fusion Approaches to Combine Multiple Audio-Visual features (2011) (10)
Cluster-based analysis for characterizing dynamic functional connectivity (2014) (10)
Factorization of Language Constraints in Speech Recognition (1991) (10)
Iterative noise and channel estimation under the stochastic matching algorithm framework (1997) (10)
Acoustic Modeling of Subword Units for Large Vocabulary Speaker Independent Speech Recognition (1989) (10)
On the use of a family of signal limiters for recognition of noisy speech (1993) (10)
High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification (2020) (9)
A vocabulary independent discriminatively trained method for rejection of non-keywords in sub word based speech recognition (1995) (9)
Speaker Independent Continuous Speech Recognition Using Continuous Density Hidden Markov Models (1992) (9)
Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation (2017) (9)
LASSO model adaptation for automatic speech recognition (2011) (9)
A Flexible Classifier Design Framework Based on Multiobjective Programming (2008) (9)
A dynamic in-search discriminative training approach for large vocabulary speech recognition (2002) (9)
Matching for Robust Speech Rec (1996) (9)
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification (2021) (9)
Speaking-style dependent lexicalized filler model for key-phrase detection and verification (1997) (9)
A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation (2017) (9)
Structural Bayesian language modeling and adaptation (2007) (9)
Shrinkage model adaptation in automatic speech recognition (2010) (9)
Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement (2020) (8)
An unsupervised learning approach to musical event detection (2004) (8)
An efficient gradient computation approach to discriminative fusion optimization in semantic concept detection (2008) (8)
A study of on-line quasi-Bayes adaptation for CDHMM-based speech recognition (1996) (8)
An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement (2016) (8)
An experimental study on discriminative concept classifier combination for TRECVID high-level feature extraction (2008) (8)
A study on cross-language knowledge integration in Mandarin LVCSR (2012) (8)
Detection of repetitions in spontaneous speech in dialogue sessions (2008) (8)
Bayesian learning of the SCHMM parameters for speech recognition (1994) (8)
The USTC-NELSLIP Systems for CHiME-6 Challenge (2020) (8)
On the use of inter-word context-dependent units for word juncture modeling (1992) (8)
A text categorization approach to automatic language identification (2005) (8)
Upper and lower bounds on the mean of noisy speech: application to minimax classification (2002) (8)
Scenario-Dependent Speaker Diarization for DIHARD-III Challenge (2021) (8)
Attribute based lattice rescoring in spontaneous speech recognition (2014) (8)
Automatic image region annotation through segmentation based visual semantic analysis and discriminative classification (2016) (8)
Language-resource independent speech segmentation using cues from a spectrogram image (2015) (8)
Speech recognition under additive noise (1984) (7)
Experimental studies on continuous speech recognition using neural architectures with “adaptive” hidden activation functions (2010) (7)
TRECVID 2012 GENIE: Multimedia Event Detection and Recounting (2012) (7)
Continuous phone recognition without target language training data (2008) (7)
Dialect levelling in Finnish: a universal speech attribute approach (2014) (7)
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling (2014) (7)
An experimental study on joint modeling of mixed-bandwidth data via deep neural networks for robust speech recognition (2016) (7)
Fusion of Region and Image-Based Techniques for Automatic Image Annotation (2007) (7)
A new hybrid decoding algorithm for speech recognition and utterance verification (1997) (7)
Automatic speech recognition of small vocabularies within the context of unconstrained input (1988) (7)
A lasso based ensemble empirical mode decomposition approach to designing adaptive clutter suppression filters (2012) (7)
A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech (2018) (7)
Context‐dependent acoustic subword modeling for connected digit recognition (1993) (7)
A language for creating speech applications (1998) (7)
DNN Training Based on Classic Gain Function for Single-channel Speech Enhancement and Recognition (2019) (7)
A penalized logistic regression approach to detection based phone classification (2008) (7)
Connected digit recognition based on improved acoustic resolution (1993) (7)
Bayesian learning of the parameters of discrete and tied mixture HMMs for speech recognition (1993) (7)
Discriminative learning for optimizing detection performance in spoken language recognition (2008) (7)
A study on soft margin estimation of linear regression parameters for speaker adaptation (2009) (7)
On a generalization of margin-based discriminative training to robust speech recognition (2008) (7)
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning (2018) (6)
A real-time Japanese broadcast news closed-captioning system (2001) (6)
Simplifying design specification for automatic training of robust natural language call router (2001) (6)
Bayesian affine transformation of HMM parameters for instantaneous and supervised adaptation in telephone speech recognition (1997) (6)
An incremental learning framework combining sample confidence and discrimination with an application to automatic image annotation (2009) (6)
An entropy minimization framework for goal-driven dialogue management (2015) (6)
Improved training procedures for hidden Markov models (1988) (6)
Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR (2018) (6)
Natural language call routing: towards combination and boosting of classifiers (2001) (6)
Extended maximum a posterior linear regression (EMAPLR) model adaptation for speech recognition (2000) (6)
Adaptive compensation for robust speech recognition (1997) (6)
A unified speaker-dependent speech separation and enhancement system based on deep neural networks (2015) (6)
A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors (2014) (6)
Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech (2018) (6)
A unified deep modeling approach to simultaneous speech dereverberation and recognition for the reverb challenge (2017) (6)
A MODEL ENSEMBLE APPROACH FOR AUDIO-VISUAL SCENE CLASSIFICATION Technical Report (2021) (6)
Soft margin estimation with various separation levels for LVCSR (2008) (6)
Automatic Application Generator Matches User Expectations to System Capabilities (2000) (6)
Segmental quasi-Bayesian learning of the mixture coefficients in SCHMM for speech recognition (1994) (5)
A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives (2010) (5)
A Regularized Maximum Figure-of-Merit (rMFoM) Approach to Supervised and Semi-Supervised Learning (2011) (5)
An Efficient Gradient-based Approach to Optimizing Average Precision Through Maximal Figure-of-Merit Learning (2014) (5)
A study on hidden Markov model's generalization capability for speech recognition (2009) (5)
Enhancing model-based skin color detection: From low-level RGB features to high-level discriminative binary-class features (2012) (5)
A Progressive Deep Learning Approach to Child Speech Separation (2018) (5)
Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention (2020) (5)
Speaker verification based on combining speaker individuality parameter selection and decision (2005) (5)
Word juncture modeling using inter-word context-dependent phone-like units (1991) (5)
Dialogue session: management using voiceXML (2001) (5)
Phrase language models for detection and verification-based speech understanding (1997) (5)
Weighted graph based decision tree optimization for high accuracy acoustic modeling (2002) (5)
Statistical Analysis of Musical Instruments (2002) (5)
A resource-dependent approach to word modeling for keyword spotting (2013) (5)
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations (2016) (5)
An Improved Parametric Relaxation Approach to Blood Flow Signal Estimation with Single-Ensemble Samples in Color Flow Imaging (2013) (5)
Systems, methods and articles of manufacture for improving recognition confidence in hypothesized keywords (1998) (5)
Online whole-word and stroke-based modeling for hand-written letter recognition in in-car environments (2013) (5)
Title On-line adaptive learning of the correlated continuous densityhidden Markov models for speech recognition (1998) (5)
Vector-Based Spoken Language Classification (2008) (4)
Complexity reduction in a large vocabulary speech recognizer (1991) (4)
A vector space approach to environment modeling for robust speech recognition (2006) (4)
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis (2022) (4)
Speaker independent recognition of spontaneously spoken connected digits (1991) (4)
Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR (2018) (4)
Multiple time resolution analysis of speech signal using MCE training with application to speech recognition (2009) (4)
A particle filter feature compensation approach to robust speech recognition (2010) (4)
An Iterative Constrained Optimization Approach to Classifier Design (2006) (4)
An attribute detection based approach to automatic speech processing (2014) (4)
Optimization of average precision with Maximal Figure-of-Merit Learning (2011) (4)
Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression (2020) (4)
Acoustic Modeling for Multi-Array Conversational Speech Recognition in the Chime-6 Challenge (2021) (4)
On the use of some robust modeling techniques for speech recognition (1989) (4)
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition (2016) (4)
Information fusion techniques for automatic image annotation (2007) (4)
A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition (2016) (4)
Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process (2008) (4)
Subword-based large-vocabulary speech recognition (1993) (4)
On project-based learning through the vertically-integrated projects program (2011) (4)
A discriminative decision tree learning approach to acoustic modeling (2003) (4)
A hierarchical grid feature representation framework for automatic image annotation (2009) (3)
Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models (2018) (3)
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling (2009) (3)
A Maximum Likelihood Approach to Multi-Objective Learning Using Generalized Gaussian Distributions for Dnn-Based Speech Enhancement (2020) (3)
On generating mixing noise signals with basis functions for simulating noisy speech and learning dnn-based speech enhancement models (2017) (3)
A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition (2021) (3)
Indexing with musical events and its application to content-based music identification (2004) (3)
Applications of dynamic programming to speech and language processing (1989) (3)
A Keyword-Aware Language Modeling Approach to Spoken Keyword Search (2016) (3)
A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions (2020) (3)
A Two-stage Single-channel Speaker-dependent Speech Separation Approach for Chime-5 Challenge (2019) (3)
Maximum likelihood learning of auditory feature maps for stationary vowels (1996) (3)
A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement (2020) (3)
From decoding-driven to detection-based paradigms for automatic speech recognition (2004) (3)
A kernelized maximal-figure-of-merit learning approach based on subspace distance minimization (2011) (3)
Fundamentals Of Speaker And Utterance Verification With Applications (1997) (3)
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling (2009) (3)
Unsupervised, smooth training of feed-forward neural networks for mismatch compensation (1997) (3)
on Speech Recognition and Understanding , Part 2 (2009) (3)
Knowledge integration for improving performance in LVCSR (2013) (3)
Speech Recognition and Production by Machines (2015) (3)
Minimax classification with parametric neighborhoods for noisy speech recognition (2001) (3)
An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors (2007) (3)
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement (2018) (3)
Deep neural network based voice conversion with a large synthesized parallel corpus (2016) (3)
The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge (2022) (3)
2D-to-2D Mask Estimation for Speech Enhancement Based on Fully Convolutional Neural Network (2020) (3)
Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts (2016) (3)
A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition (2019) (3)
A Forward-Backward Subsequence Smoothing Based Eigen Approach to Clutter Rejection in Color Flow Imaging (2014) (3)
Some techniques for creating robust stochastic models for speech recognition (1987) (3)
Stochastic modeling in spoken dialogue system design (1994) (3)
A study on subword modeling for utterance verification in Mexican Spanish (1997) (3)
A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting (2012) (3)
A Study on Using Word-Level HMMs to Improve ASR Performance over State-of-the-Art Phone-Level Acoustic Modeling for LVCSR (2012) (3)
Generating alternative pronunciations from a dictionary (1999) (2)
A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning (2022) (2)
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances (2020) (2)
Information and Services Manager Customizes Dialogue-Based Applications (2000) (2)
Utterance verification based on neighborhood information and Bayes factors (2002) (2)
Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks (2016) (2)
High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification (2015) (2)
Speaker set identification through speaker group modeling (1992) (2)
An i-vector based descriptor for alphabetical gesture recognition (2014) (2)
On discriminative semi-supervised incremental learning with a multi-view perspective for image concept modeling (2012) (2)
New model-based HMM distances with applications to run-time ASR error estimation and model tuning (2003) (2)
Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge (2019) (2)
A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge (2020) (2)
A Multi-Objective Programming Approach to Compromising Classification Performance Metrics (2007) (2)
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis (2022) (2)
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries (2021) (2)
An mcmc approach to joint estimation of clean speech and noise for robust speech recognition (2013) (2)
Minimum verification error training for topic verification (2003) (2)
A study on sampling of STFT modifications in time and frequency domains for DNN-based speech dereverberation (2016) (2)
A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition (2019) (2)
Adaptive change point detection of dynamic functional connectivity networks (2016) (2)
Speech Emotion Recognition Based on Acoustic Segment Model (2021) (2)
Joint tracking of clean speech and noise using HMMs and particle filters for robust speech recognition (2012) (2)
AN EFFICIENT DECODING APPROACH FOR DIALOGUE SYSTEMS (2000) (2)
Adaptive change point detection of dynamic functional connectivity networks (2016) (2)
A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer (2021) (2)
Separation Guided Speaker Diarization in Realistic Mismatched Conditions (2021) (2)
An Experimental Study on Continuous Phone Recognition with Little or No Language-Specific Training Data (2008) (2)
A Model Ensemble Approach for Sound Event Localization and Detection (2021) (2)
A single-ensemble-based hybrid approach to clutter rejection combining bilinear Hankel with regression (2013) (2)
End-to-End Audio-Visual Neural Speaker Diarization (2022) (2)
Acoustic modeling of context dependent units, for large vocabulary speech recognition in Spanish (1995) (1)
Maximum Confidence Measure Based Interaural Phase Difference Estimation for Noise Masking in Dual-Microphone Robust Speech Recognition (2011) (1)
Speaker‐independent recognition of the DARPA Naval Resource Management Task (1989) (1)
Adaptive Learning in Acoustic and Language Modeling (1995) (1)
The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition (2023) (1)
Directions in automatic speech recognition (1995) (1)
Tunable keyword-aware language modeling and context dependent fillers for LVCSR-based spoken keyword search (2015) (1)
A PREFERENCE RANKING MODEL USING A DISCRIMINATIVELY-TRAINED CLASSIFIER (2008) (1)
Geometry Constrained Progressive Learning for Lstm-Based Speech Enhancement (2020) (1)
Transition features for CRF-based speech recognition and boundary detection (2009) (1)
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition (2022) (1)
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition (2017) (1)
Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions (2020) (1)
Towards a direct Bayesian adaptation framework for deep models (2016) (1)
KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement (2019) (1)
Guest Editorial: Special Issue on Machine Learning Methods in Signal Processing (2004) (1)
A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising (2019) (1)
An Efficient Structure for Continuous Speech Recognition (1992) (1)
Discriminative dynamic Gaussian mixture selection with enhanced robustness and performance for multi-accent speech recognition (2012) (1)
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments (2021) (1)
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification (2022) (1)
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network (2018) (1)
Model-based margin estimation for hidden Markov model learning and generalisation (2013) (1)
Improving Separation-Based Speaker Diarization Via Iterative Model Refinement And Speaker Embedding Based Post-Processing (2022) (1)
A voice user interface demonstration system for mexican Spanish (1998) (1)
Speech Enhancement with Convolutional-Recurrent Networks (2018) (0)
Space-and-Speaker-Aware Acoustic Modeling with Effective Data Augmentation for Recognition of Multi-Array Conversational Speech (2022) (0)
Spoken Language Systems - Technical Challenges for Speech and Natural Language Processing (1999) (0)
A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech (2017) (0)
Automatic recognition of connected digit strings in a credit card authorization task (1990) (0)
USEOFGENERALIZEDPATTERNMODEL FOR VIDEOANNOTATION (2007) (0)
ROBUSTNESS AND DISCRIMINATION ORIENTED SPEECH REC USING WEIGHTED HMM AND SUBSPACE PROJECTION APPR (1990) (0)
ITR-(NHS+ASE) automatic speech attribute transcription (ASAT): (2011) (0)
A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement (2020) (0)
ACOUSTIC \IODELISG OF SLBtC'ORD UNITS FOR SPEECH RECOGNITIOS (1990) (0)
High‐resolution and efficient multiple‐string hypothesization using interword models (1993) (0)
Title A Bayesian predictive classification approach to robust speechrecognition (2000) (0)
Iterative Training Techniques for Phonetic Template Based Speech Recognition with a Speaker-Independent Phonetic Recognizer (2005) (0)
Deep Segment Model for Acoustic Scene Classification (2022) (0)
Media Annotation-Fusion of Region and Image-Based Techniques for Automatic Image Annotation (2006) (0)
A Keyword-Aware Language Modeling Approach to Spoken Keyword Search (2015) (0)
Nanyang Technological University Model-Based Noise Robust Speech Recognition (2012) (0)
Discovering knowledge in & extracting information from multimedia patterns (2004) (0)
Speech Enhancement Autoencoder with Hierarchical Latent Structure (2021) (0)
A new model using artificial intelligence to predict recurrence after surgical resection of stage I-II non-small cell lung cancer. (2021) (0)
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition (2022) (0)
Bandwidth expansion of speech based on wavelet transform modulus maxima vector mapping (2010) (0)
TclBLASR: an automatic speech recognition extension for tcl (2001) (0)
Minimum mistake ratio training for connected string model (1994) (0)
An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks (2013) (0)
A particle filter compensation approach to robust LVCSR (2013) (0)
QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization (2023) (0)
A Study on Subword eling for Utterance (1997) (0)
A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement (2021) (0)
Method and apparatus for speaker recognition through testing of oral information by means of forced decoding (1998) (0)
Unsupervised Speaker Adaptation for Phonetic Transcription Based Voice Dialing (2005) (0)
PHRASE LANGUAGE MODELS FOR SPEECH UNDERSTANDING DETECTION AND VERIFICATION-BASED (1997) (0)
Speech recognition using keywords and non-keywords-modeling (1990) (0)
ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding (2023) (0)
Voice identification for recognition of interconnected numeral (1996) (0)
Speaker Adaptation for Voice Dialing (2002) (0)
Speech Enhancement Based on Deep Neural Networks (2014) (0)
An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition (2022) (0)
A Study on Detection Based Autom (2006) (0)
Riemannian Stochastic Gradient Descent for Tensor-Train Recurrent Neural Networks (2018) (0)
Speaker verification method using group normalization scoring (1993) (0)
CCPR 2008 Keynote Speech 2 (2008) (0)
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition (2016) (0)
Automatic Speech Recognition by Machines (2021) (0)
Proceedings - 2010 IEEE International Symposium on Multimedia, ISM 2010: Message from the conference co-chairs (2010) (0)
Developments and Directions in Speech Recognition and Understanding , Part 1 Citation (2009) (0)
Title Bayesian adaptive learning of the parameters of hidden Markovmodel for speech recognition (2004) (0)
Distinctive verification of statements for the recognition of connected digits (1996) (0)
The USTC-iFlytek System for the First DIHARD Challenge (2018) (0)
Use of Generalized Pattern Model for Video Annotation (2007) (0)
Correction to "An SNR-incremental stochastic matching algorithm for noisy speech recognition" (2002) (0)
Speech and audio processing for multimedia communications (1997) (0)
Per-Exemplar Fusion Learning for Video Retrieval and Recounting (2012) (0)
An Information-Extraction Approach to Speech Analysis and Processing (2012) (0)
A survey on recent progress in the ASAT/SIRKUS paradigm (2010) (0)
Soft Margin Estimation of Hidde (2006) (0)
Feature and model compensation for robust speech recognition (1996) (0)
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning (2017) (0)
Keyword Recognition and Correction Based on Utterance Verification and Knowledge Integration of Acoustic-Phonetic Features (2018) (0)
Using Paralinguistic Information to Disambiguate User Intentions for Distinguishing Phrase Structure and Sarcasm in Spoken Dialog Systems (2021) (0)
Iterative Constrained Optimization for Flexible Classifier Design With Multiple Competing Objectives (2007) (0)
Keynote speech 1: An integrated deep learning approach to acoustic signal pre-processing and acoustic modeling with applications to robust automatic speech recognition (2017) (0)

This paper list is powered by the following services:

Chin‐hui Lee's Academic­Influence.com Rankings

Chin‐hui Lee's Degrees

Why Is Chin‐hui Lee Influential?

Chin‐hui Lee's Published Works

Published Works

Chin‐hui Lee's AcademicInfluence.com Rankings