Chin‐hui Lee
#102,551
Most Influential Person Now
Chin‐hui Lee's AcademicInfluence.com Rankings
Chin‐hui Leeengineering Degrees
Engineering
#2706
World Rank
#3676
Historical Rank
Applied Physics
#380
World Rank
#396
Historical Rank
Electrical Engineering
#522
World Rank
#576
Historical Rank
Download Badge
Engineering
Chin‐hui Lee's Degrees
- PhD Electrical Engineering University of Southern California
- Masters Electrical Engineering University of Southern California
- Bachelors Electrical Engineering National Taiwan University
Why Is Chin‐hui Lee Influential?
(Suggest an Edit or Addition)Chin‐hui Lee's Published Works
Published Works
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains (1994) (2573)
- A Regression Approach to Speech Enhancement Based on Deep Neural Networks (2015) (1039)
- An Experimental Study on Speech Enhancement Based on Deep Neural Networks (2014) (768)
- Minimum classification error rate methods for speech recognition (1997) (738)
- Automatic recognition of keywords in unconstrained speech using hidden Markov models (1990) (451)
- A maximum-likelihood approach to stochastic matching for robust speech recognition (1996) (417)
- A study on speaker adaptation of the parameters of continuous density hidden Markov models (1991) (320)
- Automatic Speech and Speaker Recognition: Advanced Topics (1999) (278)
- A Vector Space Modeling Approach to Spoken Language Identification (2007) (255)
- The use of cohort normalized scores for speaker verification (1992) (253)
- Evaluation of sliding window correlation performance for characterizing dynamic functional connectivity and brain states (2016) (205)
- Developments and directions in speech recognition and understanding, Part 1 [DSP Education] (2009) (192)
- A structural Bayes approach to speaker adaptation (2001) (181)
- Acoustic modeling for large vocabulary speech recognition (1990) (180)
- Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition (1996) (179)
- Segmental GPD training of HMM based speech recognizer (1992) (177)
- Pattern recognition using a family of design algorithms based upon the generalized probabilistic descent method (1998) (158)
- Discriminative utterance verification for connected digits recognition (1995) (157)
- Multiple-target deep learning for LSTM-RNN based speech enhancement (2017) (154)
- On stochastic feature and model compensation approaches to robust speech recognition (1998) (152)
- Cepstral channel normalization techniques for HMM-based speaker verification (1994) (149)
- Maximum a posteriori linear regression for hidden Markov model adaptation (1999) (148)
- Structural maximum a posteriori linear regression for fast HMM adaptation (2002) (144)
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition (2000) (143)
- A segment model based approach to speech recognition (1988) (136)
- On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate (1997) (135)
- A MFoM learning approach to robust multiclass multi-label text categorization (2004) (131)
- A frame-synchronous network search algorithm for connected word recognition (1989) (127)
- Convolutional-Recurrent Neural Networks for Speech Enhancement (2018) (124)
- On robust linear prediction of speech (1988) (123)
- HMM clustering for connected word recognition (1989) (119)
- A study on multilingual acoustic modeling for large vocabulary ASR (2009) (119)
- Automatic Speech and Speaker Recognition (1996) (119)
- Exploiting deep neural networks for detection-based speech recognition (2013) (118)
- Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition (1995) (104)
- A deep neural network approach to speech bandwidth expansion (2015) (104)
- Robust speech recognition with speech enhanced deep neural networks (2014) (102)
- A speech understanding system based on statistical representation of semantics (1992) (100)
- Discriminative training of language models for speech recognition (2002) (99)
- Minimum error rate training based on N-best string models (1993) (98)
- Bayesian learning for hidden Markov model with Gaussian mixture state observation densities (1991) (94)
- Connected word talker verification using whole word hidden Markov models (1991) (92)
- Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement (2017) (92)
- A Bayesian predictive classification approach to robust speech recognition (1997) (88)
- Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data (2012) (86)
- A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation (2017) (84)
- An Overview of Automatic Speech Recognition (1996) (82)
- Flexible speech understanding based on combined key-phrase detection and verification (1998) (82)
- MAP Estimation of Continuous Density HMM : Theory and Applications (1992) (82)
- Sub-word unit talker verification using hidden Markov models (1990) (82)
- An Adaptive Image Content Representation and Segmentation Approach to Automatic Image Annotation (2004) (81)
- Bayesian Learning of Gaussian Mixture Densities for Hidden Markov Models (1991) (81)
- Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems (2013) (81)
- Dynamic noise aware training for speech enhancement based on deep neural networks (2014) (80)
- A training procedure for verifying string hypotheses in continuous speech recognition (1995) (80)
- Improved acoustic modeling for large vocabulary continuous speech recognition (1992) (80)
- Structural MAP speaker adaptation using hierarchical priors (1997) (77)
- Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition (2012) (77)
- On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression (2020) (76)
- Robust speech recognition based on stochastic matching (1995) (73)
- A Minimax Classification Approach With Application To Robust Speech Recognition (1991) (73)
- Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling (2016) (72)
- Speaker verification using normalized log-likelihood score (1996) (71)
- A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition (2009) (71)
- A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks (2017) (71)
- A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks (2016) (71)
- An overview on automatic speech attribute transcription (ASAT) (2007) (71)
- Joint training of front-end and back-end deep neural networks for robust speech recognition (2015) (70)
- Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education] (2009) (69)
- The USTC-iFlytek System for CHiME-4 Challenge (2016) (69)
- SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement (2016) (68)
- Speech recognition using weighted HMM and subspace projection approaches (1994) (67)
- An artificial neural network approach to automatic speech processing (2014) (66)
- Rapid adaptation for deep neural networks through multi-task learning (2015) (66)
- Densely Connected Progressive Learning for LSTM-Based Speech Enhancement (2018) (65)
- Speech separation of a target speaker based on deep neural networks (2014) (64)
- Improvements in connected digit recognition using higher order spectral and energy features (1991) (64)
- Discriminative training of natural language call routers (2003) (64)
- Stochastic Representation of Conceptual Structure in the ATIS Task (1991) (64)
- Utterance verification of keyword strings using word-based minimum verification error (WB-MVE) training (1996) (63)
- Automatic verbal information verification for user authentication (2000) (61)
- A study on minimum error discriminative training for speaker recognition (1995) (61)
- Joint maximum a posteriori adaptation of transformation and HMM parameters (2001) (61)
- Toward a detector-based universal phone recognizer (2008) (60)
- A new hybrid algorithm for speech recognition based on HMM segmentation and learning vector quantization (1993) (58)
- An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition (2013) (58)
- A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition (2016) (58)
- Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers (2014) (57)
- Universal attribute characterization of spoken languages for automatic spoken language recognition (2013) (57)
- Approximate Test Risk Bound Minimization Through Soft Margin Estimation (2007) (56)
- The segmentation of news video into story units (2002) (56)
- Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition (1998) (55)
- Word recognition using whole word and subword models (1989) (55)
- Deep Learning–Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients (2018) (54)
- Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition (2020) (54)
- Sign Transition Modeling and a Scalable Solution to Continuous Sign Language Recognition for Real-World Applications (2016) (54)
- On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition (1996) (54)
- A study on knowledge source integration for candidate rescoring in automatic speech recognition (2005) (54)
- A Study on Music Genre Classification Based on Universal Acoustic Models (2006) (54)
- Application of hidden Markov models for recognition of a limited set of words in unconstrained speech (1989) (53)
- A maximal figure-of-merit learning approach to text categorization (2003) (53)
- Speech recognition and utterance verification based on a generalized confidence score (2001) (51)
- A Multi-Modal Approach to Story Segmentation for News Video (2003) (51)
- Word juncture modeling using phonological rules for HMM-based continuous speech recognition (1990) (51)
- DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech (2015) (51)
- An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition (2017) (50)
- Maximum a posteriori adaptation of network parameters in deep models (2015) (50)
- Developments and Directions in Speech Recognition and Understanding , Part 1 T (49)
- An Ensemble Speaker and Speaking Environment Modeling Approach to Robust Speech Recognition (2009) (49)
- Automatic Image Annotation through Multi-Topic Text Categorization (2006) (48)
- A maximal figure-of-merit (MFoM)-learning approach to robust classifier design for text categorization (2006) (48)
- Bayesian Adaptive Learning and Map Estimation of HMM (1996) (46)
- Soft margin estimation of hidden Markov model parameters (2006) (45)
- On the asymptotic statistical behavior of empirical cepstral coefficients (1993) (44)
- A study on speaker adaptation of continuous density HMM parameters (1990) (44)
- Towards knowledge-based features for HMM based large vocabulary automatic speech recognition (2002) (44)
- Multilingual speech recognition with language identification (2002) (42)
- Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement (2020) (42)
- On designing and evaluating speech event detectors (2005) (42)
- A Minimum Error Rate Pattern Recognition Approach to Speech Recognition (1994) (42)
- Towards bottom-up continuous phone recognition (2007) (41)
- Robust utterance verification for connected digits recognition (1995) (40)
- Low-resource keyword search strategies for tamil (2015) (39)
- Speaker Diarization with Enhancing Speech for the First DIHARD Challenge (2018) (39)
- Speech Enhancement Based on Teacher–Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition (2019) (38)
- A universal VAD based on jointly trained deep neural networks (2015) (37)
- Key-phrase detection and verification for flexible speech understanding (1996) (37)
- Bayesian adaptation in speech recognition (1983) (37)
- Nonlinear compensation for stochastic matching (1999) (36)
- A maximal figure-of-merit learning approach to maximizing mean average precision with deep neural network based classifiers (2014) (36)
- A blind segmentation approach to acoustic event detection based on i-vector (2013) (36)
- Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation (2020) (36)
- Global variance equalization for improving deep neural network based speech enhancement (2014) (35)
- Exploring universal attribute characterization of spoken languages for spoken language recognition (2009) (35)
- Verbal information verification (1997) (35)
- Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model (2013) (35)
- An acoustic segment modeling approach to automatic language identification (2005) (35)
- HIDDEN MARKOV MODEL ADAPTATION USING MAXIMUM A POSTERIORI LINEAR REGRESSION (1999) (34)
- i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition (2016) (34)
- A study on lattice rescoring with knowledge scores for automatic speech recognition (2006) (33)
- A study on task-independent subword selection and modeling for speech recognition (1996) (33)
- Deep neural network based speech separation for robust speech recognition (2014) (33)
- High-Accuracy Phone Recognition By Combining High-Performance Lattice Generation and Knowledge Based Rescoring (2007) (33)
- Verifying and correcting recognition string hypotheses using discriminative utterance verification (1997) (32)
- Statistical and Discriminative Methods for Speech Recognition (1996) (31)
- Robustness and discrimination oriented speech recognition using weighted HMM and subspace projection approaches (1991) (31)
- Unsupervised adaptation using structural Bayes approach (1998) (31)
- Improved Acoustic Modeling for Continuous Speech Recognition (1990) (30)
- Discriminative utterance verification using minimum string verification error (MSVE) training (1996) (30)
- Discriminative training in natural language call routing (2000) (30)
- A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition (2010) (29)
- A Kernel Framework for Content-Based Artist Recommendation System in Music (2011) (29)
- Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training (1996) (28)
- Speaker recognition based on minimum error discriminative training (1994) (28)
- A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures (2018) (28)
- Robust, real-time endpoint detector with energy normalization for ASR in adverse environments (2001) (28)
- A new approach to utterance verification based on neighborhood information in model space (2003) (28)
- Improved acoustic modeling for speaker independent large vocabulary continuous speech recognition (1991) (28)
- On natural language call routing (2000) (28)
- Acoustic modeling of subword units for speech recognition (1990) (28)
- International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction (2010) (27)
- A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks (2017) (27)
- Boosting and combination of classifiers for natural language call routing systems (2003) (27)
- Auto-induced semantic classes (2004) (27)
- An automatic dialogue generation platform for personalized dialogue applications (2004) (26)
- Minimax i-vector extractor for short duration speaker verification (2013) (26)
- Large vocabulary speech recognition using subword units (1993) (26)
- A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection (2021) (25)
- Progress Report on the Chronus System: ATIS Benchmark Results (1992) (25)
- A Novel LSTM-Based Speech Preprocessor for Speaker Diarization in Realistic Mismatch Conditions (2018) (25)
- String-based minimum verification error (SB-MVE) training for speech recognition (1997) (25)
- The USTC-iFlytek systems for CHiME-5 Challenge (2018) (25)
- Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation (2006) (25)
- An ensemble classifier learning approach to ROC optimization (2006) (25)
- A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement (2019) (24)
- Minimum Classification Error Training to Improve Isolated Chord Recognition (2009) (24)
- Stochastic matching for robust speech recognition (1994) (24)
- OASIS natural language call steering trial (2001) (24)
- A hybrid HMM/DNN approach to keyword spotting of short words (2013) (24)
- On Automatic Speech Recognition at the Dawn of the 21st Century (2003) (24)
- Context dependent anti subword modeling for utterance verification (1998) (24)
- Improved Bayesian learning of hidden Markov models for speaker adaptation (1997) (23)
- A Hybrid Approach to Combining Conventional and Deep Learning Techniques for Single-Channel Speech Enhancement and Recognition (2018) (23)
- Enhancing image annotation by integrating concept ontology and text-based bayesian learning model (2007) (23)
- DESIGN PRINCIPLES AND TOOLS FOR MULTIMODAL DIALOG SYSTEMS (2000) (23)
- Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models (2017) (23)
- On the importance of modeling temporal information in music tag annotation (2009) (23)
- Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression (2020) (23)
- On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones (2017) (23)
- Hermitian based Hidden Activation Functions for Adaptation of Hybrid HMM/ANN Models (2012) (22)
- THE USTC-IFLYTEK SYSTEM FOR SOUND EVENT LOCALIZATION AND DETECTION OF DCASE2020 CHALLENGE Technical Report (2020) (22)
- A hybrid algorithm for speaker adaptation using MAP transformation and adaptation (1997) (22)
- Improved acoustic modeling with Bayesian learning (1992) (22)
- A comparative study on system combination schemes for LVCSR (2010) (22)
- Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments (2015) (21)
- A detection-based approach to broadcast news video story segmentation (2009) (21)
- Background model design for flexible and portable speaker verification systems (1999) (21)
- Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees (2016) (21)
- Statistical segmentation and word modeling techniques in isolated word recognition (1990) (21)
- Combining key-phrase detection and subword-based verification for flexible speech understanding (1997) (20)
- Feature space maximum a posteriori linear regression for adaptation of deep neural networks (2014) (20)
- A Ridge Ensemble Empirical Mode Decomposition Approach to Clutter Rejection for Ultrasound Color Flow Imaging (2013) (20)
- Cross-language transfer learning for deep neural network based speech enhancement (2014) (20)
- A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition (2013) (20)
- A Two-Stage Approach to Device-Robust Acoustic Scene Classification (2020) (20)
- An adaptive learning approach to music tempo and beat analysis (2004) (20)
- An iterative mask estimation approach to deep learning based multi-channel speech recognition (2019) (20)
- Unsupervised anchor shot detection using multi-modal spectral clustering (2008) (20)
- A unified DNN approach to speaker-dependent simultaneous speech enhancement and speech separation in low SNR environments (2017) (20)
- Discriminative training for call classification and routing (2002) (20)
- Speech Separation based on signal-noise-dependent deep neural networks for robust speech recognition (2015) (20)
- Recent advancements in automatic speaker authentication (1999) (19)
- Enhanced Adversarial Strategically-Timed Attacks Against Deep Reinforcement Learning (2020) (19)
- Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features (2017) (19)
- Historical Development and Future Directions in Speech Recognition and Understanding (2007) (19)
- Video segmentation using spatial and temporal statistical analysis method (2000) (19)
- Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks (2018) (19)
- USTC-NELSLIP System Description for DIHARD-III Challenge (2021) (18)
- Selective feature extraction via signal decomposition (1997) (18)
- A network-based frame-synchronous level building algorithm for connected word recognition (1988) (18)
- A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation (2017) (18)
- Exemplar-inspired strategies for low-resource spoken keyword search in Swahili (2016) (18)
- A new decoder based on a generalized confidence score (1998) (18)
- A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification (2005) (18)
- A study of on-line Bayesian adaptation for HMM-based speech recognition (1993) (18)
- A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines (2011) (17)
- A Probabilistic Framework for Representing Dialog Systems and Entropy-Based Dialog Management Through Dynamic Stochastic State Evolution (2015) (17)
- Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition (2010) (17)
- Automatic dialogue generator creates user defined applications (1999) (17)
- Optimizing the Performance of Spoken Language Recognition With Discriminative Training (2008) (17)
- A study on detection based automatic speech recognition (2006) (17)
- On frequency dependencies of sliding window correlation (2015) (17)
- A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search (2014) (17)
- A Cross-Entropy-Guided Measure (CEGM) for Assessing Speech Recognition Performance and Optimizing DNN-Based Speech Enhancement (2021) (17)
- Speech technology integration and research platform: a system study (1997) (17)
- Approximate Test Risk Minimization Through Soft Margin Estimation (2007) (17)
- A new connected word recognition algorithm based on HMM/LVQ segmentation and LVQ classification (1991) (17)
- Explicit Performance Metric Optimization for Fusion-Based Video Retrieval (2012) (17)
- Metrics for measuring domain independence of semantic classes (2001) (17)
- A comparison of four metrics for auto-inducing semantic classes (2001) (17)
- An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition (2010) (16)
- A hidden Markov model based approach to music segmentation and identification (2003) (16)
- Robust speech recognition based on adaptive classification and decision strategies (2000) (16)
- Decision tree based tone modeling with corrective feedbacks for automatic Mandarin tone assessment (2010) (16)
- A study on model-based error rate estimation for automatic speech recognition (2003) (16)
- Tensor-To-Vector Regression for Multi-Channel Speech Enhancement Based on Tensor-Train Network (2020) (16)
- A study on word detector design and knowledge-based pruning and rescoring (2007) (16)
- Hierarchical class n-gram language models: towards better estimation of unseen events in speech recognition (2003) (15)
- Robust linear prediction for speech analysis (1987) (15)
- L-Vector: Neural Label Embedding for Domain Adaptation (2020) (15)
- Transformation-based Bayesian prediction for adaptation of HMMs (2000) (15)
- Hierarchical stochastic feature matching for robust speech recognition (2001) (15)
- Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement (2019) (15)
- A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models (2019) (15)
- Combined on-line model adaptation and Bayesian predictive classification for robust speech recognition (1997) (15)
- A Survey on Automatic Speech Recognition with an Illustrative Example on Continuous Speech Recognition of Mandarin (1996) (15)
- Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition (2014) (15)
- Parametric Dependencies of Sliding Window Correlation (2018) (15)
- Structural maximum a-posteriori linear regression for unsupervised speaker adaptation (2000) (15)
- A study on soft margin estimation for LVCSR (2007) (15)
- Tweet Normalization with Syllables (2015) (15)
- Consumer-level multimedia event detection through unsupervised audio signal modeling (2012) (14)
- Experiments in automatic talker verification using sub-word unit hidden Markov models (1990) (14)
- An SNR-incremental stochastic matching algorithm for noisy speech recognition (2001) (14)
- On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition (1996) (14)
- Speech Science and Technology. (1993) (14)
- Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition (1996) (14)
- A study on separation between acoustic models and its applications (2005) (14)
- An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech (2017) (14)
- A study of prior sensitivity for Bayesian predictive classification based robust speech recognition (1998) (14)
- Applying a Speaker-Dependent Speech Compression Technique to Concatenative TTS Synthesizers (2007) (14)
- PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification (2021) (14)
- Image region annotation based on segmentation and semantic correlation analysis (2018) (14)
- An algorithm of high resolution and efficient multiple string hypothesization for continuous speech recognition using inter-word models (1994) (14)
- An integrated approach to feature compensation combining particle filters and hidden Markov models for robust speech recognition (2012) (14)
- A study on target feature activation and normalization and their impacts on the performance of DNN based speech dereverberation systems (2016) (14)
- Principles of Spoken Language Recognition (2008) (13)
- A Speaker-Dependent Approach to Separation of Far-Field Multi-Talker Microphone Array Speech for Front-End Processing in the CHiME-5 Challenge (2019) (13)
- An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework (2015) (13)
- Detection-based accented speech recognition using articulatory features (2011) (13)
- An ensemble modeling approach to joint characterization of speaker and speaking environments (2007) (13)
- Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition (2017) (13)
- Unsupervised single-channel speech separation via deep neural network for different gender mixtures (2016) (13)
- Introducing attribute features to foreign accent recognition (2014) (13)
- A priori threshold selection for fixed vocabulary speaker verification systems (2000) (12)
- Combination of boosting and discriminative training for natural language call steering systems (2002) (12)
- Ensemble speaker and speaking environment modeling approach with advanced online estimation process (2009) (12)
- A Cross-Task Transfer Learning Approach to Adapting Deep Speech Enhancement Models to Unseen Background Noise Using Paired Senone Classifiers (2020) (12)
- The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results (2022) (12)
- Boosting of Maximal Figure of Merit Classifiers for Automatic Image Annotation (2007) (12)
- A phonetic feature based lattice rescoring approach to LVCSR (2009) (12)
- Joint maximum a posteriori estimation of transformation and hidden Markov model parameters (2000) (12)
- Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models (2019) (12)
- A keyword-aware grammar framework for LVCSR-based spoken keyword search (2015) (12)
- A portability study on natural language call steering (2001) (12)
- Soft margin feature extraction for automatic speech recognition (2007) (11)
- LSTM-based iterative mask estimation and post-processing for multi-channel speech enhancement (2017) (11)
- Improving Audio-visual Speech Recognition Performance with Cross-modal Student-teacher Training (2019) (11)
- A Study on Attribute-Based Taxonomy for Music Information Retrieval (2007) (11)
- Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion (2006) (11)
- A study on robust utterance verification for connected digits recognition (1997) (11)
- Implementation Aspects of Large Vocabulary Recognition Based on Intraword and Interword Phonetic Units (1990) (11)
- Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition (2013) (11)
- Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition (2021) (11)
- Minimum error rate training for PHMM-based text recognition (1999) (11)
- Evaluating the Aurora connected digit recognition task - a bell labs approach (2001) (11)
- A transfer learning and progressive stacking approach to reducing deep model sizes with an application to speech enhancement (2017) (11)
- Progressive Multi-Target Network Based Speech Enhancement with Snr-Preselection for Robust Speaker Diarization (2020) (11)
- Deep learning vector quantization for acoustic information retrieval (2014) (11)
- A data selection strategy for utterance verification in continuous speech recognition (2001) (11)
- Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification (2020) (10)
- Two extensions to ensemble speaker and speaking environment modeling for robust automatic speech recognition (2007) (10)
- A hierarchical approach to story segmentation of large broadcast news video corpus (2004) (10)
- A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling (2014) (10)
- Backoff hierarchical class n-gram language modelling for automatic speech recognition systems (2002) (10)
- Predictive adaptation and compensation for robust speech recognition (1998) (10)
- Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement (2020) (10)
- GENIE TRECVID 2011 Multimedia Event Detection: Late-Fusion Approaches to Combine Multiple Audio-Visual features (2011) (10)
- Cluster-based analysis for characterizing dynamic functional connectivity (2014) (10)
- Factorization of Language Constraints in Speech Recognition (1991) (10)
- Iterative noise and channel estimation under the stochastic matching algorithm framework (1997) (10)
- Acoustic Modeling of Subword Units for Large Vocabulary Speaker Independent Speech Recognition (1989) (10)
- On the use of a family of signal limiters for recognition of noisy speech (1993) (10)
- High-Resolution Attention Network with Acoustic Segment Model for Acoustic Scene Classification (2020) (9)
- A vocabulary independent discriminatively trained method for rejection of non-keywords in sub word based speech recognition (1995) (9)
- Speaker Independent Continuous Speech Recognition Using Continuous Density Hidden Markov Models (1992) (9)
- Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation (2017) (9)
- LASSO model adaptation for automatic speech recognition (2011) (9)
- A Flexible Classifier Design Framework Based on Multiobjective Programming (2008) (9)
- A dynamic in-search discriminative training approach for large vocabulary speech recognition (2002) (9)
- Matching for Robust Speech Rec (1996) (9)
- A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification (2021) (9)
- Speaking-style dependent lexicalized filler model for key-phrase detection and verification (1997) (9)
- A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation (2017) (9)
- Structural Bayesian language modeling and adaptation (2007) (9)
- Shrinkage model adaptation in automatic speech recognition (2010) (9)
- Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement (2020) (8)
- An unsupervised learning approach to musical event detection (2004) (8)
- An efficient gradient computation approach to discriminative fusion optimization in semantic concept detection (2008) (8)
- A study of on-line quasi-Bayes adaptation for CDHMM-based speech recognition (1996) (8)
- An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement (2016) (8)
- An experimental study on discriminative concept classifier combination for TRECVID high-level feature extraction (2008) (8)
- A study on cross-language knowledge integration in Mandarin LVCSR (2012) (8)
- Detection of repetitions in spontaneous speech in dialogue sessions (2008) (8)
- Bayesian learning of the SCHMM parameters for speech recognition (1994) (8)
- The USTC-NELSLIP Systems for CHiME-6 Challenge (2020) (8)
- On the use of inter-word context-dependent units for word juncture modeling (1992) (8)
- A text categorization approach to automatic language identification (2005) (8)
- Upper and lower bounds on the mean of noisy speech: application to minimax classification (2002) (8)
- Scenario-Dependent Speaker Diarization for DIHARD-III Challenge (2021) (8)
- Attribute based lattice rescoring in spontaneous speech recognition (2014) (8)
- Automatic image region annotation through segmentation based visual semantic analysis and discriminative classification (2016) (8)
- Language-resource independent speech segmentation using cues from a spectrogram image (2015) (8)
- Speech recognition under additive noise (1984) (7)
- Experimental studies on continuous speech recognition using neural architectures with “adaptive” hidden activation functions (2010) (7)
- TRECVID 2012 GENIE: Multimedia Event Detection and Recounting (2012) (7)
- Continuous phone recognition without target language training data (2008) (7)
- Dialect levelling in Finnish: a universal speech attribute approach (2014) (7)
- A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling (2014) (7)
- An experimental study on joint modeling of mixed-bandwidth data via deep neural networks for robust speech recognition (2016) (7)
- Fusion of Region and Image-Based Techniques for Automatic Image Annotation (2007) (7)
- A new hybrid decoding algorithm for speech recognition and utterance verification (1997) (7)
- Automatic speech recognition of small vocabularies within the context of unconstrained input (1988) (7)
- A lasso based ensemble empirical mode decomposition approach to designing adaptive clutter suppression filters (2012) (7)
- A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech (2018) (7)
- Context‐dependent acoustic subword modeling for connected digit recognition (1993) (7)
- A language for creating speech applications (1998) (7)
- DNN Training Based on Classic Gain Function for Single-channel Speech Enhancement and Recognition (2019) (7)
- A penalized logistic regression approach to detection based phone classification (2008) (7)
- Connected digit recognition based on improved acoustic resolution (1993) (7)
- Bayesian learning of the parameters of discrete and tied mixture HMMs for speech recognition (1993) (7)
- Discriminative learning for optimizing detection performance in spoken language recognition (2008) (7)
- A study on soft margin estimation of linear regression parameters for speaker adaptation (2009) (7)
- On a generalization of margin-based discriminative training to robust speech recognition (2008) (7)
- Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning (2018) (6)
- A real-time Japanese broadcast news closed-captioning system (2001) (6)
- Simplifying design specification for automatic training of robust natural language call router (2001) (6)
- Bayesian affine transformation of HMM parameters for instantaneous and supervised adaptation in telephone speech recognition (1997) (6)
- An incremental learning framework combining sample confidence and discrimination with an application to automatic image annotation (2009) (6)
- An entropy minimization framework for goal-driven dialogue management (2015) (6)
- Improved training procedures for hidden Markov models (1988) (6)
- Acoustics-guided evaluation (AGE): a new measure for estimating performance of speech enhancement algorithms for robust ASR (2018) (6)
- Natural language call routing: towards combination and boosting of classifiers (2001) (6)
- Extended maximum a posterior linear regression (EMAPLR) model adaptation for speech recognition (2000) (6)
- Adaptive compensation for robust speech recognition (1997) (6)
- A unified speaker-dependent speech separation and enhancement system based on deep neural networks (2015) (6)
- A fusion approach to spoken language identification based on combining multiple phone recognizers and speech attribute detectors (2014) (6)
- Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech (2018) (6)
- A unified deep modeling approach to simultaneous speech dereverberation and recognition for the reverb challenge (2017) (6)
- A MODEL ENSEMBLE APPROACH FOR AUDIO-VISUAL SCENE CLASSIFICATION Technical Report (2021) (6)
- Soft margin estimation with various separation levels for LVCSR (2008) (6)
- Automatic Application Generator Matches User Expectations to System Capabilities (2000) (6)
- Segmental quasi-Bayesian learning of the mixture coefficients in SCHMM for speech recognition (1994) (5)
- A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives (2010) (5)
- A Regularized Maximum Figure-of-Merit (rMFoM) Approach to Supervised and Semi-Supervised Learning (2011) (5)
- An Efficient Gradient-based Approach to Optimizing Average Precision Through Maximal Figure-of-Merit Learning (2014) (5)
- A study on hidden Markov model's generalization capability for speech recognition (2009) (5)
- Enhancing model-based skin color detection: From low-level RGB features to high-level discriminative binary-class features (2012) (5)
- A Progressive Deep Learning Approach to Child Speech Separation (2018) (5)
- Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention (2020) (5)
- Speaker verification based on combining speaker individuality parameter selection and decision (2005) (5)
- Word juncture modeling using inter-word context-dependent phone-like units (1991) (5)
- Dialogue session: management using voiceXML (2001) (5)
- Phrase language models for detection and verification-based speech understanding (1997) (5)
- Weighted graph based decision tree optimization for high accuracy acoustic modeling (2002) (5)
- Statistical Analysis of Musical Instruments (2002) (5)
- A resource-dependent approach to word modeling for keyword spotting (2013) (5)
- Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations (2016) (5)
- An Improved Parametric Relaxation Approach to Blood Flow Signal Estimation with Single-Ensemble Samples in Color Flow Imaging (2013) (5)
- Systems, methods and articles of manufacture for improving recognition confidence in hypothesized keywords (1998) (5)
- Online whole-word and stroke-based modeling for hand-written letter recognition in in-car environments (2013) (5)
- Title On-line adaptive learning of the correlated continuous densityhidden Markov models for speech recognition (1998) (5)
- Vector-Based Spoken Language Classification (2008) (4)
- Complexity reduction in a large vocabulary speech recognizer (1991) (4)
- A vector space approach to environment modeling for robust speech recognition (2006) (4)
- Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis (2022) (4)
- Speaker independent recognition of spontaneously spoken connected digits (1991) (4)
- Online LSTM-based Iterative Mask Estimation for Multi-Channel Speech Enhancement and ASR (2018) (4)
- Multiple time resolution analysis of speech signal using MCE training with application to speech recognition (2009) (4)
- A particle filter feature compensation approach to robust speech recognition (2010) (4)
- An Iterative Constrained Optimization Approach to Classifier Design (2006) (4)
- An attribute detection based approach to automatic speech processing (2014) (4)
- Optimization of average precision with Maximal Figure-of-Merit Learning (2011) (4)
- Performance Analysis for Tensor-Train Decomposition to Deep Neural Network Based Vector-to-Vector Regression (2020) (4)
- Acoustic Modeling for Multi-Array Conversational Speech Recognition in the Chime-6 Challenge (2021) (4)
- On the use of some robust modeling techniques for speech recognition (1989) (4)
- Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition (2016) (4)
- Information fusion techniques for automatic image annotation (2007) (4)
- A speaker-dependent deep learning approach to joint speech separation and acoustic modeling for multi-talker automatic speech recognition (2016) (4)
- Improving the ensemble speaker and speaking environment modeling approach by enhancing the precision of the online estimation process (2008) (4)
- Subword-based large-vocabulary speech recognition (1993) (4)
- On project-based learning through the vertically-integrated projects program (2011) (4)
- A discriminative decision tree learning approach to acoustic modeling (2003) (4)
- A hierarchical grid feature representation framework for automatic image annotation (2009) (3)
- Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models (2018) (3)
- Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling (2009) (3)
- A Maximum Likelihood Approach to Multi-Objective Learning Using Generalized Gaussian Distributions for Dnn-Based Speech Enhancement (2020) (3)
- On generating mixing noise signals with basis functions for simulating noisy speech and learning dnn-based speech enhancement models (2017) (3)
- A Progressive Learning Approach to Adaptive Noise and Speech Estimation for Speech Enhancement and Noisy Speech Recognition (2021) (3)
- Indexing with musical events and its application to content-based music identification (2004) (3)
- Applications of dynamic programming to speech and language processing (1989) (3)
- A Keyword-Aware Language Modeling Approach to Spoken Keyword Search (2016) (3)
- A Study of Child Speech Extraction Using Joint Speech Enhancement and Separation in Realistic Conditions (2020) (3)
- A Two-stage Single-channel Speaker-dependent Speech Separation Approach for Chime-5 Challenge (2019) (3)
- Maximum likelihood learning of auditory feature maps for stationary vowels (1996) (3)
- A Multi-Target SNR-Progressive Learning Approach to Regression Based Speech Enhancement (2020) (3)
- From decoding-driven to detection-based paradigms for automatic speech recognition (2004) (3)
- A kernelized maximal-figure-of-merit learning approach based on subspace distance minimization (2011) (3)
- Fundamentals Of Speaker And Utterance Verification With Applications (1997) (3)
- MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling (2009) (3)
- Unsupervised, smooth training of feed-forward neural networks for mismatch compensation (1997) (3)
- on Speech Recognition and Understanding , Part 2 (2009) (3)
- Knowledge integration for improving performance in LVCSR (2013) (3)
- Speech Recognition and Production by Machines (2015) (3)
- Minimax classification with parametric neighborhoods for noisy speech recognition (2001) (3)
- An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors (2007) (3)
- Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement (2018) (3)
- Deep neural network based voice conversion with a large synthesized parallel corpus (2016) (3)
- The USTC-Ximalaya System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription (M2met) Challenge (2022) (3)
- 2D-to-2D Mask Estimation for Speech Enhancement Based on Fully Convolutional Neural Network (2020) (3)
- Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts (2016) (3)
- A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition (2019) (3)
- A Forward-Backward Subsequence Smoothing Based Eigen Approach to Clutter Rejection in Color Flow Imaging (2014) (3)
- Some techniques for creating robust stochastic models for speech recognition (1987) (3)
- Stochastic modeling in spoken dialogue system design (1994) (3)
- A study on subword modeling for utterance verification in Mexican Spanish (1997) (3)
- A new confidence measure combining Hidden Markov Models and Artificial Neural Networks of phonemes for effective keyword spotting (2012) (3)
- A Study on Using Word-Level HMMs to Improve ASR Performance over State-of-the-Art Phone-Level Acoustic Modeling for LVCSR (2012) (3)
- Generating alternative pronunciations from a dictionary (1999) (2)
- A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning (2022) (2)
- An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances (2020) (2)
- Information and Services Manager Customizes Dialogue-Based Applications (2000) (2)
- Utterance verification based on neighborhood information and Bayes factors (2002) (2)
- Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks (2016) (2)
- High-resolution acoustic modeling and compact language modeling of language-universal speech attributes for spoken language identification (2015) (2)
- Speaker set identification through speaker group modeling (1992) (2)
- An i-vector based descriptor for alphabetical gesture recognition (2014) (2)
- On discriminative semi-supervised incremental learning with a multi-view perspective for image concept modeling (2012) (2)
- New model-based HMM distances with applications to run-time ASR error estimation and model tuning (2003) (2)
- Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge (2019) (2)
- A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge (2020) (2)
- A Multi-Objective Programming Approach to Compromising Classification Performance Metrics (2007) (2)
- Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis (2022) (2)
- Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries (2021) (2)
- An mcmc approach to joint estimation of clean speech and noise for robust speech recognition (2013) (2)
- Minimum verification error training for topic verification (2003) (2)
- A study on sampling of STFT modifications in time and frequency domains for DNN-based speech dereverberation (2016) (2)
- A Speech Enhancement Neural Network Architecture with SNR-Progressive Multi-Target Learning for Robust Speech Recognition (2019) (2)
- Adaptive change point detection of dynamic functional connectivity networks (2016) (2)
- Speech Emotion Recognition Based on Acoustic Segment Model (2021) (2)
- Joint tracking of clean speech and noise using HMMs and particle filters for robust speech recognition (2012) (2)
- AN EFFICIENT DECODING APPROACH FOR DIALOGUE SYSTEMS (2000) (2)
- Adaptive change point detection of dynamic functional connectivity networks (2016) (2)
- A Variational Bayesian Approach to Learning Latent Variables for Acoustic Knowledge Transfer (2021) (2)
- Separation Guided Speaker Diarization in Realistic Mismatched Conditions (2021) (2)
- An Experimental Study on Continuous Phone Recognition with Little or No Language-Specific Training Data (2008) (2)
- A Model Ensemble Approach for Sound Event Localization and Detection (2021) (2)
- A single-ensemble-based hybrid approach to clutter rejection combining bilinear Hankel with regression (2013) (2)
- End-to-End Audio-Visual Neural Speaker Diarization (2022) (2)
- Acoustic modeling of context dependent units, for large vocabulary speech recognition in Spanish (1995) (1)
- Maximum Confidence Measure Based Interaural Phase Difference Estimation for Noise Masking in Dual-Microphone Robust Speech Recognition (2011) (1)
- Speaker‐independent recognition of the DARPA Naval Resource Management Task (1989) (1)
- Adaptive Learning in Acoustic and Language Modeling (1995) (1)
- The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition (2023) (1)
- Directions in automatic speech recognition (1995) (1)
- Tunable keyword-aware language modeling and context dependent fillers for LVCSR-based spoken keyword search (2015) (1)
- A PREFERENCE RANKING MODEL USING A DISCRIMINATIVELY-TRAINED CLASSIFIER (2008) (1)
- Geometry Constrained Progressive Learning for Lstm-Based Speech Enhancement (2020) (1)
- Transition features for CRF-based speech recognition and boundary detection (2009) (1)
- An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition (2022) (1)
- Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition (2017) (1)
- Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions (2020) (1)
- Towards a direct Bayesian adaptation framework for deep models (2016) (1)
- KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement (2019) (1)
- Guest Editorial: Special Issue on Machine Learning Methods in Signal Processing (2004) (1)
- A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising (2019) (1)
- An Efficient Structure for Continuous Speech Recognition (1992) (1)
- Discriminative dynamic Gaussian mixture selection with enhanced robustness and performance for multi-accent speech recognition (2012) (1)
- Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments (2021) (1)
- A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification (2022) (1)
- A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network (2018) (1)
- Model-based margin estimation for hidden Markov model learning and generalisation (2013) (1)
- Improving Separation-Based Speaker Diarization Via Iterative Model Refinement And Speaker Embedding Based Post-Processing (2022) (1)
- A voice user interface demonstration system for mexican Spanish (1998) (1)
- Speech Enhancement with Convolutional-Recurrent Networks (2018) (0)
- Space-and-Speaker-Aware Acoustic Modeling with Effective Data Augmentation for Recognition of Multi-Array Conversational Speech (2022) (0)
- Spoken Language Systems - Technical Challenges for Speech and Natural Language Processing (1999) (0)
- A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech (2017) (0)
- Automatic recognition of connected digit strings in a credit card authorization task (1990) (0)
- USEOFGENERALIZEDPATTERNMODEL FOR VIDEOANNOTATION (2007) (0)
- ROBUSTNESS AND DISCRIMINATION ORIENTED SPEECH REC USING WEIGHTED HMM AND SUBSPACE PROJECTION APPR (1990) (0)
- ITR-(NHS+ASE) automatic speech attribute transcription (ASAT): (2011) (0)
- A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement (2020) (0)
- ACOUSTIC \IODELISG OF SLBtC'ORD UNITS FOR SPEECH RECOGNITIOS (1990) (0)
- High‐resolution and efficient multiple‐string hypothesization using interword models (1993) (0)
- Title A Bayesian predictive classification approach to robust speechrecognition (2000) (0)
- Iterative Training Techniques for Phonetic Template Based Speech Recognition with a Speaker-Independent Phonetic Recognizer (2005) (0)
- Deep Segment Model for Acoustic Scene Classification (2022) (0)
- Media Annotation-Fusion of Region and Image-Based Techniques for Automatic Image Annotation (2006) (0)
- A Keyword-Aware Language Modeling Approach to Spoken Keyword Search (2015) (0)
- Nanyang Technological University Model-Based Noise Robust Speech Recognition (2012) (0)
- Discovering knowledge in & extracting information from multimedia patterns (2004) (0)
- Speech Enhancement Autoencoder with Hierarchical Latent Structure (2021) (0)
- A new model using artificial intelligence to predict recurrence after surgical resection of stage I-II non-small cell lung cancer. (2021) (0)
- A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition (2022) (0)
- Bandwidth expansion of speech based on wavelet transform modulus maxima vector mapping (2010) (0)
- TclBLASR: an automatic speech recognition extension for tcl (2001) (0)
- Minimum mistake ratio training for connected string model (1994) (0)
- An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks (2013) (0)
- A particle filter compensation approach to robust LVCSR (2013) (0)
- QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization (2023) (0)
- A Study on Subword eling for Utterance (1997) (0)
- A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement (2021) (0)
- Method and apparatus for speaker recognition through testing of oral information by means of forced decoding (1998) (0)
- Unsupervised Speaker Adaptation for Phonetic Transcription Based Voice Dialing (2005) (0)
- PHRASE LANGUAGE MODELS FOR SPEECH UNDERSTANDING DETECTION AND VERIFICATION-BASED (1997) (0)
- Speech recognition using keywords and non-keywords-modeling (1990) (0)
- ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding (2023) (0)
- Voice identification for recognition of interconnected numeral (1996) (0)
- Speaker Adaptation for Voice Dialing (2002) (0)
- Speech Enhancement Based on Deep Neural Networks (2014) (0)
- An Ensemble Teacher-Student Learning Approach with Poisson Sub-sampling to Differential Privacy Preserving Speech Recognition (2022) (0)
- A Study on Detection Based Autom (2006) (0)
- Riemannian Stochastic Gradient Descent for Tensor-Train Recurrent Neural Networks (2018) (0)
- Speaker verification method using group normalization scoring (1993) (0)
- CCPR 2008 Keynote Speech 2 (2008) (0)
- Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition (2016) (0)
- Automatic Speech Recognition by Machines (2021) (0)
- Proceedings - 2010 IEEE International Symposium on Multimedia, ISM 2010: Message from the conference co-chairs (2010) (0)
- Developments and Directions in Speech Recognition and Understanding , Part 1 Citation (2009) (0)
- Title Bayesian adaptive learning of the parameters of hidden Markovmodel for speech recognition (2004) (0)
- Distinctive verification of statements for the recognition of connected digits (1996) (0)
- The USTC-iFlytek System for the First DIHARD Challenge (2018) (0)
- Use of Generalized Pattern Model for Video Annotation (2007) (0)
- Correction to "An SNR-incremental stochastic matching algorithm for noisy speech recognition" (2002) (0)
- Speech and audio processing for multimedia communications (1997) (0)
- Per-Exemplar Fusion Learning for Video Retrieval and Recounting (2012) (0)
- An Information-Extraction Approach to Speech Analysis and Processing (2012) (0)
- A survey on recent progress in the ASAT/SIRKUS paradigm (2010) (0)
- Soft Margin Estimation of Hidde (2006) (0)
- Feature and model compensation for robust speech recognition (1996) (0)
- Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning (2017) (0)
- Keyword Recognition and Correction Based on Utterance Verification and Knowledge Integration of Acoustic-Phonetic Features (2018) (0)
- Using Paralinguistic Information to Disambiguate User Intentions for Distinguishing Phrase Structure and Sarcasm in Spoken Dialog Systems (2021) (0)
- Iterative Constrained Optimization for Flexible Classifier Design With Multiple Competing Objectives (2007) (0)
- Keynote speech 1: An integrated deep learning approach to acoustic signal pre-processing and acoustic modeling with applications to robust automatic speech recognition (2017) (0)
This paper list is powered by the following services: