Mark John Francis Gales

Mark John Francis Gales's AcademicInfluence.com Rankings

Engineering

#3181

World Rank

#4217

Historical Rank

Electrical Engineering

#668

World Rank

#731

Historical Rank

engineering Degrees

Mark John Francis Gales

Computer Science

#4165

World Rank

#4382

Historical Rank

Computational Linguistics

#350

World Rank

#355

Historical Rank

Artificial Intelligence

#940

World Rank

#957

Historical Rank

Database

#1401

World Rank

#1474

Historical Rank

computer-science Degrees

Download Badge

Engineering
Computer Science

Mark John Francis Gales's Degrees

PhD Computer Science Stanford University
Masters Electrical Engineering Stanford University

Similar Degrees You Can Earn

Best Online PhD of Computer Science (Doctorates) 2026

Why Is Mark John Francis Gales Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Mark John Francis Gales's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Maximum likelihood linear transformations for HMM-based speech recognition (1998) (1809)
The HTK book version 3.4 (2006) (1059)
The Application of Hidden Markov Models in Speech Recognition (2007) (801)
Semi-tied covariance matrices for hidden Markov models (1999) (649)
Robust continuous speech recognition using parallel model combination (1996) (546)
Predictive Uncertainty Estimation via Prior Networks (2018) (519)
Mean and variance adaptation within the MLLR framework (1996) (494)
Model-based techniques for noise robust speech recognition (1995) (358)
Cluster adaptive training of hidden Markov models (2000) (317)
Cepstral parameter compensation for HMM recognition in noise (1993) (189)
An improved approach to the hidden Markov model decomposition of speech and noise (1992) (188)
Speech Recognition using SVMs (2001) (186)
Robust speech recognition in additive and convolutional noise using parallel model combination (1995) (168)
The generation and use of regression class trees for MLLR adaptation (1996) (145)
Ensemble Distribution Distillation (2019) (145)
Speech recognition and keyword spotting for low-resource languages: Babel project research at CUED (2014) (145)
The MGB challenge: Evaluating multi-genre broadcast media recognition (2015) (127)
HMM recognition in noise using parallel model combination (1993) (126)
Data augmentation for low resource languages (2014) (123)
Joint uncertainty decoding for noise robust speech recognition (2005) (117)
Lightly supervised recognition for automatic alignment of large coherent speech recordings (2010) (115)
Consensus Network Decoding for Statistical Machine Translation System Combination (2007) (113)
Progress in the CU-HTK broadcast news transcription system (2006) (107)
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness (2019) (97)
Statistical Parametric Speech Synthesis Based on Speaker and Language Factorization (2012) (97)
Efficient lattice rescoring using recurrent neural network language models (2014) (96)
Recurrent neural network language model adaptation for multi-genre broadcast speech recognition (2015) (93)
Investigation of multilingual deep neural networks for spoken term detection (2013) (91)
The Cambridge University March 2005 speaker diarisation system (2005) (91)
Improved neural network based language modelling and adaptation (2010) (89)
Multilingual representations for low resource speech recognition and keyword search (2015) (87)
Sequence Student-Teacher Training of Deep Neural Networks (2016) (85)
Broadcast news transcription using HTK (1997) (85)
CUED-RNNLM — An open-source toolkit for efficient training and evaluation of recurrent neural network language models (2016) (82)
Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data (2007) (80)
Recurrent neural network language model training with noise contrastive estimation for speech recognition (2015) (79)
Improving environmental robustness in large vocabulary speech recognition (1996) (77)
Adaptation of deep neural network acoustic models using factorised i-vectors (2014) (74)
Multi-basis adaptive neural network for rapid adaptation in speech recognition (2015) (73)
Unsupervised training and directed manual transcription for LVCSR (2010) (73)
Unsupervised clustering of emotion and voice styles for expressive TTS (2012) (72)
Predictive model-based compensation schemes for robust speech recognition (1998) (72)
The development of the 1996 HTK broadcast news transcription system (1996) (72)
State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs (1999) (70)
Training LVCSR systems on thousands of hours of data (2005) (68)
Use of Gaussian selection in large vocabulary continuous speech recognition using HMMS (1996) (68)
The theory of segmental hidden Markov models (1993) (65)
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch (2014) (65)
Issues with uncertainty decoding for noise robust automatic speech recognition (2008) (65)
System combination and score normalization for spoken term detection (2013) (65)
Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation (1996) (64)
A fast and flexible implementation of parallel model combination (1995) (64)
MMI-MAP and MPE-MAP for acoustic model adaptation (2003) (63)
Cluster adaptive training for speech recognition (1998) (63)
Development of the 2003 CU-HTK conversational telephone speech transcription system (2004) (62)
Iterative unsupervised adaptation using maximum likelihood linear regression (1996) (61)
Environmentally robust ASR front-end for deep neural network acoustic models (2015) (59)
Product of Experts for Statistical Parametric Speech Synthesis (2012) (58)
Discriminative map for acoustic model adaptation (2003) (58)
Fundamental Technologies in Modern Speech Recognition (2012) (57)
A high-performance Cantonese keyword search system (2013) (56)
Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition (2016) (56)
Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks (2021) (54)
Factor analysed hidden Markov models for speech recognition (2004) (54)
Maximum likelihood multiple subspace projections for hidden Markov models (2002) (54)
Product of Gaussians for speech recognition (2006) (53)
Unicode-based graphemic systems for limited resource languages (2015) (53)
Speaker and Noise Factorization for Robust Speech Recognition (2012) (53)
Discriminative classifiers with adaptive kernels for noise robust speech recognition (2010) (51)
Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages (2015) (51)
Using SVMS and discriminative models for speech recognition (2002) (50)
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models (2016) (50)
Using VTLN for broadcast news transcription (2004) (50)
Unsupervised Training for Mandarin Broadcast News and Conversation Transcription (2007) (49)
Robust speech recognition in noise --- performance of the IBM continuous speech recogniser on the ARPA noise spoke task (1995) (48)
Augmented Statistical Models for Speech Recognition (2006) (47)
Language independent and unsupervised acoustic models for speech recognition and keyword spotting (2014) (46)
Variance compensation within the MLLR framework (1996) (46)
Stimulated Deep Neural Network for Speech Recognition (2016) (43)
Transcription of multi-genre media archives using out-of-domain data (2012) (43)
Use of contexts in language model interpolation and adaptation (2009) (43)
The Cu-Htk Mandarin Broadcast News Transcription System (2006) (42)
Investigation of unsupervised adaptation of DNN acoustic models with filter bank input (2014) (42)
Automatically grading learners' English using a Gaussian process (2015) (42)
Training and adapting MLP features for Arabic speech recognition (2009) (42)
Improving the interpretability of deep neural networks with stimulated learning (2015) (42)
Extended VTS for Noise-Robust Speech Recognition (2009) (42)
Improving speech recognition and keyword search for low resource languages using web data (2015) (42)
Segmental hidden Markov models (1993) (41)
Structured Log Linear Models for Noise Robust Speech Recognition (2010) (41)
Automatic transcription of Broadcast News (2002) (41)
Acoustic factorisation (2001) (41)
Generalised linear Gaussian models (2001) (41)
Rao-Blackwellised Gibbs sampling for switching linear dynamical systems (2004) (40)
Structured SVMs for Automatic Speech Recognition (2013) (40)
Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition (2017) (40)
Gaussian Process Experts for Voice Conversion (2011) (40)
Combining Derivative and Parametric Kernels for Speaker Verification (2009) (39)
Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training (2012) (37)
Cambridge university transcription systems for the multi-genre broadcast challenge (2015) (37)
Speech factorization for HMM-TTS based on cluster adaptive training (2012) (37)
Complex cepstrum as phase information in statistical parametric speech synthesis (2012) (36)
A comparative study of methods for phonetic decision-tree state clustering (1997) (36)
Adaptive training for robust ASR (2001) (36)
Multiple-cluster adaptive training schemes (2001) (35)
Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition (2011) (35)
Semi-tied covariance matrices (1998) (34)
Using SVMs to classify variable length speech patterns (2002) (33)
Adaptation of precision matrix models on large vocabulary continuous speech recognition (2005) (33)
Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news (1999) (32)
Uncertainty Estimation in Autoregressive Structured Prediction (2021) (32)
Automatic Transcription of Multi-genre Media Archives (2013) (31)
Impact of single-microphone dereverberation on DNN-based meeting transcription systems (2014) (30)
Improving the training and evaluation efficiency of recurrent neural network language models (2015) (30)
The efficient incorporation of MLP features into automatic speech recognition systems (2011) (30)
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification? (2011) (30)
Complex cepstrum for statistical parametric speech synthesis (2013) (30)
Towards automatic assessment of spontaneous spoken English (2018) (30)
Predictive linear transforms for noise robust speech recognition (2007) (30)
Issues with uncertainty decoding for noise robust speech recognition (2006) (30)
Parallel model combination for speech recognition in noise (1993) (30)
Structured Discriminative Models For Speech Recognition: An Overview (2012) (30)
Development of a phonetic system for large vocabulary Arabic speech recognition (2007) (29)
Structured Support Vector Machines for Noise Robust Continuous Speech Recognition (2011) (29)
The development of the Cambridge University RT-04 diarisation system (2004) (29)
Syllable language models for Mandarin speech recognition: exploiting character language models. (2013) (29)
Derivative kernels for noise robust ASR (2011) (29)
Automatic complexity control for HLDA systems (2003) (29)
Switching linear dynamical systems for speech recognition (2003) (28)
Discriminative Models for Speech Recognition (2007) (28)
Support vector machines for noise robust ASR (2009) (28)
Fundamental Technologies in Modern Speech Recognition [From the Guest Editors] (2012) (27)
Language model cross adaptation for LVCSR system combination (2013) (27)
Canonical state models for automatic speech recognition (2010) (26)
Multiple kernel learning for speaker verification (2008) (26)
Model-Based Approaches to Handling Uncertainty (2011) (26)
Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system (2005) (26)
Maximum likelihood multiple projection schemes for hidden Markov models (1999) (26)
Discriminative cluster adaptive training (2006) (26)
Handbook of Natural Language Processing and Machine Translation (2012) (26)
Improving Interpretability and Regularization in Deep Learning (2018) (26)
Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks (2018) (26)
Automatic transcription of conversational telephone speech (2005) (25)
Incorporating Uncertainty into Deep Learning for Spoken Language Assessment (2017) (25)
Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages (2014) (25)
Discriminative semi-parametric trajectory model for speech recognition (2007) (24)
A Log Domain Pulse Model for Parametric Speech Synthesis (2018) (24)
Model-based approaches to handling additive noise in reverberant environments (2011) (24)
Speaker and noise factorisation on the AURORA4 task (2011) (24)
Improving multiple-crowd-sourced transcriptions using a speech recogniser (2015) (24)
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation (2018) (23)
Development of the CU-HTK 2004 broadcast news transcription systems (2005) (23)
Discriminative adaptive training with VTS and JUD (2009) (23)
Prior information for rapid speaker adaptation (2010) (23)
Context dependent language model adaptation (2008) (23)
Statistical parametric speech synthesis with joint estimation of acoustic and excitation model parameters (2010) (22)
Speech Recognition System Combination for Machine Translation (2007) (22)
Future word contexts in neural network language models (2017) (22)
Morphological decomposition in Arabic ASR systems (2012) (22)
Transformation smoothing for speaker and environmental adaptation (1997) (21)
A mixture of Gaussians front end for speech recognition (2001) (21)
Generating complementary systems for speech recognition (2006) (21)
Combining multiple high quality corpora for improving HMM-TTS (2012) (21)
Photo-realistic expressive text to talking head synthesis (2013) (21)
Unsupervised Adaptation With Discriminative Mapping Transforms (2009) (21)
Minimum phone error training of precision matrix models (2006) (21)
PMC for speech recognition in additive and convolutional noise (1993) (21)
Structured discriminative models for noise robust continuous speech recognition (2011) (20)
Long-Span Summarization via Local Attention and Content Selection (2021) (20)
Improving lightly supervised training for broadcast transcription (2013) (20)
Morphological analysis and decomposition for Arabic speech-to-text systems (2009) (20)
Bayesian Adaptive Inference and Adaptive Training (2007) (20)
Towards Using Conversations with Spoken Dialogue Systems in the Automated Assessment of Non-Native Speakers of English (2016) (19)
PHONETIC AND GRAPHEMIC SYSTEMS FOR MULTI-GENRE BROADCAST TRANSCRIPTION (2018) (19)
Semi-tied Full-covariance Matrices for Hidden Markov Models (1997) (19)
Structured discriminative models for speech recognition (2012) (18)
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems (2016) (18)
Language model combination and adaptation usingweighted finite state transducers (2010) (18)
Complementary System Generation using Directed Decision Trees (2007) (17)
Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition (2019) (17)
The development of the cambridge university alignment systems for the multi-genre broadcast challenge (2015) (17)
Incremental predictive and adaptive noise compensation (2009) (17)
Basis superposition precision matrix modelling for large vocabulary continuous speech recognition (2004) (17)
Speaker diarisation and longitudinal linking in multi-genre broadcast data (2015) (17)
Acoustic Modelling Using Continuous Rational Kernels (2005) (17)
Automatic Speech Recognition System Development in the "Wild" (2018) (17)
Multi-Language Neural Network Language Models (2016) (17)
Improved DNN-based segmentation for multi-genre broadcast audio (2016) (17)
Impact of ASR Performance on Free Speaking Language Assessment (2018) (16)
Integrated Online Speaker Clustering and Adaptation (2011) (16)
Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems (2011) (16)
Recent improvements to the Cambridge Arabic Speech-to-Text systems (2010) (16)
A Pulse Model in Log-domain for a Uniform Synthesizer (2016) (15)
Factor analysed hidden Markov models (2002) (15)
Joint Uncertainty Decoding With Predictive Methods for Noise Robust Speech Recognition (2011) (15)
Model complexity control and compression using discriminative growth functions (2004) (15)
An attention based model for off-topic spontaneous spoken response detection: An Initial Study (2017) (14)
Abstractive Spoken Document Summarization Using Hierarchical Model with Multi-Stage Attention Diversity Optimization (2020) (14)
Ensemble Approaches for Uncertainty in Spoken Language Assessment (2020) (14)
Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions (2007) (14)
Multi-task ensembles with teacher-student training (2017) (14)
A confidence-based approach for improving keyword hypothesis scores (2013) (14)
The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation (2015) (14)
Phonetic pronunciations for arabic speech-to-text systems (2008) (14)
Precision matrix modelling for large vocabulary continuous speech recognition (2004) (13)
Regression Prior Networks (2020) (13)
Derivative and parametric kernels for speaker verification (2007) (13)
Porting: SwitchBoard to the VoiceMail task (2003) (13)
Improved cross-task recognition using MMIE training (2002) (13)
Investigation of acoustic modeling techniques for LVCSR systems (2005) (13)
Improving reverberant VTS for hands-free robust speech recognition (2011) (13)
Adaptive training using structured transforms (2004) (13)
Multimodal Fusion (2009) (13)
Tail distribution modelling using the richter and power exponential distributions (1999) (13)
Unsupervised discriminative adaptation using discriminative mapping transforms (2008) (12)
A Deep Learning Approach to Assessing Non-native Pronunciation of English Using Phone Distances (2018) (12)
Paraphrastic language models (2014) (12)
Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation (2011) (12)
Investigation of acoustic units for LVCSR systems (2011) (12)
CU-HTK April 2002 Switchboard System (2002) (12)
Decision tree-based context clustering based on cross validation and hierarchical priors (2011) (12)
Automatic Grammatical Error Detection of Non-native Spoken Learner English (2019) (12)
Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio (2007) (12)
SVMS, SCORE-SPACES AND MAXIMUM MARGIN STATISTICAL MODELS (2004) (12)
Off-topic Response Detection for Spontaneous Spoken English Assessment (2016) (12)
Infinite structured support vector machines for speech recognition (2014) (11)
Low-Resource Speech Recognition and Keyword-Spotting (2017) (11)
Robust excitation-based features for Automatic Speech Recognition (2015) (11)
Automatic transcription of conversational telephone speech: development of the CU-HTK 2002 system (2003) (11)
I-vector estimation using informative priors for adaptation of deep neural networks (2015) (11)
Speaker and Expression Factorization for Audiobook Data: Expressiveness and Transplantation (2015) (11)
An explicit independence constraint for factorised adaptation in speech recognition (2013) (11)
Maximum margin training of generative kernels (2004) (11)
Directed decision trees for generating complementary systems (2009) (11)
Noise robustness in HMM-TTS speaker adaptation (2013) (11)
Rapid likelihood calculation of subspace clustered Gaussian components (2000) (11)
Covariance modelling for noise-robust speech recognition (2008) (11)
Bayesian adaptation and adaptively trained systems (2005) (11)
Automatic model complexity control using marginalized discriminative growth functions (2003) (11)
Factor analysis based VTS and JUD noise estimation and compensation (2011) (10)
Sequence Teacher-Student Training of Acoustic Models for Automatic Free Speaking Language Assessment (2018) (10)
Uncertainty decoding for noise robust automatic speech recognition (2004) (10)
Integrated Expression Prediction and Speech Synthesis From Text (2014) (10)
Stimulated training for automatic speech recognition and keyword search in limited resource conditions (2017) (10)
Improving Lightly Supervised Training for Broadcast Transcriptions (2013) (10)
Paraphrastic recurrent neural network language models (2015) (10)
Impact of ASR Performance on Spoken Grammatical Error Detection (2019) (10)
Speech intonation for TTS: study on evaluation methodology (2014) (10)
Development of the 2004 CU-HTK English CTS systems using more than two thousand hours of data (2004) (10)
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks (2019) (10)
Combining a Gaussian mixture model front end with MFCC parameters (2002) (10)
Exploiting Chinese character models to improve speech recognition performance (2009) (10)
Investigation of back-off based interpolation between recurrent neural network and n-gram language models (2015) (10)
Incremental Adaptation using Bayesian Inference (2006) (10)
Transformation streams and the HMM error model (2002) (9)
Model-Based Approaches for Degraded Channel Modelling in Robust ASR (2012) (9)
Temporally varying model parameters for large vocabulary continuous speech recognition (2005) (9)
Use of Graphemic Lexicons for Spoken Language Assessment (2017) (9)
Tandem system adaptation using multiple linear feature transforms (2013) (9)
Integrated automatic expression prediction and speech synthesis from text (2013) (9)
Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets (2021) (9)
Improving joint uncertainty decoding performance by predictive methods for noise robust speech recognition (2009) (9)
Transforming features to compensate speech recogniser models for noise (2009) (9)
Asymptotically exact noise-corrupted speech likelihoods (2010) (9)
Inference algorithms for generative score-spaces (2012) (9)
Uncertainty in Structured Prediction (2020) (8)
Building HMM-TTS Voices on Diverse Data (2014) (8)
Universal Adversarial Attacks on Spoken Language Assessment Systems (2020) (8)
Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition (2009) (8)
Rapid joint speaker and noise compensation for robust speech recognition (2011) (8)
Combining i-vector representation and structured neural networks for rapid adaptation (2016) (8)
Building multiple complementary systems using directed decision trees (2007) (8)
Efficient generation and use of MLP features for Arabic speech recognition (2009) (8)
A language space representation for speech recognition (2015) (8)
Factored Semi-Tied Covariance Matrices (2000) (8)
Should Ensemble Members Be Calibrated? (2021) (8)
Factor analysis based VTS discriminative adaptive training (2012) (8)
Acoustic modelling for speech recognition: Hidden Markov models and beyond? (2009) (8)
Disfluency Detection for Spoken Learner English (2019) (8)
Automatic Detection of Accent and Lexical Pronunciation Errors in Spontaneous Non-Native English Speech (2020) (8)
Training Augmented Models Using SVMs (2006) (7)
Automatic Characterisation of the Pronunciation of Non-native English Speakers using Phone Distance Features (2017) (7)
Discriminative adaptation for speaker verification (2006) (7)
A Deep Learning Approach to Automatic Characterisation of Rhythm in Non-Native English Speech (2019) (7)
System combination with log-linear models (2016) (7)
CUED_SPEECH at TREC 2020 Podcast Summarisation Track (2020) (7)
An initial investigation of long-term adaptation for meeting transcription (2014) (7)
DEVELOPING KEYWORD SEARCH UNDER THE IARPA BABEL PROGRAM (2013) (7)
Discriminative classifiers with generative kernels for noise robust ASR (2008) (7)
Spoken Language 'Grammatical Error Correction' (2020) (7)
Attention Forcing for Machine Translation (2021) (7)
Attention Forcing for Sequence-to-sequence Model Training (2019) (7)
Extending noise robust structured support vector machines to larger vocabulary tasks (2011) (7)
A hierarchical attention based model for off-topic spontaneous spoken response detection (2017) (6)
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models (2017) (6)
Generating multiple-accent pronunciations for TTS using joint sequence model interpolation (2014) (6)
DEVELOPMENT OF THE CUHTK 2004 RT 04 F MANDARIN CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTION SYSTEM (2004) (6)
General Sequence Teacher–Student Learning (2019) (6)
Recent Progress in Large Vocabulary Continuous Speech Recognition: An HTK Perspective (2006) (6)
Attention Forcing for Speech Synthesis (2020) (6)
Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition (2016) (6)
Integrated speaker-adaptive speech synthesis (2017) (6)
Recurrent neural network language models for keyword search (2017) (6)
Ordering Info About Us Alerts Contact Help Log in The Application of Hidden Markov Models in Speech Recognition (2010) (5)
Surprise Languages: Rapid-Response Cross-Language IR (2019) (5)
Complex cepstrum analysis based on the minimum mean squared error (2013) (5)
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems (2020) (5)
Complementary Systems for Off-Topic Spoken Response Detection (2020) (5)
Efficient decoding with generative score-spaces using the expectation semiring (2013) (5)
Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis (2013) (5)
A generalised derivative kernel for speaker verification (2008) (5)
Constrained discriminative mapping transforms for unsupervised speaker adaptation (2011) (5)
Generative Kernels and Score-Spaces for Classi cation of Speech : Progress Report iii (2012) (5)
Student-Teacher Training with Diverse Decision Tree Ensembles (2017) (5)
Adaptive training using discriminative mapping transforms (2008) (5)
Prior Networks for Detection of Adversarial Attacks (2018) (5)
Multiple-average-voice-based speech synthesis (2014) (5)
Answer Uncertainty and Unanswerability in Multiple-Choice Machine Reading Comprehension (2022) (5)
Discriminative language model adaptation for Mandarin broadcast speech transcription and translation (2007) (5)
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems (2018) (5)
Parallel model combination on a noise corrupted resource management task (1994) (4)
A variational perspective on noise-robust speech recognition (2011) (4)
Ensemble Distillation Approaches for Grammatical Error Correction (2020) (4)
Annotating large lattices with the exact word error (2015) (4)
Structured discriminative models using deep neural-network features (2015) (4)
Residue-Based Natural Language Adversarial Attack Detection (2022) (4)
Shifts 2.0: Extending The Dataset of Real Distributional Shifts (2022) (4)
Log-Linear System Combination Using Structured Support Vector Machines (2016) (4)
Product of Gaussians and multiple stream systems (2003) (4)
Paraphrastic language models and combination with neural network language models (2013) (3)
Sequence Kernels for Speaker and Speech Recognition (2009) (3)
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models (2023) (3)
Kernel Eigenvoices (Revisited) for Large-Vocabulary Speech Recognition (2011) (3)
Graphone Model Interpolation and Arabic Pronunciation Generation (2011) (3)
Statistical parametric speech synthesis based on product of experts (2010) (3)
Multipulse Sequences for Residual Signal Modeling (2011) (3)
Kernelized log linear models for continuous speech recognition (2013) (3)
SVMs for speech recognition (2002) (3)
Cluster adaptive training of average voice models (2014) (3)
Reconstructing voices within the multiple-average-voice-model framework (2015) (3)
Infinite support vector machines in speech recognition (2013) (3)
Bayesian discriminative adaptation for speech recognition (2009) (3)
On Assessing and Developing Spoken ’Grammatical Error Correction’ Systems (2022) (3)
Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems (2021) (3)
Combining VTS model compensation and support vector machines (2009) (3)
DISCRIMINATIVE CLASSIFIERS WITH GENERATIVE KERNELS FOR NOISE ROBUST SPEECH RECOGNITION (2008) (3)
Self-Distribution Distillation: Efficient Uncertainty Estimation (2022) (2)
Noisy CMLLR for noise-robust speech recognition (2009) (2)
View-Specific Assessment of L2 Spoken English (2022) (2)
Deep Activation Mixture Model for Speech Recognition (2017) (2)
Improving Speech Transcription for Mandarin-English Translation (2007) (2)
Training a parametric-based logF0 model with the minimum generation error criterion (2010) (2)
Transcription of broadcast news with a time constraint: IBM's 10xRT HUB4 system (2000) (2)
Rapid Nonlinear Speaker Adaptation for Large-Vocabulary Continuous Speech Recognition (2012) (2)
Importance sampling to compute likelihoods of noise-corrupted speech (2013) (2)
Hierarchical RNNs for Waveform-Level Speech Synthesis (2018) (2)
Noise Robustness (2018) (2)
Waveform-Based Speaker Representations for Speech Synthesis (2018) (2)
A speech recognition system and method (2009) (2)
Maximum Likelihood Linear Regression 32 . 1 Maximum likelihood linear regression (2007) (2)
Corrections to "Automatic Transcription of Conversational Telephone Speech" (2006) (1)
Improved Auto-Marking Confidence for Spoken Language Assessment (2018) (1)
Adaptive Training and Noise Estimation for Model-Based Noise Compensation for ASR (2012) (1)
Speaker dependent expression predictor from text: Expressiveness and transplantation (2014) (1)
The application of parallel model combination to a large vocabulary dictation task (1995) (1)
Deliberation-Based Multi-Pass Speech Synthesis (2021) (1)
Recent developments at Cambridge in broadcast news transcription (2004) (1)
Analysing Bias in Spoken Language Assessment Using Concept Activation Vectors (2021) (1)
Product of Gaussians as a distributed representation for speech recognition (2003) (1)
Cross-domain paraphrasing for improving language modelling using out-of-domain data (2013) (1)
Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search (2017) (1)
Podcast Summary Assessment: A Resource for Evaluating Summary Assessment Methods (2022) (1)
“World Knowledge” in Multiple Choice Reading Comprehension (2022) (1)
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization (2023) (1)
Non-native Speaker Verification for Spoken Language Assessment (2019) (1)
Grammatical Error Correction Systems for Automated Assessment: Are They Susceptible to Universal Adversarial Attacks? (2022) (1)
Training a supra-segmental parametric F0 model without interpolating F0 (2013) (1)
The HMM error model (2002) (1)
A Spectrally Weighted Mixture of Least Square Error and Wasserstein Discriminator Loss for Generative SPSS (2018) (1)
Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition (2022) (1)
Confidence Scores for Speech Processing (2018) (1)
Paraphrastic neural network language models (2014) (1)
Incremental adaptation with VTS and joint adaptively trained systems (2009) (1)
Multiple-Choice Question Generation: Towards an Automated Assessment Framework (2022) (1)
Progress in Broadcast News English Transcription (2004) (0)
Learning Between Different Teacher and Student Models in ASR (2019) (0)
Efficient Use of End-to-End Data in Spoken Language Processing (2021) (0)
Transformation Techniques in Speaker Adaptation 33 Maximum Likelihood Linear Regression 43 (0)
Structured Discriminative Models for Sequential Data Classification (2010) (0)
Ensemble Prosody Prediction For Expressive Speech Synthesis (2023) (0)
Variational dynamic kernels for speaker verification (2009) (0)
Edinburgh Research Explorer Multiple-average-voice-based speech synthesis (2017) (0)
Data underpinning "I-Vector Estimation Using Informative Priors for Adaptation of Deep Neural Networks" (2015) (0)
Factor analysed HMMs (Hidden Markov Models) (2002) (0)
IBM ’ s 10 x Real-time Broadcast News Transcription System Used in the 1999 Hub 4 Evaluation (2000) (0)
Statistical parametric synthesis based on products of experts (2010) (0)
RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR KEYWORD SEARC (2016) (0)
A speech processing system that applies speaker adaptation techniques into an environment mismatch function (2010) (0)
Data underpining "System Combination with Log-linear Models" (2016) (0)
Supplementary data for "Speaker Diarisation and Linking in Multi-Genre Broadcast Data" (2015) (0)
Mandarin Chinese LVCSR System for Speech Translation (2015) (0)
PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE (2015) (0)
Analyzing Biases to Spurious Correlations in Text Classification Tasks (2022) (0)
Ensemble methods and efficient decoding (2016) (0)
Long-Span Dependencies in Transformer-based Summarization Systems (2021) (0)
Parallel Attention Forcing for Machine Translation (2022) (0)
2 ML Estimation of Semi-Tied Full Covariances (1997) (0)
L2 proficiency assessment using self-supervised speech representations (2022) (0)
A speech processing system (2012) (0)
Audio Grade Feature extraction Speech recogniser Text Features (2017) (0)
Model-based approaches to adaptive training in reverberant environments (2012) (0)
Speech synthesis by combining probability distributions from different linguistic levels (2012) (0)
Full-covariance model compensation for noise-robust speech recognition (2008) (0)
Deliberation Networks and How to Train Them (2022) (0)
Active Memory Networks for Language Modeling (2018) (0)
Grade Feature extraction Speech recogniser Text Features Grader Error Detection & Correction (2017) (0)
INCREMENTAL BAYESIAN ADAPTATION (0)
Noise-robust TTS speaker adaptation with statistics smoothing (2014) (0)
An Initial Investigation of Non-Native Spoken Question-Answering (2021) (0)
Surprise Languages : Rapid-Response Cross-Language (2019) (0)
An adaptive speech recognition system and method using a cascade of transforms (2010) (0)
tinuous Spee Recognition allel Model mbination (1996) (0)
Augmented Statistical Models: Using Dynamic Kernels for Acoustic Models (2009) (0)
Gender Bias and Universal Substitution Adversarial Attacks on Grammatical Error Correction Systems for Automated Assessment (2022) (0)
Minimum Phone Error Training of (2006) (0)
Light Supervised Data Selection, Voice Quality Normalized Training and Log Domain Pulse Synthesis (2017) (0)
Sentiment Perception Adversarial Attacks on Neural Machine Translation Systems (2023) (0)
Novel structural-scale uncertainty measures and error retention curves: application to multiple sclerosis (2022) (0)
A text-to-speech system having speaker voice related parameters and speaker attribute related parameters (2012) (0)
Generating Complementary Systems for Large Vocabulary Continuous Speech Recognition (0)
Identifying Adversarially Attackable and Robust Samples (2023) (0)
SLAM 2013 Speech, Language and Audio in Multimedia (2013) (0)
Engineering Department Noisy Cmllr for Noise-robust Speech Recognition (0)
1 Statistical Sequence Modelling (2017) (0)
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space (2023) (0)
IBM's 10xReal-time broadcast news transciption used in the 1999 hub4 evaluation (2000) (0)
Proceedings of INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013 (2013) (0)

This paper list is powered by the following services:

What Schools Are Affiliated With Mark John Francis Gales?

Mark John Francis Gales is affiliated with the following schools:

University of Cambridge

Mark John Francis Gales's Academic­Influence.com Rankings

Mark John Francis Gales's Degrees

Similar Degrees You Can Earn

Why Is Mark John Francis Gales Influential?

Mark John Francis Gales's Published Works

Published Works

What Schools Are Affiliated With Mark John Francis Gales?

Mark John Francis Gales's AcademicInfluence.com Rankings