Philip C. Woodland

Philip C. Woodland's AcademicInfluence.com Rankings

Philip C. Woodland

Engineering

#4088

World Rank

#5253

Historical Rank

Applied Physics

#921

World Rank

#943

Historical Rank

engineering Degrees

Download Badge

Engineering

Why Is Philip C. Woodland Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Philip C. Woodland's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models (1995) (2626)
The HTK book (1995) (2157)
The HTK book version 3.4 (2006) (1059)
Minimum Phone Error and I-smoothing for improved discriminative training (2002) (816)
Tree-based state tying for high accuracy acoustic modelling (1994) (783)
Mean and variance adaptation within the MLLR framework (1996) (494)
Large scale discriminative training of hidden Markov models for speech recognition (2002) (368)
Large vocabulary continuous speech recognition using HTK (1994) (308)
Posterior probability decoding, confidence estimation and system combination (2000) (290)
MMIE training of large vocabulary recognition systems (1997) (212)
Flexible speaker adaptation using maximum likelihood linear regression (1995) (190)
Speaker adaptation for continuous density HMMs: a review (2001) (183)
Large vocabulary decoding and confidence estimation using word posterior probabilities (2000) (174)
A One Pass Decoder Design For Large Vocabulary Recognition (1994) (159)
The 1994 HTK large vocabulary speech recognition system (1995) (147)
Large scale discriminative training for speech recognition (2000) (146)
The use of state tying in continuous speech recognition (1993) (135)
A variable-length category-based n-gram language model (1996) (135)
The MGB challenge: Evaluating multi-genre broadcast media recognition (2015) (127)
Speaker adaptation of continuous density HMMs using multivariate linear regression (1994) (121)
Tree-Based State Tying for High Accuracy Modelling (1994) (121)
A computational model of the auditory periphery for speech and hearing research. II. Descending paths. (1994) (119)
State clustering in hidden Markov model-based continuous speech recognition (1994) (118)
Consensus Network Decoding for Statistical Machine Translation System Combination (2007) (113)
Spoken Document Retrieval for TREC-8 at Cambridge University (1998) (110)
Progress in the CU-HTK broadcast news transcription system (2006) (107)
Speaker adaptation of HMMs using linear regression (1994) (106)
Segment generation and clustering in the HTK broadcast news transcription system (1998) (105)
A computational model of the auditory periphery for speech and hearing research. I. Ascending path. (1994) (100)
The use of prosody in a combined system for punctuation generation and speech recognition (2001) (99)
The 1998 HTK system for transcription of conversational telephone speech (1999) (98)
Improving broadcast news transcription by lightly supervised discriminative training (2004) (97)
Efficient lattice rescoring using recurrent neural network language models (2014) (96)
Recurrent neural network language model adaptation for multi-genre broadcast speech recognition (2015) (93)
Experiments in speaker normalisation and adaptation for large vocabulary speech recognition (1997) (92)
The Cambridge University March 2005 speaker diarisation system (2005) (91)
Investigation of multilingual deep neural networks for spoken term detection (2013) (91)
Improved neural network based language modelling and adaptation (2010) (89)
An investigation into vocal tract length normalisation (1999) (88)
Multilingual representations for low resource speech recognition and keyword search (2015) (87)
Broadcast news transcription using HTK (1997) (85)
The Cambridge University spoken document retrieval system (1999) (84)
Structural metadata research in the EARS program (2005) (84)
The HTK tied-state continuous speech recogniser (1993) (83)
CUED-RNNLM — An open-source toolkit for efficient training and evaluation of recurrent neural network language models (2016) (82)
Effects of out of vocabulary words in spoken document retrieval (poster session) (2000) (81)
Using accent-specific pronunciation modelling for robust speech recognition (1996) (79)
Recurrent neural network language model training with noise contrastive estimation for speech recognition (2015) (79)
Comparison of part-of-speech and automatically derived category-based language models for speech recognition (1998) (79)
A hidden Markov-model-based trainable speech synthesizer (1999) (78)
Improving environmental robustness in large vocabulary speech recognition (1996) (77)
Lattice-based discriminative training for large vocabulary speech recognition (1996) (76)
Very deep convolutional neural networks for robust speech recognition (2016) (75)
Adaptation of deep neural network acoustic models using factorised i-vectors (2014) (74)
Unsupervised training and directed manual transcription for LVCSR (2010) (73)
Multilingual large vocabulary speech recognition: the European SQALE project (1997) (72)
The development of the 1996 HTK broadcast news transcription system (1996) (72)
Training LVCSR systems on thousands of hours of data (2005) (68)
Speaker adaptation: techniques and challenges (1999) (66)
Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch (2014) (65)
System combination and score normalization for spoken term detection (2013) (65)
The development of the HTK Broadcast News transcription system: An overview (2002) (65)
Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation (1996) (64)
MMI-MAP and MPE-MAP for acoustic model adaptation (2003) (63)
Development of the 2003 CU-HTK conversational telephone speech transcription system (2004) (62)
Iterative unsupervised adaptation using maximum likelihood linear regression (1996) (61)
DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions (2016) (60)
Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models (1997) (60)
Improvements in linear transform based speaker adaptation (2001) (60)
The development of the 1994 HTK large vocabulary speech recognition system (1995) (59)
WSJCAM0 corpus and recording description (1994) (59)
Discriminative map for acoustic model adaptation (2003) (58)
Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition (2016) (56)
Frame discrimination training for HMMs for large vocabulary speech recognition (1999) (56)
A high-performance Cantonese keyword search system (2013) (56)
Parameterised sigmoid and reLU hidden activation functions for DNN acoustic modelling (2015) (54)
Combining Information Sources for Confidence Estimation with CRF Models (2011) (54)
Speaker clustering using direct maximisation of the MLLR-adapted likelihood (1998) (54)
The 1997 HTK broadcast news transcription system (1998) (54)
Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages (2015) (51)
Using VTLN for broadcast news transcription (2004) (50)
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models (2016) (50)
Improvements in an HMM-based speech synthesiser (1995) (50)
THE CU-HTK MARCH 2000 HUB5E TRANSCRIPTION SYSTEM (2000) (49)
Unsupervised Training for Mandarin Broadcast News and Conversation Transcription (2007) (49)
Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition (1997) (49)
Hidden Markov models using vector linear prediction and discriminative output distributions (1992) (48)
Discriminative adaptive training using the MPE criterion (2003) (48)
MPE-based discriminative linear transform for speaker adaptation (2004) (48)
Variance compensation within the MLLR framework (1996) (46)
Flexible speaker adaptation for large vocabulary speech recognition (1995) (44)
A rule-based named entity recognition system for speech input (2000) (44)
Improved discriminative training techniques for large vocabulary continuous speech recognition (2001) (43)
Use of contexts in language model interpolation and adaptation (2009) (43)
Transcription of multi-genre media archives using out-of-domain data (2012) (43)
Training and adapting MLP features for Arabic speech recognition (2009) (42)
The Cu-Htk Mandarin Broadcast News Transcription System (2006) (42)
A combined punctuation generation and speech recognition system and its performance enhancement using prosody (2003) (41)
Large scale MMIE training for conversational telephone speech recognition (2000) (41)
The use of accent-specific pronunciation dictionaries in acoustic model training (1998) (40)
A method for direct audio search with applications to indexing and retrieval (2000) (38)
Particle-based language modelling (2000) (38)
Cambridge university transcription systems for the multi-genre broadcast challenge (2015) (37)
Automatic speech synthesiser parameter estimation using HMMs (1995) (36)
Unsupervised language model adaptation for Mandarin broadcast conversation transcription (2006) (35)
Speaker adaptation using lattice-based MLLR (2001) (35)
Standalone training of context-dependent deep neural network acoustic models (2014) (35)
A PLSA-based language model for conversational telephone speech (2004) (34)
Variable-length categoryn-gram language models (1999) (34)
Discriminative Neural Clustering for Speaker Diarisation (2019) (33)
Combination of word-based and category-based language models (1996) (33)
Language modelling for Russian and English using words and classes (2003) (32)
The 1998 HTK broadcast news transcription system: development and results (1999) (32)
A general artificial neural network extension for HTK (2015) (32)
Recent advances in broadcast news transcription (2003) (32)
Experiments in broadcast news transcription (1998) (31)
Automatic Transcription of Multi-genre Media Archives (2013) (31)
The efficient incorporation of MLP features into automatic speech recognition systems (2011) (30)
Improving the training and evaluation efficiency of recurrent neural network language models (2015) (30)
The HTK large vocabulary recognition system for the 1995 ARPA H3 task (1996) (30)
Syllable language models for Mandarin speech recognition: exploiting character language models. (2013) (29)
The development of the Cambridge University RT-04 diarisation system (2004) (29)
Automatic complexity control for HLDA systems (2003) (29)
Development of a phonetic system for large vocabulary Arabic speech recognition (2007) (29)
Segmentation and classification of broadcast news audio (1998) (28)
Comparison of language modelling techniques for Russian and English (1998) (28)
Spontaneous speech recognition for the credit card corpus using the HTK toolkit (1994) (27)
Dynamic HMM selection for continuous speech recognition (1999) (27)
General query expansion techniques for spoken document retrieval (1999) (27)
Discriminative linear transforms for speaker adaptation (2001) (27)
Speaker Diarisation Using 2D Self-attentive Combination of Embeddings (2019) (27)
Language model cross adaptation for LVCSR system combination (2013) (27)
Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system (2005) (26)
Optimising hidden Markov models using discriminative output distributions (1991) (26)
Handbook of Natural Language Processing and Machine Translation (2012) (26)
Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech (2004) (26)
MPE-based discriminative linear transforms for speaker adaptation (2008) (26)
Tree-based state clustering for large vocabulary speech recognition (1994) (25)
Automatic transcription of conversational telephone speech (2005) (25)
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition (2020) (24)
Design of fast LVCSR systems (2003) (23)
Efficient class-based language modelling for very large vocabularies (2001) (23)
Development of the CU-HTK 2004 broadcast news transcription systems (2005) (23)
Context dependent language model adaptation (2008) (23)
New features in the CU-HTK system for transcription of conversational telephone speech (2001) (22)
Morphological decomposition in Arabic ASR systems (2012) (22)
Spoken document representations for probabilistic retrieval (2000) (22)
Emotion Recognition by Fusing Time Synchronous and Time Asynchronous Representations (2020) (22)
Speech Recognition System Combination for Machine Translation (2007) (22)
Unsupervised Adaptation With Discriminative Mapping Transforms (2009) (21)
Improved Tdnns Using Deep Kernels and Frequency Dependent Grid-RNNS (2018) (21)
Morphological analysis and decomposition for Arabic speech-to-text systems (2009) (20)
Rapid speaker adaptation using model prediction (1995) (20)
An investigation into the the interactions between speaker diarisation systems and automatic speech transcription (2003) (20)
Audio Indexing and Retrieval of Complete Broadcoast News Shows (2000) (20)
Improving lightly supervised training for broadcast transcription (2013) (20)
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems (2016) (18)
Language model combination and adaptation usingweighted finite state transducers (2010) (18)
Automatic capitalisation generation for speech input (2004) (18)
Speaker diarisation and longitudinal linking in multi-genre broadcast data (2015) (17)
The development of the cambridge university alignment systems for the multi-genre broadcast challenge (2015) (17)
Large vocabulary multilingual speech recognition using HTK (1995) (17)
Improved DNN-based segmentation for multi-genre broadcast audio (2016) (17)
Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems (2011) (16)
The CUHTK-entropic 10xRT broadcast news transcription system (1999) (16)
Recent improvements to the Cambridge Arabic Speech-to-Text systems (2010) (16)
Detecting deletions in ASR output (2014) (16)
Weight limiting, weight quantisation and generalisation in multi-layer perceptrons (1989) (16)
Modelling word-pair relations in a category-based language model (1997) (16)
Multi-Span Acoustic Modelling using Raw Waveform Signals (2019) (15)
The Cambridge University Multimedia Document Retrieval Demo System (2000) (15)
Benchmark DARPA RM results using the HTK portable HMM toolkit (1992) (15)
High Order Recurrent Neural Networks for Acoustic Modelling (2018) (15)
The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation (2015) (14)
Cambridge University Engineering Department (1994) (14)
Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection (2006) (14)
Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training (2017) (14)
Phonetic pronunciations for arabic speech-to-text systems (2008) (14)
A confidence-based approach for improving keyword hypothesis scores (2013) (14)
Improved cross-task recognition using MMIE training (2002) (13)
Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition (2019) (13)
Using relative duration in large vocabulary speech recognition (1993) (13)
Porting: SwitchBoard to the VoiceMail task (2003) (13)
Investigation of acoustic units for LVCSR systems (2011) (12)
Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation (2011) (12)
CU-HTK April 2002 Switchboard System (2002) (12)
Paraphrastic language models (2014) (12)
Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio (2007) (12)
Unsupervised discriminative adaptation using discriminative mapping transforms (2008) (12)
I-vector estimation using informative priors for adaptation of deep neural networks (2015) (11)
Information Retrieval from Unsegmented Broadcast News Audio (2001) (11)
Isolated word speech recognition based on connectionist techniques (1990) (11)
Automatic transcription of conversational telephone speech: development of the CU-HTK 2002 system (2003) (11)
Modelling syllable characteristics to improve a large vocabulary continuous speech recogniser (1994) (10)
Paraphrastic recurrent neural network language models (2015) (10)
Combination of Deep Speaker Embeddings for Diarisation (2020) (10)
PyHTK: Python Library and ASR Pipelines for HTK (2019) (10)
Improving Lightly Supervised Training for Broadcast Transcriptions (2013) (10)
Investigation of back-off based interpolation between recurrent neural network and n-gram language models (2015) (10)
Exploiting Chinese character models to improve speech recognition performance (2009) (10)
Development of the 2004 CU-HTK English CTS systems using more than two thousand hours of data (2004) (10)
Improvements in accuracy and speed in the HTK broadcast news transcription system (1999) (10)
Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-Trained Models (2021) (9)
Tree-Constrained Pointer Generator for End-to-End Contextual Speech Recognition (2021) (9)
Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem (2017) (8)
An experimental comparison of connectionist and conventional classification systems on natural data (1990) (8)
Efficient generation and use of MLP features for Arabic speech recognition (2009) (8)
Residual Energy-Based Models for End-to-End Speech Recognition (2021) (8)
Modelling sub-phone insertions and deletions in continuous speech recognition (2000) (8)
System combination with log-linear models (2016) (7)
DEVELOPING KEYWORD SEARCH UNDER THE IARPA BABEL PROGRAM (2013) (7)
Improving retrieval on imperfect speech transcriptions (poster abstract) (1999) (7)
I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models (2017) (6)
Recent Progress in Large Vocabulary Continuous Speech Recognition: An HTK Perspective (2006) (6)
Sequence training of DNN acoustic models with natural gradient (2017) (6)
DEVELOPMENT OF THE CUHTK 2004 RT 04 F MANDARIN CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTION SYSTEM (2004) (6)
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition (2021) (5)
Discriminative language model adaptation for Mandarin broadcast speech transcription and translation (2007) (5)
The Cambridge Multimedia Document Retrieval Project: summary of experiments (2001) (5)
RETRIEVAL FOR TREC-9 AT CAMBRIDGE UNIVERSITY (2001) (5)
Comparative evaluation of word- and category-based language models (1996) (5)
Word-pair relations for category-based language models (1997) (5)
Maximum mutual information training of hidden Markov models with vector linear predictors (2002) (5)
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems (2018) (5)
Complementary Phone Error Training (2012) (5)
Improved Large-Margin Softmax Loss for Speaker Diarisation (2019) (5)
A dynamic network decoder design for large vocabulary speech recognition (1994) (5)
Using Sub-word-level Information for Confidence Estimation with Conditional Random Field Models (2012) (4)
HTK V1.5: User, Reference and Programmer Manuals (1993) (4)
Variable-length category-based n-grams for language modelling (1995) (4)
A wave digital filter model of the entire auditory periphery (1993) (4)
Hidden Markov models using shared vector linear predictors (1993) (4)
A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training (2021) (4)
Implementation of automatic capitalisation generation systems for speech input (2002) (4)
The Cambridge University Multimedia Document Retrieval demo system (demonstration session) (2000) (4)
Spoken language systems technology workshop (1995) (4)
Spoken Alphabet Recognition Using Multilayer Perceptrons (1992) (4)
Speech analysis using a nonlinear cochlear model with feedback regulation (1992) (4)
Network representation of the middle and inner ear in a composite model of the auditory periphery (1992) (4)
Combining Natural Gradient with Hessian Free Methods for Sequence Training (2018) (4)
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription (2022) (4)
Word-to-category backoff language models (1996) (3)
Paraphrastic language models and combination with neural network language models (2013) (3)
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors (2022) (3)
The RT04 evaluation structural metadata systems at CUED (2004) (3)
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings (2020) (3)
Semi-tied Units for Efficient Gating in LSTM and Highway Networks (2018) (3)
Discriminative optimisation of large vocabulary recognition systems (1996) (3)
WSJCAM 0 Corpus and Recording (2007) (3)
Graphone Model Interpolation and Arabic Pronunciation Generation (2011) (3)
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition (2021) (2)
Content-Aware Speaker Embeddings for Speaker Diarisation (2021) (2)
Improving Speech Transcription for Mandarin-English Translation (2007) (2)
A neural network speech recogniser for directory access applications (1990) (2)
A composite model of the auditory periphery with feedback regulation (1992) (2)
SU Detection for RT-03f at Cambridge University (2003) (2)
Exploiting variable-width features in large vocabulary speech recognition (1993) (2)
Fixed dimension classifiers for speech recognition (1990) (2)
The HTK large vocabulary continuous speech recognition system: an overview (1994) (2)
Minimising Biasing Word Errors for Contextual ASR With the Tree-Constrained Pointer Generator (2022) (2)
Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition (2021) (2)
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition (2022) (1)
Direct sub-word confidence estimation with hidden-state conditional random fields (2014) (1)
Cross-domain paraphrasing for improving language modelling using out-of-domain data (2013) (1)
Recent developments in the HTK continuous speech recognition system (1994) (1)
Recent developments at Cambridge in broadcast news transcription (2004) (1)
Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning (2021) (1)
Corrections to "Automatic Transcription of Conversational Telephone Speech" (2006) (1)
RETRIEVAL FOR TREC-7 AT CAMBRIDGE UNIVERSITY (1999) (1)
RETRIEVAL FOR TREC-8 AT CAMBRIDGE UNIVERSITY-DRAFT (2000) (1)
Cross-Utterance Language Models with Acoustic Error Sampling (2020) (1)
Improving retrieval on imperfect speech transcriptions (1999) (1)
Supplementary data for "Parameterised Sigmoid and ReLU HiddenActivation Functions for DNN Acoustic Modelling" (2015) (1)
Paraphrastic neural network language models (2014) (1)
Speech pattern recognition using pattern recognizers and classifiers (1999) (1)
Biased Self-supervised learning for ASR (2022) (1)
Cluster identification for speaker-environment tracking (2002) (0)
Data underpining "System Combination with Log-linear Models" (2016) (0)
Self-Supervised Representations in Speech-Based Depression Detection (2023) (0)
Proceedings of INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013 (2013) (0)
Supplementary data for "Speaker Diarisation and Linking in Multi-Genre Broadcast Data" (2015) (0)
Progress in Broadcast News English Transcription (2004) (0)
Data Underpinning "Joint Optimisation of Tandem Systems Using Gaussian Mixture Density Neural Network Discriminative Sequence Training" (2017) (0)
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator (2022) (0)
RECURRENT NEURAL NETWORKS FOR ACOUSTIC MODELLING (2018) (0)
Data underpinning "I-Vector Estimation Using Informative Priors for Adaptation of Deep Neural Networks" (2015) (0)
Erratum: Language modelling for Russian and English using words and classes [Computer Speech and Language 17 (2003) 87-104] (2003) (0)
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax (2023) (0)
Recognition ********* a dynamic network decoder design for large vocabulary speech recognition (1994) (0)
Multi-level representations in speech processing in brain and machine: Evidence from EMEG and RSA (2016) (0)
Distribution-Based Emotion Recognition in Conversation (2022) (0)
Mandarin Chinese LVCSR System for Speech Translation (2015) (0)
Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation (2022) (0)
Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring (2022) (0)
Self-Supervised Learning-Based Source Separation for Meeting Data (2023) (0)
Formant tracking using continuous density hidden markov models (1986) (0)
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition (2023) (0)
Application of an auditory model to the computer simulation of hearing impairment: preliminary results (1993) (0)
SLAM 2013 Speech, Language and Audio in Multimedia (2013) (0)
PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE (2015) (0)

This paper list is powered by the following services:

What Schools Are Affiliated With Philip C. Woodland?

Philip C. Woodland is affiliated with the following schools:

University of Cambridge

Philip C. Woodland's Academic­Influence.com Rankings

Why Is Philip C. Woodland Influential?

Philip C. Woodland's Published Works

Published Works

What Schools Are Affiliated With Philip C. Woodland?

Philip C. Woodland's AcademicInfluence.com Rankings