Philip C. Woodland
#125,306
Most Influential Person Now
Philip C. Woodland's AcademicInfluence.com Rankings
Philip C. Woodlandengineering Degrees
Engineering
#4088
World Rank
#5253
Historical Rank
Applied Physics
#921
World Rank
#943
Historical Rank

Download Badge
Engineering
Why Is Philip C. Woodland Influential?
(Suggest an Edit or Addition)Philip C. Woodland's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models (1995) (2626)
- The HTK book (1995) (2157)
- The HTK book version 3.4 (2006) (1059)
- Minimum Phone Error and I-smoothing for improved discriminative training (2002) (816)
- Tree-based state tying for high accuracy acoustic modelling (1994) (783)
- Mean and variance adaptation within the MLLR framework (1996) (494)
- Large scale discriminative training of hidden Markov models for speech recognition (2002) (368)
- Large vocabulary continuous speech recognition using HTK (1994) (308)
- Posterior probability decoding, confidence estimation and system combination (2000) (290)
- MMIE training of large vocabulary recognition systems (1997) (212)
- Flexible speaker adaptation using maximum likelihood linear regression (1995) (190)
- Speaker adaptation for continuous density HMMs: a review (2001) (183)
- Large vocabulary decoding and confidence estimation using word posterior probabilities (2000) (174)
- A One Pass Decoder Design For Large Vocabulary Recognition (1994) (159)
- The 1994 HTK large vocabulary speech recognition system (1995) (147)
- Large scale discriminative training for speech recognition (2000) (146)
- The use of state tying in continuous speech recognition (1993) (135)
- A variable-length category-based n-gram language model (1996) (135)
- The MGB challenge: Evaluating multi-genre broadcast media recognition (2015) (127)
- Speaker adaptation of continuous density HMMs using multivariate linear regression (1994) (121)
- Tree-Based State Tying for High Accuracy Modelling (1994) (121)
- A computational model of the auditory periphery for speech and hearing research. II. Descending paths. (1994) (119)
- State clustering in hidden Markov model-based continuous speech recognition (1994) (118)
- Consensus Network Decoding for Statistical Machine Translation System Combination (2007) (113)
- Spoken Document Retrieval for TREC-8 at Cambridge University (1998) (110)
- Progress in the CU-HTK broadcast news transcription system (2006) (107)
- Speaker adaptation of HMMs using linear regression (1994) (106)
- Segment generation and clustering in the HTK broadcast news transcription system (1998) (105)
- A computational model of the auditory periphery for speech and hearing research. I. Ascending path. (1994) (100)
- The use of prosody in a combined system for punctuation generation and speech recognition (2001) (99)
- The 1998 HTK system for transcription of conversational telephone speech (1999) (98)
- Improving broadcast news transcription by lightly supervised discriminative training (2004) (97)
- Efficient lattice rescoring using recurrent neural network language models (2014) (96)
- Recurrent neural network language model adaptation for multi-genre broadcast speech recognition (2015) (93)
- Experiments in speaker normalisation and adaptation for large vocabulary speech recognition (1997) (92)
- The Cambridge University March 2005 speaker diarisation system (2005) (91)
- Investigation of multilingual deep neural networks for spoken term detection (2013) (91)
- Improved neural network based language modelling and adaptation (2010) (89)
- An investigation into vocal tract length normalisation (1999) (88)
- Multilingual representations for low resource speech recognition and keyword search (2015) (87)
- Broadcast news transcription using HTK (1997) (85)
- The Cambridge University spoken document retrieval system (1999) (84)
- Structural metadata research in the EARS program (2005) (84)
- The HTK tied-state continuous speech recogniser (1993) (83)
- CUED-RNNLM — An open-source toolkit for efficient training and evaluation of recurrent neural network language models (2016) (82)
- Effects of out of vocabulary words in spoken document retrieval (poster session) (2000) (81)
- Using accent-specific pronunciation modelling for robust speech recognition (1996) (79)
- Recurrent neural network language model training with noise contrastive estimation for speech recognition (2015) (79)
- Comparison of part-of-speech and automatically derived category-based language models for speech recognition (1998) (79)
- A hidden Markov-model-based trainable speech synthesizer (1999) (78)
- Improving environmental robustness in large vocabulary speech recognition (1996) (77)
- Lattice-based discriminative training for large vocabulary speech recognition (1996) (76)
- Very deep convolutional neural networks for robust speech recognition (2016) (75)
- Adaptation of deep neural network acoustic models using factorised i-vectors (2014) (74)
- Unsupervised training and directed manual transcription for LVCSR (2010) (73)
- Multilingual large vocabulary speech recognition: the European SQALE project (1997) (72)
- The development of the 1996 HTK broadcast news transcription system (1996) (72)
- Training LVCSR systems on thousands of hours of data (2005) (68)
- Speaker adaptation: techniques and challenges (1999) (66)
- Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch (2014) (65)
- System combination and score normalization for spoken term detection (2013) (65)
- The development of the HTK Broadcast News transcription system: An overview (2002) (65)
- Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation (1996) (64)
- MMI-MAP and MPE-MAP for acoustic model adaptation (2003) (63)
- Development of the 2003 CU-HTK conversational telephone speech transcription system (2004) (62)
- Iterative unsupervised adaptation using maximum likelihood linear regression (1996) (61)
- DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions (2016) (60)
- Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models (1997) (60)
- Improvements in linear transform based speaker adaptation (2001) (60)
- The development of the 1994 HTK large vocabulary speech recognition system (1995) (59)
- WSJCAM0 corpus and recording description (1994) (59)
- Discriminative map for acoustic model adaptation (2003) (58)
- Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition (2016) (56)
- Frame discrimination training for HMMs for large vocabulary speech recognition (1999) (56)
- A high-performance Cantonese keyword search system (2013) (56)
- Parameterised sigmoid and reLU hidden activation functions for DNN acoustic modelling (2015) (54)
- Combining Information Sources for Confidence Estimation with CRF Models (2011) (54)
- Speaker clustering using direct maximisation of the MLLR-adapted likelihood (1998) (54)
- The 1997 HTK broadcast news transcription system (1998) (54)
- Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages (2015) (51)
- Using VTLN for broadcast news transcription (2004) (50)
- Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models (2016) (50)
- Improvements in an HMM-based speech synthesiser (1995) (50)
- THE CU-HTK MARCH 2000 HUB5E TRANSCRIPTION SYSTEM (2000) (49)
- Unsupervised Training for Mandarin Broadcast News and Conversation Transcription (2007) (49)
- Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition (1997) (49)
- Hidden Markov models using vector linear prediction and discriminative output distributions (1992) (48)
- Discriminative adaptive training using the MPE criterion (2003) (48)
- MPE-based discriminative linear transform for speaker adaptation (2004) (48)
- Variance compensation within the MLLR framework (1996) (46)
- Flexible speaker adaptation for large vocabulary speech recognition (1995) (44)
- A rule-based named entity recognition system for speech input (2000) (44)
- Improved discriminative training techniques for large vocabulary continuous speech recognition (2001) (43)
- Use of contexts in language model interpolation and adaptation (2009) (43)
- Transcription of multi-genre media archives using out-of-domain data (2012) (43)
- Training and adapting MLP features for Arabic speech recognition (2009) (42)
- The Cu-Htk Mandarin Broadcast News Transcription System (2006) (42)
- A combined punctuation generation and speech recognition system and its performance enhancement using prosody (2003) (41)
- Large scale MMIE training for conversational telephone speech recognition (2000) (41)
- The use of accent-specific pronunciation dictionaries in acoustic model training (1998) (40)
- A method for direct audio search with applications to indexing and retrieval (2000) (38)
- Particle-based language modelling (2000) (38)
- Cambridge university transcription systems for the multi-genre broadcast challenge (2015) (37)
- Automatic speech synthesiser parameter estimation using HMMs (1995) (36)
- Unsupervised language model adaptation for Mandarin broadcast conversation transcription (2006) (35)
- Speaker adaptation using lattice-based MLLR (2001) (35)
- Standalone training of context-dependent deep neural network acoustic models (2014) (35)
- A PLSA-based language model for conversational telephone speech (2004) (34)
- Variable-length categoryn-gram language models (1999) (34)
- Discriminative Neural Clustering for Speaker Diarisation (2019) (33)
- Combination of word-based and category-based language models (1996) (33)
- Language modelling for Russian and English using words and classes (2003) (32)
- The 1998 HTK broadcast news transcription system: development and results (1999) (32)
- A general artificial neural network extension for HTK (2015) (32)
- Recent advances in broadcast news transcription (2003) (32)
- Experiments in broadcast news transcription (1998) (31)
- Automatic Transcription of Multi-genre Media Archives (2013) (31)
- The efficient incorporation of MLP features into automatic speech recognition systems (2011) (30)
- Improving the training and evaluation efficiency of recurrent neural network language models (2015) (30)
- The HTK large vocabulary recognition system for the 1995 ARPA H3 task (1996) (30)
- Syllable language models for Mandarin speech recognition: exploiting character language models. (2013) (29)
- The development of the Cambridge University RT-04 diarisation system (2004) (29)
- Automatic complexity control for HLDA systems (2003) (29)
- Development of a phonetic system for large vocabulary Arabic speech recognition (2007) (29)
- Segmentation and classification of broadcast news audio (1998) (28)
- Comparison of language modelling techniques for Russian and English (1998) (28)
- Spontaneous speech recognition for the credit card corpus using the HTK toolkit (1994) (27)
- Dynamic HMM selection for continuous speech recognition (1999) (27)
- General query expansion techniques for spoken document retrieval (1999) (27)
- Discriminative linear transforms for speaker adaptation (2001) (27)
- Speaker Diarisation Using 2D Self-attentive Combination of Embeddings (2019) (27)
- Language model cross adaptation for LVCSR system combination (2013) (27)
- Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system (2005) (26)
- Optimising hidden Markov models using discriminative output distributions (1991) (26)
- Handbook of Natural Language Processing and Machine Translation (2012) (26)
- Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech (2004) (26)
- MPE-based discriminative linear transforms for speaker adaptation (2008) (26)
- Tree-based state clustering for large vocabulary speech recognition (1994) (25)
- Automatic transcription of conversational telephone speech (2005) (25)
- Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition (2020) (24)
- Design of fast LVCSR systems (2003) (23)
- Efficient class-based language modelling for very large vocabularies (2001) (23)
- Development of the CU-HTK 2004 broadcast news transcription systems (2005) (23)
- Context dependent language model adaptation (2008) (23)
- New features in the CU-HTK system for transcription of conversational telephone speech (2001) (22)
- Morphological decomposition in Arabic ASR systems (2012) (22)
- Spoken document representations for probabilistic retrieval (2000) (22)
- Emotion Recognition by Fusing Time Synchronous and Time Asynchronous Representations (2020) (22)
- Speech Recognition System Combination for Machine Translation (2007) (22)
- Unsupervised Adaptation With Discriminative Mapping Transforms (2009) (21)
- Improved Tdnns Using Deep Kernels and Frequency Dependent Grid-RNNS (2018) (21)
- Morphological analysis and decomposition for Arabic speech-to-text systems (2009) (20)
- Rapid speaker adaptation using model prediction (1995) (20)
- An investigation into the the interactions between speaker diarisation systems and automatic speech transcription (2003) (20)
- Audio Indexing and Retrieval of Complete Broadcoast News Shows (2000) (20)
- Improving lightly supervised training for broadcast transcription (2013) (20)
- Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems (2016) (18)
- Language model combination and adaptation usingweighted finite state transducers (2010) (18)
- Automatic capitalisation generation for speech input (2004) (18)
- Speaker diarisation and longitudinal linking in multi-genre broadcast data (2015) (17)
- The development of the cambridge university alignment systems for the multi-genre broadcast challenge (2015) (17)
- Large vocabulary multilingual speech recognition using HTK (1995) (17)
- Improved DNN-based segmentation for multi-genre broadcast audio (2016) (17)
- Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems (2011) (16)
- The CUHTK-entropic 10xRT broadcast news transcription system (1999) (16)
- Recent improvements to the Cambridge Arabic Speech-to-Text systems (2010) (16)
- Detecting deletions in ASR output (2014) (16)
- Weight limiting, weight quantisation and generalisation in multi-layer perceptrons (1989) (16)
- Modelling word-pair relations in a category-based language model (1997) (16)
- Multi-Span Acoustic Modelling using Raw Waveform Signals (2019) (15)
- The Cambridge University Multimedia Document Retrieval Demo System (2000) (15)
- Benchmark DARPA RM results using the HTK portable HMM toolkit (1992) (15)
- High Order Recurrent Neural Networks for Acoustic Modelling (2018) (15)
- The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation (2015) (14)
- Cambridge University Engineering Department (1994) (14)
- Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection (2006) (14)
- Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training (2017) (14)
- Phonetic pronunciations for arabic speech-to-text systems (2008) (14)
- A confidence-based approach for improving keyword hypothesis scores (2013) (14)
- Improved cross-task recognition using MMIE training (2002) (13)
- Integrating Source-Channel and Attention-Based Sequence-to-Sequence Models for Speech Recognition (2019) (13)
- Using relative duration in large vocabulary speech recognition (1993) (13)
- Porting: SwitchBoard to the VoiceMail task (2003) (13)
- Investigation of acoustic units for LVCSR systems (2011) (12)
- Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation (2011) (12)
- CU-HTK April 2002 Switchboard System (2002) (12)
- Paraphrastic language models (2014) (12)
- Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio (2007) (12)
- Unsupervised discriminative adaptation using discriminative mapping transforms (2008) (12)
- I-vector estimation using informative priors for adaptation of deep neural networks (2015) (11)
- Information Retrieval from Unsegmented Broadcast News Audio (2001) (11)
- Isolated word speech recognition based on connectionist techniques (1990) (11)
- Automatic transcription of conversational telephone speech: development of the CU-HTK 2002 system (2003) (11)
- Modelling syllable characteristics to improve a large vocabulary continuous speech recogniser (1994) (10)
- Paraphrastic recurrent neural network language models (2015) (10)
- Combination of Deep Speaker Embeddings for Diarisation (2020) (10)
- PyHTK: Python Library and ASR Pipelines for HTK (2019) (10)
- Improving Lightly Supervised Training for Broadcast Transcriptions (2013) (10)
- Investigation of back-off based interpolation between recurrent neural network and n-gram language models (2015) (10)
- Exploiting Chinese character models to improve speech recognition performance (2009) (10)
- Development of the 2004 CU-HTK English CTS systems using more than two thousand hours of data (2004) (10)
- Improvements in accuracy and speed in the HTK broadcast news transcription system (1999) (10)
- Knowledge Distillation for Neural Transducers from Large Self-Supervised Pre-Trained Models (2021) (9)
- Tree-Constrained Pointer Generator for End-to-End Contextual Speech Recognition (2021) (9)
- Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem (2017) (8)
- An experimental comparison of connectionist and conventional classification systems on natural data (1990) (8)
- Efficient generation and use of MLP features for Arabic speech recognition (2009) (8)
- Residual Energy-Based Models for End-to-End Speech Recognition (2021) (8)
- Modelling sub-phone insertions and deletions in continuous speech recognition (2000) (8)
- System combination with log-linear models (2016) (7)
- DEVELOPING KEYWORD SEARCH UNDER THE IARPA BABEL PROGRAM (2013) (7)
- Improving retrieval on imperfect speech transcriptions (poster abstract) (1999) (7)
- I-Vectors and Structured Neural Networks for Rapid Adaptation of Acoustic Models (2017) (6)
- Recent Progress in Large Vocabulary Continuous Speech Recognition: An HTK Perspective (2006) (6)
- Sequence training of DNN acoustic models with natural gradient (2017) (6)
- DEVELOPMENT OF THE CUHTK 2004 RT 04 F MANDARIN CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTION SYSTEM (2004) (6)
- Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition (2021) (5)
- Discriminative language model adaptation for Mandarin broadcast speech transcription and translation (2007) (5)
- The Cambridge Multimedia Document Retrieval Project: summary of experiments (2001) (5)
- RETRIEVAL FOR TREC-9 AT CAMBRIDGE UNIVERSITY (2001) (5)
- Comparative evaluation of word- and category-based language models (1996) (5)
- Word-pair relations for category-based language models (1997) (5)
- Maximum mutual information training of hidden Markov models with vector linear predictors (2002) (5)
- Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems (2018) (5)
- Complementary Phone Error Training (2012) (5)
- Improved Large-Margin Softmax Loss for Speaker Diarisation (2019) (5)
- A dynamic network decoder design for large vocabulary speech recognition (1994) (5)
- Using Sub-word-level Information for Confidence Estimation with Conditional Random Field Models (2012) (4)
- HTK V1.5: User, Reference and Programmer Manuals (1993) (4)
- Variable-length category-based n-grams for language modelling (1995) (4)
- A wave digital filter model of the entire auditory periphery (1993) (4)
- Hidden Markov models using shared vector linear predictors (1993) (4)
- A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training (2021) (4)
- Implementation of automatic capitalisation generation systems for speech input (2002) (4)
- The Cambridge University Multimedia Document Retrieval demo system (demonstration session) (2000) (4)
- Spoken language systems technology workshop (1995) (4)
- Spoken Alphabet Recognition Using Multilayer Perceptrons (1992) (4)
- Speech analysis using a nonlinear cochlear model with feedback regulation (1992) (4)
- Network representation of the middle and inner ear in a composite model of the auditory periphery (1992) (4)
- Combining Natural Gradient with Hessian Free Methods for Sequence Training (2018) (4)
- Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription (2022) (4)
- Word-to-category backoff language models (1996) (3)
- Paraphrastic language models and combination with neural network language models (2013) (3)
- Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors (2022) (3)
- The RT04 evaluation structural metadata systems at CUED (2004) (3)
- Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings (2020) (3)
- Semi-tied Units for Efficient Gating in LSTM and Highway Networks (2018) (3)
- Discriminative optimisation of large vocabulary recognition systems (1996) (3)
- WSJCAM 0 Corpus and Recording (2007) (3)
- Graphone Model Interpolation and Arabic Pronunciation Generation (2011) (3)
- Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition (2021) (2)
- Content-Aware Speaker Embeddings for Speaker Diarisation (2021) (2)
- Improving Speech Transcription for Mandarin-English Translation (2007) (2)
- A neural network speech recogniser for directory access applications (1990) (2)
- A composite model of the auditory periphery with feedback regulation (1992) (2)
- SU Detection for RT-03f at Cambridge University (2003) (2)
- Exploiting variable-width features in large vocabulary speech recognition (1993) (2)
- Fixed dimension classifiers for speech recognition (1990) (2)
- The HTK large vocabulary continuous speech recognition system: an overview (1994) (2)
- Minimising Biasing Word Errors for Contextual ASR With the Tree-Constrained Pointer Generator (2022) (2)
- Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition (2021) (2)
- Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition (2022) (1)
- Direct sub-word confidence estimation with hidden-state conditional random fields (2014) (1)
- Cross-domain paraphrasing for improving language modelling using out-of-domain data (2013) (1)
- Recent developments in the HTK continuous speech recognition system (1994) (1)
- Recent developments at Cambridge in broadcast news transcription (2004) (1)
- Variable Frame Rate Acoustic Models Using Minimum Error Reinforcement Learning (2021) (1)
- Corrections to "Automatic Transcription of Conversational Telephone Speech" (2006) (1)
- RETRIEVAL FOR TREC-7 AT CAMBRIDGE UNIVERSITY (1999) (1)
- RETRIEVAL FOR TREC-8 AT CAMBRIDGE UNIVERSITY-DRAFT (2000) (1)
- Cross-Utterance Language Models with Acoustic Error Sampling (2020) (1)
- Improving retrieval on imperfect speech transcriptions (1999) (1)
- Supplementary data for "Parameterised Sigmoid and ReLU HiddenActivation Functions for DNN Acoustic Modelling" (2015) (1)
- Paraphrastic neural network language models (2014) (1)
- Speech pattern recognition using pattern recognizers and classifiers (1999) (1)
- Biased Self-supervised learning for ASR (2022) (1)
- Cluster identification for speaker-environment tracking (2002) (0)
- Data underpining "System Combination with Log-linear Models" (2016) (0)
- Self-Supervised Representations in Speech-Based Depression Detection (2023) (0)
- Proceedings of INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association, Lyon, France, August 25-29, 2013 (2013) (0)
- Supplementary data for "Speaker Diarisation and Linking in Multi-Genre Broadcast Data" (2015) (0)
- Progress in Broadcast News English Transcription (2004) (0)
- Data Underpinning "Joint Optimisation of Tandem Systems Using Gaussian Mixture Density Neural Network Discriminative Sequence Training" (2017) (0)
- End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator (2022) (0)
- RECURRENT NEURAL NETWORKS FOR ACOUSTIC MODELLING (2018) (0)
- Data underpinning "I-Vector Estimation Using Informative Priors for Adaptation of Deep Neural Networks" (2015) (0)
- Erratum: Language modelling for Russian and English using words and classes [Computer Speech and Language 17 (2003) 87-104] (2003) (0)
- Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax (2023) (0)
- Recognition ********* a dynamic network decoder design for large vocabulary speech recognition (1994) (0)
- Multi-level representations in speech processing in brain and machine: Evidence from EMEG and RSA (2016) (0)
- Distribution-Based Emotion Recognition in Conversation (2022) (0)
- Mandarin Chinese LVCSR System for Speech Translation (2015) (0)
- Spectral Clustering-aware Learning of Embeddings for Speaker Diarisation (2022) (0)
- Combining hybrid DNN-HMM ASR systems with attention-based models using lattice rescoring (2022) (0)
- Self-Supervised Learning-Based Source Separation for Meeting Data (2023) (0)
- Formant tracking using continuous density hidden markov models (1986) (0)
- Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition (2023) (0)
- Application of an auditory model to the computer simulation of hearing impairment: preliminary results (1993) (0)
- SLAM 2013 Speech, Language and Audio in Multimedia (2013) (0)
- PARAPHRASTIC RECURRENT NEURAL NETWORK LANGUAGE (2015) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Philip C. Woodland?
Philip C. Woodland is affiliated with the following schools: