Stephen Renals

Q: What Schools Are Affiliated With Stephen Renals?

Stephen Renals is affiliated with the following schools: University of Sheffield, University of Edinburgh, University of Cambridge.

Stephen Renals's AcademicInfluence.com Rankings

Computer Science: World Rank #6819, Historical Rank #7184
Computational Linguistics: World Rank #1260, Historical Rank #1274
Artificial Intelligence: World Rank #2613, Historical Rank #2654
Database: World Rank #3900, Historical Rank #4057

Why Is Stephen Renals Influential?

According to Wikipedia, Stephen Renals of the University of Edinburgh, UK, was named a Fellow of the Institute of Electrical and Electronics Engineers (IEEE) in 2014 for contributions to speech recognition technology and its use in spoken language processing.

Stephen Renals's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization (2005) (474)
WSJCAM0: a British English speech corpus for large vocabulary continuous speech recognition (1995) (318)
Connectionist probability estimators in HMM speech recognition (1994) (301)
Extractive summarization of meeting recordings (2005) (270)
Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models (2014) (242)
Convolutional Neural Networks for Distant Speech Recognition (2014) (227)
Speaker verification using sequence discriminant support vector machines (2005) (223)
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system (1995) (219)
Multilingual training of deep neural networks (2013) (211)
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (205)
The AMI System for the Transcription of Speech in Meetings (2005) (184)
Multiplicative LSTM for sequence modelling (2016) (174)
Machine Learning for Multimodal Interaction, 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers (2008) (157)
Phoneme classification experiments using radial basis functions (1989) (148)
Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR (2012) (141)
Recognition and understanding of meetings: the AMI and AMIDA projects (2007) (141)
Handbook of Phonetic Sciences (2010) (138)
Punctuation annotation using statistical prosody models. (2001) (136)
The use of recurrent neural networks in continuous speech recognition (1996) (131)
The MGB challenge: Evaluating multi-genre broadcast media recognition (2015) (127)
A study of speaker adaptation for DNN-based speech synthesis (2015) (126)
Speech and crosstalk detection in multichannel audio (2005) (125)
Sentence Boundary Detection in Broadcast Speech Transcripts (2000) (124)
Proceedings of the Ninth Text REtrieval Conference (2001) (121)
Human-computer dialogue simulation using hidden Markov models (2005) (121)
On training the recurrent neural network encoder-decoder for large vocabulary end-to-end speech recognition (2016) (119)
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition (2013) (115)
Automatic Dialect Detection in Arabic Broadcast Speech (2015) (113)
Dynamic Evaluation of Neural Sequence Models (2017) (113)
Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation (2016) (109)
Incorporating Speaker and Discourse Features into Speech Summarization (2006) (106)
Practical Identifiability of Finite Mixtures of Multivariate Bernoulli Distributions (2000) (106)
8th Annual Conference of the International Speech Communication Association (2007) (101)
Speech Recognition Using Augmented Conditional Random Fields (2009) (98)
Acoustic-Articulatory Modeling With the Trajectory HMM (2008) (94)
Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction (2012) (90)
Open Challenges in Modelling, Analysis and Synthesis of Human Behaviour in Human–Human and Human–Machine Interactions (2015) (88)
CDNN: a context dependent neural network for continuous speech recognition (1992) (86)
Segmental Recurrent Neural Networks for End-to-End Speech Recognition (2016) (84)
Speech recognition challenge in the wild: Arabic MGB-3 (2017) (81)
A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition (2015) (81)
Automatic summarization of voicemail messages using lexical and prosodic features (2005) (78)
Indexing and retrieval of broadcast news (2000) (76)
The 2005 AMI System for the Transcription of Speech in Meetings (2005) (76)
Towards an improved modeling of the glottal source in statistical parametric speech synthesis (2007) (76)
The MGB-2 challenge: Arabic multi-dialect broadcast media recognition (2016) (75)
Evaluation of kernel methods for speaker verification and identification (2002) (75)
Dynamic Bayesian networks for meeting structuring (2004) (75)
Evaluation of a hierarchical reinforcement learning spoken dialogue system (2010) (72)
Confidence measures for hybrid HMM/ANN speech recognition (1997) (72)
Connectionist probability estimation in the DECIPHER speech recognition system (1992) (71)
SVMSVM: support vector machine speaker verification methodology (2003) (70)
Evaluating Automatic Summaries of Meeting Recordings (2005) (70)
Connectionist speech recognition of Broadcast News (2002) (70)
Ageing Voices: The Effect of Changes in Voice Parameters on ASR Performance (2010) (69)
Longitudinal study of ASR performance on ageing voices (2008) (69)
Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems (1997) (68)
Glottal spectral separation for parametric speech synthesis (2008) (64)
Deep Architectures for Articulatory Inversion (2012) (64)
Retrieval of broadcast news documents with the THISL system (1998) (64)
A study of network dynamics (1990) (62)
Automatic Segmentation of Multiparty Dialogue (2006) (61)
Automatic Meeting Segmentation Using Dynamic Bayesian Networks (2007) (61)
Confidence measures from local posterior probability estimates (1999) (55)
ASR system modeling for automatic evaluation and optimization of dialogue systems (2002) (54)
Radial basis function network for speech pattern classification (1989) (54)
Knowledge distillation for small-footprint highway networks (2016) (52)
Transcription of conference room meetings: an investigation (2005) (51)
Efficient search using posterior phone probability estimates (1995) (51)
Revisiting hybrid and GMM-HMM system combination techniques (2013) (51)
HMM-based speech synthesiser using the LF-model of the glottal source (2011) (50)
A Deep Neural Network for Acoustic-Articulatory Speech Inversion (2011) (49)
Accessing the spoken word (2005) (49)
Improving Children's Speech Recognition Through Out-of-Domain Data Augmentation (2016) (49)
IPA: improved phone modelling with recurrent neural networks (1994) (48)
Evaluating speech synthesis intelligibility using Amazon Mechanical Turk (2010) (48)
From Text Summarisation to Style-Specific Summarisation for Broadcast News (2004) (47)
Multi-level adaptive networks in tandem and hybrid ASR systems (2013) (46)
Neural networks for distant speech recognition (2014) (46)
Regularization of context-dependent deep neural networks with context-independent multi-task training (2015) (46)
Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features (2017) (46)
Extrinsic summarization evaluation: A decision audit task (2008) (46)
Audio information access from meeting rooms (2003) (45)
Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition (2013) (44)
Term-Weighting for Summarization of Multi-party Spoken Dialogues (2007) (44)
Speech Input from Older Users in Smart Environments: Challenges and Perspectives (2009) (43)
Start-synchronous search for large vocabulary continuous speech recognition (1999) (43)
Transcription of multi-genre media archives using out-of-domain data (2012) (43)
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition (2008) (43)
The THISL broadcast news retrieval system. (1999) (41)
A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the WERNICKE project (1993) (41)
Interpretation of Multiparty Meetings: The AMI and AMIDA Projects (2008) (40)
Hierarchical Bayesian Language Models for Conversational Speech Recognition (2010) (39)
Chaos in Neural Networks (1990) (39)
Modelling acoustic feature dependencies with artificial neural networks: Trajectory-RNADE (2015) (39)
The 1994 Abbot hybrid connectionist-HMM large vocabulary recognition system. (1995) (38)
A digital microphone array for distant speech recognition (2010) (38)
Multimodal Integration for Meeting Group Action Segmentation and Recognition (2005) (38)
Predicting tongue shapes from a few landmark locations (2008) (37)
Content-based access to spoken audio (2005) (37)
Reinforcement learning of dialogue strategies with hierarchical abstract machines (2006) (37)
Document space models using latent semantic analysis (1997) (36)
Dynamic Evaluation of Transformer Language Models (2019) (36)
Hierarchical Pitman-Yor language models for ASR in meetings (2007) (36)
Word Error Rate Estimation for Speech Recognition: e-WER (2018) (36)
Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN (2008) (36)
Regularized subspace Gaussian mixture models for cross-lingual speech recognition (2011) (35)
Are extractive text summarisation techniques portable to broadcast news? (2003) (35)
Differentiable pooling for unsupervised speaker adaptation (2015) (34)
Glottal Spectral Separation for Speech Synthesis (2014) (34)
An Overview of the SPRACH System for the Transcription of Broadcast News (1999) (33)
Transcription and summarization of voicemail speech (2000) (32)
Recent improvements to the ABBOT large vocabulary CSR system (1995) (32)
Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches (2016) (32)
The THISL SDR System At TREC-8 (1999) (31)
Automatic Transcription of Multi-genre Media Archives (2013) (31)
Unsupervised Adaptation of Recurrent Neural Network Language Models (2016) (30)
European Language Grid: An Overview (2020) (30)
The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech (2019) (30)
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview (2020) (29)
Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models (2017) (28)
Maximum entropy segmentation of broadcast news (2005) (28)
Structured output layer with auxiliary targets for context-dependent acoustic modelling (2015) (28)
Neural nets and hidden Markov models: Review and generalizations (1991) (28)
On Learning Interpretable CNNs with Parametric Modulated Kernel-Based Filters (2019) (28)
Recognition and interpretation of meetings: The AMI and AMIDA projects (2007) (27)
Large vocabulary continuous speech recognition using a hybrid connectionist-HMM system (1994) (27)
Multimodal Signal Processing (2012) (27)
Efficient evaluation of the LVCSR search space using the NOWAY decoder (1996) (26)
Dimensionality reduction of electropalatographic data using latent variable models (1998) (26)
Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition (2014) (26)
End-to-End Neural Segmental Models for Speech Recognition (2017) (25)
Decoder technology for connectionist large vocabulary speech recognition (1995) (25)
INTERSPEECH 2010 11th Annual Conference of the International Speech Communication Association (2010) (25)
Information extraction from broadcast news (2000) (25)
Regularized Subspace Gaussian Mixture Models for Speech Recognition (2011) (24)
Proceedings of the 9th European Conference on Speech Communication and Technology (2003) (24)
Hybrid Neural Network/Hidden Markov Model Systems for Continuous Speech Recognition (1993) (24)
The Role of Prosody in a Voicemail Summarization System (2001) (23)
A lecture transcription system combining neural network acoustic and language models (2013) (23)
Small-Footprint Deep Neural Networks with Highway Connections for Speech Recognition (2015) (23)
DBN Based Joint Dialogue Act Recognition of Multiparty Meetings (2007) (23)
Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition (2012) (23)
Detecting summarization hot spots in meetings using group level involvement and turn-taking features (2013) (22)
The THISL Spoken Document Retrieval System (1998) (22)
SAT-LHUC: Speaker adaptive training for learning hidden unit contributions (2016) (22)
The UEDIN systems for the IWSLT 2012 evaluation (2012) (21)
Probabilistic Linear Discriminant Analysis for Acoustic Modeling (2014) (20)
Confidence Measures for Evaluating Pronunciation Models (1998) (20)
Proceedings of the 6th International Conference on Spoken Language Processing (2000) (20)
Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces (2006) (20)
Prosodic Correlates of Rhetorical Relations (2006) (20)
Detecting Action Items in Meetings (2008) (19)
Probability estimation by feed-forward networks in continuous speech recognition (1991) (19)
Named entity tagged language models (1999) (19)
Hierarchical dialogue optimization using semi-Markov decision processes (2007) (18)
Neural net word representations for phrase-break prediction without a part of speech tagger (2014) (18)
Applying vocal tract length normalization to meeting recordings (2005) (18)
Recognition, indexing and retrieval of British broadcast news with the THISL system (1999) (18)
Topic-based mixture language modelling (1999) (18)
Tal: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos (2020) (17)
Spoken dialogue interfaces for older people (2012) (17)
Extractive summarization of voicemail using lexical and prosodic feature subset selection (2001) (16)
Modelling Participant Affect in Meetings with Turn-Taking Features (2013) (16)
Using Prosodic Features in Language Models for Meetings (2007) (16)
Small-Footprint Highway Deep Neural Networks for Speech Recognition (2016) (16)
Learning phoneme recognition using neural networks (1989) (16)
Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models (2013) (16)
Hierarchical Recurrent Neural Network for Story Segmentation (2017) (16)
Age recognition for spoken dialogue systems: do we need it? (2009) (15)
Multistream Dynamic Bayesian Network for Meeting Segmentation (2004) (15)
Proceedings of the Human Language Technology Conference of the NAACL, Main Conference (2006) (15)
Untranscribed Web Audio for Low Resource Speech Recognition (2019) (15)
Speaker Adaptive Training Using Model Agnostic Meta-Learning (2019) (15)
Recognition of overlapping speech using digital MEMS microphone arrays (2013) (15)
On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers (2020) (14)
Embeddings for DNN Speaker Adaptive Training (2019) (14)
Channel Adversarial Training for Speaker Verification and Diarization (2019) (14)
Determining the number of speakers in a meeting using microphone array features (2012) (14)
Unsupervised language model adaptation based on topic and role information in multiparty meetings (2008) (14)
Multi-reference WER for evaluating ASR for languages with no orthographic rules (2015) (14)
Automatic Segmentation and Summarization of Meeting Speech (2007) (14)
Phone deactivation pruning in large vocabulary continuous speech recognition (1996) (13)
The ambient spotlight: queryless desktop search from meeting speech (2010) (13)
The UEDIN English ASR system for the IWSLT 2013 evaluation (2013) (13)
Meta Comments for Summarizing Meeting Speech (2008) (13)
Lightly supervised automatic subtitling of weather forecasts (2013) (13)
Windowed Attention Mechanisms for Speech Recognition (2019) (13)
Confidence measures derived from an acceptor HMM (1998) (13)
Feature selection for the classification of crosstalk in multi-channel audio (2003) (12)
A system for automatic alignment of broadcast media captions using weighted finite-state transducers (2015) (12)
Speaker-independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech (2019) (12)
Baseline IE-NE experiments using the SPRACH/LaSIE system (1999) (12)
A Cascaded Broadcast News Highlighter (2008) (12)
Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition (2020) (12)
Adaptation Algorithms for Speech Recognition: An Overview (2020) (12)
Recognition and understanding of meetings (2010) (12)
The UEDIN ASR systems for the IWSLT 2014 evaluation (2014) (12)
Cross-lingual adaptation with multi-task adaptive networks (2014) (11)
A hybrid Maxent/HMM based ASR system (2005) (11)
Complementary tasks for context-dependent deep neural network acoustic models (2015) (11)
On the Robustness and Training Dynamics of Raw Waveform Models (2020) (11)
Cross Lingual Transfer Learning for Zero-Resource Domain Adaptation (2019) (11)
Speech and neural network dynamics (1990) (11)
Multi-Scale Octave Convolutions for Robust Speech Recognition (2019) (11)
The AMI Meeting Transcription System (2007) (11)
Ultrax: An Animated Midsagittal Vocal Tract Display for Speech Therapy (2012) (11)
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV (2007) (11)
Simplifying very deep convolutional neural network architectures for robust speech recognition (2017) (11)
A comparative study of continuous speech recognition using neural networks and hidden Markov models (1991) (11)
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project (2013) (10)
Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup (2013) (10)
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers (2006) (10)
Text- and Speech-Triggered Information Access (2003) (10)
Prosodically-enhanced recurrent neural network language models (2015) (10)
Analyzing Deep CNN-Based Utterance Embeddings for Acoustic Model Adaptation (2018) (10)
Automated production of true-cased punctuated subtitles for weather and news broadcasts (2014) (10)
Acoustic Model Adaptation from Raw Waveforms with Sincnet (2019) (10)
Power law discounting for n-gram language models (2010) (10)
Exploring the style-technique interaction in extractive summarization of broadcast news (2003) (10)
The 1995 ABBOT LVCSR system for multiple unknown microphones (1996) (10)
Towards online speech summarization (2007) (10)
Probabilistic linear discriminant analysis with bottleneck features for speech recognition (2014) (9)
Dialogue act compression via pitch contour preservation (2006) (9)
Differentiable Pooling for Unsupervised Acoustic Model Adaptation (2016) (9)
Modeling Topic and Role Information in Meetings Using the Hierarchical Dirichlet Process (2008) (9)
Incorporating lexical and prosodic information at different levels for meeting summarization (2014) (9)
Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models (2021) (9)
Connectionist Speech Recognition: Status and Prospects (1991) (9)
Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features (2017) (8)
Proc. International Workshop on Spoken Language Translation (2012) (8)
Acoustic confidence measures for segmenting broadcast news (1998) (8)
A Deep 2D Convolutional Network for Waveform-Based Speech Recognition (2020) (8)
WERD: Using social text spelling variants for evaluating dialectal speech recognition (2017) (8)
Transforming access to the spoken word (2004) (8)
Description of the UEDIN system for German ASR (2013) (8)
Speech Acoustic Modelling from Raw Phase Spectrum (2021) (8)
Experimental evaluation of latent variable models for dimensionality reduction (1998) (8)
A latent-variable modelling approach to the acoustic-to-articulatory mapping problem. I (1999) (8)
A parallel training algorithm for hierarchical pitman-yor process language models (2009) (8)
Variable word rate N-grams (2000) (8)
Just-in-time prepared captioning for live transmissions (2016) (7)
Factorised Representations for Neural Network Adaptation to Diverse Acoustic Environments (2017) (7)
Interspeech 2006 - ICSLP (2006) (7)
Multi-stream segmentation of meetings (2004) (7)
Statistical annotation of named entities in spoken audio. (1999) (7)
Tied Probabilistic Linear Discriminant Analysis for Speech Recognition (2014) (7)
European Language Grid: A Joint Platform for the European Language Technology Community (2021) (7)
Acoustic space dimensionality selection and combination using the maximum entropy principle (2004) (7)
Multi-Reference Evaluation for Dialectal Speech Recognition System: A Study for Egyptian ASR (2015) (6)
An Edinburgh Speech Production Facility (2010) (6)
Augmentation of adaptation data (2010) (6)
Statistical Language Modelling (2000) (6)
On the effect of snr and superdirective beamforming in speaker diarisation in meetings (2012) (6)
Feed forward pre-training for recurrent neural network language models (2014) (6)
Lattice-based lightly-supervised acoustic model training (2019) (6)
ROCKIT: Roadmap for Conversational Interaction Technologies (2014) (6)
Distant Speech Recognition Experiments Using the AMI Corpus (2017) (6)
An advanced integrated architecture for wireless voicemail data retrieval (2001) (6)
Recording, Indexing, Summarizing, and Accessing Meeting Videos: An Overview of the AMI Project (2007) (5)
The THISL spoken document retrieval project (1999) (5)
Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models (2019) (5)
9th European Conference on Speech Communication and Technology (Interspeech 2005 - Eurospeech) (2005) (5)
Multistream Recognition of Dialogue Acts in Meetings (2006) (5)
Deep Scattering Power Spectrum Features for Robust Speech Recognition (2020) (5)
Trainable Dynamic Subsampling for End-to-End Speech Recognition (2019) (5)
Synchronising audio and ultrasound by learning cross-modal embeddings (2019) (5)
Silent versus modal multi-speaker speech recognition from ultrasound and video (2021) (5)
The SUMMA Platform Prototype (2017) (5)
Pitch adaptive features for LVCSR (2008) (5)
Noise Compensation for Subspace Gaussian Mixture Models (2012) (5)
A connectionist approach to speech recognition using peripheral auditory modelling (1988) (5)
Spoken Dialogue Management Using Hierarchical Reinforcement Learning and Dialogue Simulation (2005) (4)
Learning Temporal Dependencies in Connectionist Speech Recognition (1993) (4)
Evaluation of extractive voicemail summarization. (2003) (4)
Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors (2021) (4)
Speech Acoustic Modelling Using Raw Source and Filter Components (2021) (4)
Leveraging speaker attribute information using multi task learning for speaker verification and diarization (2020) (4)
THISL spoken document retrieval at TREC-7 (1999) (4)
Phone recognition analysis for trajectory HMM (2006) (4)
Proc. NAACL/HLT (2010) (4)
Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions (2019) (4)
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA Project (2016) (4)
State of the art in Speech Recognition (2017) (4)
Improving statistical speech recognition (1992) (4)
Connectionist Optimisation of Tied Mixture Hidden Markov Models (1991) (3)
Proc IEEE International Conference on Acoustics, Speech and Signal Processing (2015) (3)
Probability estimation in the DECIPHER speech recognition system (1992) (3)
RETRIEVAL SYSTEM (1999) (3)
Recognition and Understanding of Meetings: Overview of the European AMI and AMIDA Projects (2008) (3)
Special issue on searching speech (2012) (3)
Dynamic Evaluation of Neural Sequence Models (2018) (3)
Proc. ICML/UAI/COLT Workshop on Prior Knowledge for Text and Language Processing (2008) (3)
Bayesian regularisation methods in a hybrid MLP-HMM system (1993) (3)
Noise adaptive training for subspace Gaussian mixture models (2013) (3)
Speaker Verification Using Sequence Discriminant (2005) (3)
Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models (2012) (3)
HMM-based Speech Synthesis with an Acoustic Glottal Source Model (2009) (3)
DropClass and DropAdapt: Dropping classes for deep speaker representation learning (2020) (3)
Raw Sign and Magnitude Spectra for Multi-Head Acoustic Modelling (2020) (3)
Word Error Rate Estimation Without ASR Output: e-WER2 (2020) (3)
Integrated transcription and identification of named entities in broadcast speech (1999) (3)
On the Usefulness of Statistical Normalisation of Bottleneck Features for Speech Recognition (2019) (2)
An HMM-based speech synthesiser using glottal post-filtering (2010) (2)
Proc. Interspeech 2009 (2009) (2)
Proceedings of the Rich Transcription 2005 Spring Meeting Recognition Evaluation (2005) (2)
Text- and Speech-Triggered Information Access: 8th ELSNET Summer School, Chios Island, Greece, July 15-30, 2000, Revised Lectures (2003) (2)
Automatic analysis of multiparty meetings (2011) (2)
Neural networks for speech pattern classification (1989) (2)
Roadmap for Conversational Interaction Technologies (2014) (2)
Accessing information in spoken audio (2000) (2)
Automatic Meeting Segmentation Using (2007) (2)
Feature-space speaker adaptation for probabilistic linear discriminant analysis acoustic models (2015) (2)
Automatic audiovisual synchronisation for ultrasound tongue imaging (2021) (2)
Analysis of a simultaneous-speaker sound corpus (1998) (2)
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017) (2017) (2)
Multi-view Dimensionality Reduction for Dialect Identification of Arabic Broadcast Speech (2016) (2)
Dropping Classes for Deep Speaker Representation Learning (2020) (2)
Using gamma filters to model temporal dependencies in speech (1994) (2)
Audio-Visual Processing in Meetings: Seven Questions and Some AMI Answers (2006) (2)
The THISL system for indexing and retrieval of broadcast news (1999) (2)
Text- and Speech-Triggered Information Access: Introduction (2000) (1)
When Can Self-Attention Be Replaced by Feed Forward Layers? (2020) (1)
11th International Workshop on Spoken Language Translation (IWSLT 2014) (2014) (1)
Towards Robust Word Alignment of Child Speech Therapy Sessions (2018) (1)
Proceedings of Interspeech 2014 (2014) (1)
Image Analysis and Processing Workshops, 2007. ICIAPW 2007. 14th International Conference on (2007) (1)
Multilingual Speech Recognition (2017) (1)
Proceedings of the 1999 DARPA Broadcast News Workshop (1999) (1)
Proceedings of the Second international conference on Machine Learning for Multimodal Interaction (2005) (1)
On the Efficiency of Recurrent Neural Network Optimization Algorithms (2015) (1)
Unstable connectionist networks in speech recognition (1988) (1)
User Generated Dialogue Systems: uDialogue (2017) (1)
The Ambient Spotlight: personal multimodal search without query (2010) (1)
The Edinburgh Speech Production Facility Dialogue Corpus (2010) (1)
The Edinburgh Speech Production Facility’s articulatory corpus of spontaneous dialogue. (2010) (1)
Analysis of a neural network model for speech recognition (1989) (1)
Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers (2021) (1)
Voice banking and reconstruction (commentary): Speech synthesis technologies for individuals with vocal disabilities (2011) (1)
Editorial: Expanding the Technical Reach of our Transactions (2014) (1)
Transcription of Conference Room M (2005) (1)
Multi-class extractive voicemail summarization (2003) (1)
Machine Learning for Multimodal Interaction, Third International Workshop, MLMI 2006, Bethesda, MD, USA, May 1-4, 2006, Revised Selected Papers (2006) (1)
Automatic dialogue act recognition using a dynamic Bayesian network (2007) (0)
The ambient spotlight: Personal meeting capture with a microphone array (2011) (0)
Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers (2020) (0)
On training the recurrent neural network encoder-decoder for large vocabulary end-to-end speech recognition (2015) (0)
Differentiable pooling for unsupervised speaker adaptation (2018) (0)
Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, April 22-27, 2007, Rochester, New York, USA (2007) (0)
Conditions for application (2018) (0)
Proceedings of the 4th international conference on Machine learning for multimodal interaction (2007) (0)
Review: Using Speech Recognition (1996) (0)
Prosodic features in spoken language identification (2019) (0)
Proceedings of Interspeech 2013 (2013) (0)
Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (2015) (0)
KER volume 27 issue 2 Cover and Back matter (2012) (0)
OPT2015 Optimization for Machine Learning at the Neural Information Processing Systems Conference, 2015 (2015) (0)
Proceedings of WASSS 2013 (2013) (0)
Using Participant Role in Multiparty Meetings as Prior Knowledge for Nonparametric Topic Modeling (2008) (0)
Session details: Multimodal communication analysis (Oral) (2009) (0)
Proceedings of the ITRW on Prosody in Speech Recognition and Understanding (2001) (0)
Top-down training for neural networks (2019) (0)
Interpretation of Multiparty Meetings: The AMI and AMIDA Projects (2008) (0)
Content-based Access to Spoken Audio (2005) (0)
Automatic Dialect Detection in Arabic Broadcast Speech (2018) (0)
Transforming Voice Source Parameters in a HMM-based Speech Synthesiser with Glottal Post-Filtering (2010) (0)
Manual and automatic labels for version 1.0 of UXTD, UXSSD, and UPX core data -- version 1.0 (2018) (0)
The MGB-2 Challenge: Arabic Multi-Device Broadcast Media Recognition (2016) (0)
A Deep 2D Convolutional Network for Waveform-Based Speech Recognition (2020) (0)
Annotation using Statistical Prosody Models (2001) (0)
Proceedings of the Analyzing Conversations in Text and Speech (ACTS), Proceedings of the Workshop (2006) (0)
The SPRACH/LaSIE system for named entity identification in broadcast news. (1999) (0)
Proceedings of the 12th International Conference on Multimodal Interfaces / 7. International Workshop on Machine Learning for Multimodal Interaction, ICMI-MLMI 2010, Beijing, China, November 8-12, 2010 (2010) (0)
A Review of: “Connectionism in Perspective” R. Pfeifer, Z. Schreter, F. Fogelman-Soulie & L. Steels (Eds) Amsterdam: North-Holland, 1989 ISBN 0-444-88061-5, 518 pp., $90.25 (1991) (0)
Proc. 7th ISCA Speech Synthesis Workshop (SSW7) (2010) (0)
Roadmap for Conversational Interaction Technologies (2016) (0)
Video processing and recognition (2012) (0)
Invited Talk: Recognition and Understanding of Meetings (2010) (0)
Introduction to the special issue on new approaches to statistical speech and text processing (2008) (0)
Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features (2017) (0)
Editorial (2014) (0)
Lattice-based lightly-supervised acoustic model training (2019) (0)
Meta Comments for Summarizing Meeting Speech (2018) (0)
The THISL SDR System at TREC-8 (0)
Accessing the spoken word (0)
INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, Singapore, September 14-18, 2014 (2014) (0)
Automatic speech recognition using peripheral auditory modelling and a PDP approach to classification (1987) (0)
Leveraging Linguistic Knowledge for Accent Robustness of End-to-End Models (2021) (0)
Multi-Stream Acoustic Modelling Using Raw Real and Imaginary Parts of the Fourier Transform (2023) (0)
Multimodal Signal Processing: References (2012) (0)
Learning phoneme recognition using neural networks (1989) (0)
In Proc. Interspeech 2013 (2013) (0)
Introduction to Speech Recognition (2013) (0)
Variable word rate N-grams (2018) (0)
Proceedings of the 10th International Workshop on Spoken Language Translation (IWSLT 2013) (2013) (0)
UltraSuite Repository - sample data (2019) (0)
Multi-frame factorisation for long-span acoustic modelling (2015) (0)
Recognizing Aurora-2 using Clean Speech Models (2009) (0)
Windowed Attention Mechanisms for Speech Recognition (2019) (0)
Multiplicative LSTM for sequence modelling (2017) (0)
Investigating the contribution of speaker attributes to speaker separability using disentangled speaker representations (2022) (0)
Simplifying very deep convolutional neural network architectures for robust speech recognition (2017) (0)
SLAM 2013 Speech, Language and Audio in Multimedia (2013) (0)
Grapheme-to-phoneme conversion methods for minority language conditions (2012) (0)
RETRIEVAL SYSTEM (Dave Abberley, David Kirby, Steve Renals and Tony Robinson) (1999) (0)
Multimodal Signal Processing: Conclusion and perspectives (2012) (0)
Towards Robust Waveform-Based Acoustic Models (2021) (0)
The MGB-2 Challenge: Arabic Multi-Device Broadcast Media (2018) (0)
Proceedings of the Second Workshop on Arabic Natural Language Processing (2015) (0)
Detection of speech and crosstalk in multi-channel meeting recordings (2004) (0)
Proc. The First Young Researchers Workshop in Speech Technology (2009) (0)
