Alex Acero

Alex Acero's AcademicInfluence.com Rankings

Engineering: World Rank #5816, Historical Rank #7093
Electrical Engineering: World Rank #1663, Historical Rank #1761

Computer Science: World Rank #7509, Historical Rank #7906
Algorithms: World Rank #290, Historical Rank #294
Computational Linguistics: World Rank #1504, Historical Rank #1520
Machine Learning: World Rank #2785, Historical Rank #2820

Alex Acero's Degrees

PhD Electrical Engineering Stanford University
Masters Electrical Engineering Stanford University
Bachelors Electrical Engineering Stanford University

Alex Acero's Published Works

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition (2012) (2840)
Spoken Language Processing: A Guide to Theory, Algorithm and System Development (2001) (1996)
Learning deep structured semantic models for web search using clickthrough data (2013) (1677)
Spoken Language Processing (2001) (971)
Recent advances in deep learning for speech research at Microsoft (2013) (743)
Automatically extracting highlights for TV Baseball programs (2000) (492)
Learning query intent from regularized click graphs (2008) (388)
Hidden conditional random fields for phone classification (2005) (365)
Binary coding of speech spectrograms using a deep auto-encoder (2010) (363)
HMM adaptation using vector taylor series for noisy speech recognition (2000) (323)
Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lot (2006) (266)
Large-vocabulary speech recognition under adverse acoustic environments (2000) (257)
Environmental robustness in automatic speech recognition (1990) (236)
Large vocabulary continuous speech recognition with context-dependent DBN-HMMS (2011) (199)
Efficient Cepstral Normalization for Robust Speech Recognition (1993) (176)
Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion (2005) (175)
Uncertainty decoding with SPLICE for noise robust speech recognition (2002) (174)
Evaluation of the SPLICE algorithm on the Aurora2 database (2001) (161)
High-performance robust speech recognition using stereo training data (2001) (158)
Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise (2004) (136)
Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion (2010) (134)
High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series (2007) (129)
Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition (2003) (121)
A study on multilingual acoustic modeling for large vocabulary ASR (2009) (119)
ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition (2001) (119)
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions (2009) (113)
Extracting structured information from user queries with semi-supervised conditional random fields (2009) (112)
Signal Processing for Robust Speech Recognition (1994) (107)
Speech Denoising and Dereverberation Using Probabilistic Models (2000) (101)
Structured speech modeling (2006) (94)
Position Specific Posterior Lattices for Indexing Speech (2005) (93)
Whistler: a trainable text-to-speech system (1996) (92)
Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features (2004) (90)
Noise Adaptive Training for Robust Automatic Speech Recognition (2010) (87)
Automatic generation of synthesis units for trainable text-to-speech systems (1998) (87)
An introduction to voice search (2008) (86)
A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition (2008) (84)
Live search for mobile: Web services by voice on the cellphone (2008) (84)
Air- and bone-conductive integrated microphones for robust speech detection and enhancement (2003) (84)
Formant analysis and synthesis using hidden Markov models (1999) (79)
Soft indexing of speech content for search in spoken documents (2007) (76)
Robust speech recognition by normalization of the acoustic space (1991) (74)
Noise robust speech recognition with a switching linear dynamic model (2004) (73)
Microsoft Windows highly intelligent speech recognizer: Whisper (1995) (72)
Discriminative models for spoken language understanding (2006) (71)
Evaluation of SPLICE on the Aurora 2 and 3 tasks (2002) (70)
Spoken language understanding (2005) (68)
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor (2008) (68)
Multi-sensory microphones for robust speech detection, enhancement and recognition (2004) (67)
Recent improvements on Microsoft's trainable text-to-speech system-Whistler (1997) (65)
An Integrative and Discriminative Technique for Spoken Utterance Classification (2008) (65)
Noise adaptive training using a vector taylor series approach for noise robust automatic speech recognition (2009) (64)
Why word error rate is not a good metric for speech recognizer training for the speech translation task? (2011) (63)
Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks (2007) (62)
Combination of statistical and rule-based approaches for spoken language understanding (2002) (61)
Spoken Language Understanding: An Introduction to the Statistical Framework (2005) (61)
Automated directory assistance system - from theory to practice (2007) (59)
Distributed speech processing in miPad's multimodal user interface (2002) (58)
Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model (2007) (57)
Mipad: a next generation PDA prototype (2000) (55)
Speech utterance classification (2003) (55)
Tracking Vocal Tract Resonances Using a Quantized Nonlinear Function Embedded in a Temporal Constraint (2006) (52)
A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances (2004) (51)
Semantic Frame‐Based Spoken Language Understanding (2011) (51)
Maximum mutual information SPLICE transform for seen and unseen conditions (2005) (48)
Environment normalization for robust speech recognition using direct cepstral comparison (1994) (48)
Analysis and comparison of two speech feature extraction/compensation algorithms (2005) (48)
Maximum Entropy Confidence Estimation for Speech Recognition (2007) (48)
MiPad: a multimodal interaction prototype (2001) (47)
Using continuous features in the maximum entropy model (2009) (47)
Efficient joint compensation of speech for the effects of additive noise and linear filtering (1992) (46)
Direct filtering for air- and bone-conductive microphones (2004) (46)
A Bayesian approach to speech feature enhancement using the dynamic cepstral prior (2002) (46)
Multiple Approaches to Robust Speech Recognition (1992) (43)
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models (2009) (42)
Large-margin minimum classification error training: A theoretical risk minimization perspective (2008) (40)
A comparison of three non-linear observation models for noisy speech features (2003) (39)
Combination of CFG and n-gram modeling in semantic grammar learning (2003) (38)
Robust bandwidth extension of noise-corrupted narrowband speech (2005) (38)
Training Algorithms for Hidden Conditional Random Fields (2006) (37)
Exploiting variances in robust feature extraction based on a parametric model of speech distortion (2002) (37)
HMM-based smoothing for concatenative speech synthesis (1998) (36)
Improvements on speech recognition for fast talkers (1999) (36)
The VESTEL telephone speech database (1994) (35)
Voicepedia: towards speech-based access to unstructured information (2007) (35)
Speaker and gender normalization for continuous-density hidden Markov models (1996) (35)
Rapid development of spoken language understanding grammars (2006) (34)
Hidden conditional random field with distribution constraints for phone classification (2009) (34)
Commute UX: Voice Enabled In-car Infotainment System (2009) (34)
Use of incrementally regulated discriminative margins in MCE training for speech recognition (2006) (34)
Statistical Modeling of the Speech Signal (2010) (34)
A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise (2001) (33)
ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition (2001) (33)
Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint (2003) (31)
Learning Dynamic Noise Models from Noisy Speech for Robust Speech Recognition (2001) (31)
Discriminative pronunciation learning using phonetic decoder and minimum-classification-error criterion (2009) (31)
Separating Speaker and Environmental Variability Using Factored Transforms (2011) (31)
Maximum a posteriori pitch tracking (1998) (30)
Combining Statistical and Knowledge-Based Spoken Language Understanding in Conditional Models (2006) (30)
Grammar learning for spoken language understanding (2001) (30)
HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition (2008) (29)
A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition (2006) (28)
Robust HMM-based endpoint detector (1993) (28)
Leakage model and teeth clack removal for air- and bone-conductive integrated microphones (2005) (28)
Noise from corrupted speech log mel-spectral energies (2002) (26)
Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling (2008) (26)
Voice search of structured media data (2009) (26)
Context dependent phonetic string edit distance for automatic speech recognition (2010) (26)
A noise-robust ASR front-end using Wiener filter constructed from MMSE estimation of clean speech and noise (2003) (25)
AUGMENTED CEPSTRAL NORMALIZATION FOR ROBUST SPEECH RECOGNITION (2000) (24)
An expectation maximization approach for formant tracking using a parameter-free non-linear predictor (2003) (24)
N-Gram Based Filler Model for Robust Grammar Authoring (2006) (24)
A novel decision function and the associated decision-feedback learning for speech translation (2011) (24)
From Sphinx-II to Whisper — Making Speech Recognition Usable (1996) (24)
Learning with click graph for query intent classification (2010) (23)
Adapting grapheme-to-phoneme conversion for name recognition (2007) (23)
Unified framework for single channel speech enhancement (2009) (23)
A lattice search technique for a long-contextual-span hidden trajectory model of speech (2006) (22)
MICROPHONE ARRAY POST-PROCESSOR USING INSTANTANEOUS DIRECTION OF ARRIVAL (2006) (22)
Joint estimation of noise and channel distortion in a generalized EM framework (2001) (22)
Estimating speech recognition error rate without acoustic test data (2003) (22)
Voice-Rate: A Dialog System for Consumer Ratings (2007) (22)
Cross-lingual speech recognition under runtime resource constraints (2009) (22)
Source-filter models for time-scale pitch-scale modification of speech (1998) (21)
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition (2007) (21)
A harmonic-model-based front end for robust speech recognition (2003) (21)
Robust Adaptive Beamforming Algorithm using Instantaneous Direction of Arrival with Enhanced Noise Suppression Capability (2007) (20)
Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition (2010) (20)
Towards non-stationary model-based noise adaptation for large vocabulary speech recognition (2001) (20)
Joint Discriminative Front End and Back End Training for Improved Speech Recognition Accuracy (2006) (20)
Language modeling for voice search: A machine translation approach (2008) (19)
Automatic children's reading tutor on hand-held devices (2008) (19)
Automatic Removal of Typed Keystrokes From Speech Signals (2007) (19)
A Bidirectional Target Filtering Model of Speech Coarticulation: two-stage Implementation for Phonetic Recognition (2006) (19)
Evaluation of spoken language grammar learning in the ATIS domain (2002) (18)
Rejection techniques for digit recognition in telecommunication applications (1993) (17)
Maximizing global entropy reduction for active learning in speech recognition (2009) (17)
Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition (2001) (17)
Efficient and Robust Language Modeling in an Automatic Children's Reading Tutor System (2007) (17)
Lexicon modeling for query understanding (2011) (16)
Microphone Array for Headset with Spatial Noise Suppressor (2005) (16)
Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using A* lattice search (2005) (16)
A graphical model for multi-sensory speech processing in air-and-bone conductive microphones (2005) (16)
Factored adaptation for separable compensation of speaker and environmental variability (2011) (16)
Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system (2001) (16)
A Generative-Discriminative Framework using Ensemble Methods for Text-Dependent Speaker Verification (2007) (16)
Speech/noise separation using two microphones and a VQ model of speech signals (2000) (16)
A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech (2004) (15)
Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment (2002) (15)
A hidden trajectory model with bi-directional target filtering: cascaded vs. integrated implementation for phonetic recognition (2005) (15)
Indexing uncertainty for spoken document search (2005) (15)
Concept acquisition in example-based grammar authoring (2003) (14)
Discriminative training of variable-parameter HMMs for noise robust speech recognition (2008) (14)
Training wideband acoustic models using mixed-bandwidth training data via feature bandwidth extension (2005) (14)
Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation (2007) (13)
Speech Utterance Classification Model Training without Manual Transcriptions (2006) (13)
Speech enhancement using a pitch predictive model (2008) (13)
A new speaker identification algorithm for gaming scenarios (2011) (13)
Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data (2003) (13)
Unsupervised learning from users' error correction in speech dictation (2004) (12)
Using collective information in semi-supervised learning for speech recognition (2009) (12)
Acoustical Pre-Processing for Robust Speech Recognition (1989) (12)
Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation (2008) (12)
Sound capture system and spatial filter for small devices (2008) (12)
Speech Modeling with Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement (2006) (12)
Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone (2004) (11)
Discriminative training methods for language models using conditional entropy criteria (2010) (11)
How to train a discriminative front end with stochastic gradient descent and maximum mutual information (2005) (11)
A long-contextual-span model of resonance dynamics for speech recognition: parameter learning and recognizer evaluation (2005) (10)
Microphone Array Post-Filter using Incremental Bayes Learning to Track the Spatial Distributions of Speech and Noise (2007) (10)
Factored adaptation using a combination of feature-space and model-space transforms (2012) (10)
Robust design of wideband loudspeaker arrays (2008) (10)
Improved name recognition with user modeling (2003) (10)
Information retrieval methods for automatic speech recognition (2010) (10)
Robust speech recognition using cepstral minimum-mean-square-error noise suppressor (2008) (10)
A Semantically Structured Language Model (2004) (10)
An overview of text-to-speech synthesis (2000) (9)
Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition (2008) (9)
Speech and Language Processing for Multimodal Human-Computer Interaction (2004) (9)
Speaker adaptation with an Exponential Transform (2011) (8)
SGStudio: rapid semantic grammar development for spoken language understanding (2005) (8)
A fine pitch model for speech (2007) (8)
A Robust HMM-Based Endpoint Detector for Telecommunication Applications (1993) (8)
Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction (2005) (8)
Towards Environment-Independent Spoken Language Systems (1990) (8)
Noise robust model adaptation using linear spline interpolation (2009) (8)
Discriminative training of garbage model for non-vocabulary utterance rejection (1994) (8)
Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition (2008) (8)
Reverberated speech signal separation based on regularized subband feedforward ICA and instantaneous direction of arrival (2010) (7)
Discriminative training of n-gram classifiers for speech and text routing (2003) (7)
Continuous speech recognition with a TF-IDF acoustic model (2010) (7)
INTEGRATION OF METADATA IN SPOKEN DOCUMENT SEARCH USING POSITION SPECIFIC POSTERIOR LATTICES (2006) (6)
Maximum entropy based generic filter for language model adaptation (2005) (6)
Maximum entropy model parameterization with TF∗IDF weighted vector space model (2007) (6)
Robust location understanding in spoken dialog systems using intersections (2007) (6)
Cross-Pollination in Signal Processing Technical Areas (2009) (6)
The MSR system for IWSLT 2011 evaluation (2011) (6)
Dual stage probabilistic voice activity detector. (2010) (6)
Statistical Spoken Language Understanding: from Generative Model to Conditional Model (2005) (6)
Unsupervised semantic intent discovery from call log acoustics (2005) (5)
Speech Recognition and Understanding (2003) (5)
Call analysis with classification using speech and non-speech features (2006) (5)
Confidence measures for voice search applications (2007) (5)
Adapting acoustic models to new domains and conditions using untranscribed data (2003) (4)
A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification (2007) (4)
Suppression Rule for Speech Recognition Friendly Noise Suppressors (2006) (4)
New methods and evaluation experiments on translating TED talks in the IWSLT benchmark (2012) (4)
A speech-centric perspective for human-computer interface (2002) (4)
Conditional Maximum Likelihood Estimation of Naive Bayes Probability Models Using Rational Function Growth Transform (2004) (4)
Media Search in Mobile Devices [From the Guest Editors] (2011) (4)
A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech (1998) (3)
Embracing a New Golden Age of Signal Processing (2009) (3)
SPEECH OGLE: Indexing Uncertainty for Spoken Document Search (2005) (3)
IMPROVEMENTS ON SPEECH RECOGNITION FOR FAST TALKERS (1999) (3)
Pruning Analysis for the Position Specific Posterior Lattices for Spoken Document Search (2006) (3)
Inductive and example-based learning for text classification (2008) (3)
Joint encoding of the waveform and speech recognition features using a transform codec (2011) (3)
Robust Multichannel Linear Prediction for Online Speech Dereverberation Using Weighted Householder Least Squares Lattice Adaptive Filter (2020) (3)
Use and Acquisition of Semantic Language Model (2004) (3)
Maximum a posteriori ICA: Applying prior knowledge to the separation of acoustic sources (2008) (3)
Experimental investigation of delayed instantaneous demixer for speech enhancement (2001) (2)
Experimenting with a global decision tree for state clustering in automatic speech recognition systems (2009) (2)
Automatic Head-size Equalization in Panorama Images for Video Conferencing (2005) (2)
Media Search in Mobile Devices (2011) (2)
UNCERTAINTY DECODING WITH SPLICE FOR NOISE ROBUST SPEECH RECOGNITION (2002) (2)
An EM algorithm for training wideband acoustic models from mixed-bandwidth training data (2005) (2)
Adaptation of compressed HMM parameters for resource-constrained speech recognition (2008) (2)
An effective and efficient utterance verification technology using word n-gram filler models (2006) (2)
5. Voice Search (2011) (1)
HMM adaptation using linear spline interpolation with integrated spline parameter training for robust speech recognition (2010) (1)
Commute UX: Telephone Dialog System for Location-based Services (2007) (1)
Should We Experiment with New Peer-Review Models? [President's Message] (2015) (1)
Separating colored signals distorted by convolutive channels using diagonal constrained decorrelation (2002) (1)
A time-synchronous phonetic decoder for a long-contextual-Span hidden trajectory model (2006) (1)
At the Forefront in Technical Publications [President's Message] (2014) (1)
Chapters' Role in Networking and Continuing Education [President's Message] (2014) (1)
AN EM-based probabilistic approach for Acoustic Echo Suppression (2008) (1)
Curiosity in Science and Technology (2009) (1)
The IEEE Signal Processing Cup: A Competition for Undergraduate Students [President's Message] (2015) (1)
Voice Search - An Introduction (2008) (1)
2006 Workshop on Spoken Language Technology (2006) (1)
Building Voice User Interfaces (2006) (1)
Spoken Language Understanding (2010) (0)
A TIME-SYNCHRONOUS PHONETIC DECODE HIDDEN TRAJECTO (2006) (0)
Semiautomatic improvements of system-initiative spoken dialog applications using interactive clustering (2005) (0)
Speech and Language Processing for Multimodal Human-Computer Interaction (Invited Article) (2004) (0)
Phone Classification Using Hidden Conditional Random Fields (0)
Speech Research: Near and Not-so-near Results and What They Might Mean for IUI (Panel). (1998) (0)
Perspectives on Special Issues (2009) (0)
DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants (2021) (0)
Handling phonetic context and speaker variation in a structure-based speech recognizer (2007) (0)
Siri's voice gets deep learning (2016) (0)
Impact of Signal Processing and of Our Work (2010) (0)
Acoustical pre-processing for robust spoken language systems (1990) (0)
SigView: Video Tutorials in Emerging Signal Processing Topics [President's Message] (2015) (0)
Removal of Typed Keystrokes from Speech Signals (2007) (0)
Using continuous features in the maximum entropy model (2009) (0)
Signal Processing: The Science Behind Our Digital Life [President's Message] (2015) (0)
Conditional Maximum Likelihood Estimation Using Rational Function Growth Transform (2004) (0)
Speech research (panel): near and not-so-near results and what they might mean for IUI (1998) (0)
A DISCRIMINATIVE TRAINING FRAMEWORK USING N-BEST SPEECH RECOGNITION TRANSCRIPTIONS AND SCORES FOR SPOKEN UTTERANCE CLASSIFICATION (2007) (0)
We Need Your Help to Take the Society to New Heights [President's Message] (2016) (0)
Conditional ML Estimation Using Rational Function Growth Transform (2004) (0)
SigPort: A Paper Repository for Signal Processing [President's Message] (2015) (0)
Towards Microphone-Independent Spoken Language Systems (1990) (0)
The Only Constant Is Change [President's Message] (2014) (0)
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated (2008) (0)
TRANSACTIONS ASSOCIATE EDITORS (2008) (0)
The IEEE Gives Our Society the "Thumbs Up" [President's Message] (2015) (0)
Computers, Robotics, and the Human Brain (2008) (0)
GlobalSIP and ChinaSIP: New Conferences Developed by the IEEE Signal Processing Society [President's Message] (2014) (0)
Impacto Global da Signal Processing Magazine (2011) (0)
Endpoint Detection For Speech Under Noisy Environments (2014) (0)
Report on the NSF-sponsored Human Language Technology Workshop on Industrial Centers (2007) (0)
Novel Acoustic Modeling with Structured Hidden Dynamics for Speech Coarticulation and Reduction (2004) (0)
Where Does Your Conference Registration Fee Go? [President's Message] (2014) (0)
