Alex Acero
#147,414
Most Influential Person Now
Alex Acero's AcademicInfluence.com Rankings
Alex Aceroengineering Degrees
Engineering
#5816
World Rank
#7093
Historical Rank
Electrical Engineering
#1663
World Rank
#1761
Historical Rank

Alex Acerocomputer-science Degrees
Computer Science
#7509
World Rank
#7906
Historical Rank
Algorithms
#290
World Rank
#294
Historical Rank
Computational Linguistics
#1504
World Rank
#1520
Historical Rank
Machine Learning
#2785
World Rank
#2820
Historical Rank

Download Badge
Engineering Computer Science
Alex Acero's Degrees
- PhD Electrical Engineering Stanford University
- Masters Electrical Engineering Stanford University
- Bachelors Electrical Engineering Stanford University
Why Is Alex Acero Influential?
(Suggest an Edit or Addition)Alex Acero's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition (2012) (2840)
- Spoken Language Processing: A Guide to Theory, Algorithm and System Development (2001) (1996)
- Learning deep structured semantic models for web search using clickthrough data (2013) (1677)
- Spoken Language Processing (2001) (971)
- Recent advances in deep learning for speech research at Microsoft (2013) (743)
- Automatically extracting highlights for TV Baseball programs (2000) (492)
- Learning query intent from regularized click graphs (2008) (388)
- Hidden conditional random fields for phone classification (2005) (365)
- Binary coding of speech spectrograms using a deep auto-encoder (2010) (363)
- HMM adaptation using vector taylor series for noisy speech recognition (2000) (323)
- Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lo (2006) (266)
- Large-vocabulary speech recognition under adverse acoustic environments (2000) (257)
- Environmental robustness in automatic speech recognition (1990) (236)
- Large vocabulary continuous speech recognition with context-dependent DBN-HMMS (2011) (199)
- Efficient Cepstral Normalization for Robust Speech Recognition (1993) (176)
- Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion (2005) (175)
- Uncertainty decoding with SPLICE for noise robust speech recognition (2002) (174)
- Evaluation of the SPLICE algorithm on the Aurora2 database (2001) (161)
- High-performance robust speech recognition using stereo training data (2001) (158)
- Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise (2004) (136)
- Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion (2010) (134)
- High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series (2007) (129)
- Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition (2003) (121)
- A study on multilingual acoustic modeling for large vocabulary ASR (2009) (119)
- ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition (2001) (119)
- A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions (2009) (113)
- Extracting structured information from user queries with semi-supervised conditional random fields (2009) (112)
- Signal Processing for Robust Speech Recognition (1994) (107)
- Speech Denoising and Dereverberation Using Probabilistic Models (2000) (101)
- Structured speech modeling (2006) (94)
- Position Specific Posterior Lattices for Indexing Speech (2005) (93)
- Whistler: a trainable text-to-speech system (1996) (92)
- Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features (2004) (90)
- Noise Adaptive Training for Robust Automatic Speech Recognition (2010) (87)
- Automatic generation of synthesis units for trainable text-to-speech systems (1998) (87)
- An introduction to voice search (2008) (86)
- A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition (2008) (84)
- Live search for mobile:Web services by voice on the cellphone (2008) (84)
- Air- and bone-conductive integrated microphones for robust speech detection and enhancement (2003) (84)
- Formant analysis and synthesis using hidden Markov models (1999) (79)
- Soft indexing of speech content for search in spoken documents (2007) (76)
- Robust speech recognition by normalization of the acoustic space (1991) (74)
- Noise robust speech recognition with a switching linear dynamic model (2004) (73)
- Microsoft Windows highly intelligent speech recognizer: Whisper (1995) (72)
- Discriminative models for spoken language understanding (2006) (71)
- Evaluation of SPLICE on the Aurora 2 and 3 tasks (2002) (70)
- Spoken language understanding (2005) (68)
- Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor (2008) (68)
- Multi-sensory microphones for robust speech detection, enhancement and recognition (2004) (67)
- Recent improvements on Microsoft's trainable text-to-speech system-Whistler (1997) (65)
- An Integrative and Discriminative Technique for Spoken Utterance Classification (2008) (65)
- Noise adaptive training using a vector taylor series approach for noise robust automatic speech recognition (2009) (64)
- Why word error rate is not a good metric for speech recognizer training for the speech translation task? (2011) (63)
- Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks (2007) (62)
- Combination of statistical and rule-based approaches for spoken language understanding (2002) (61)
- Spoken Language Understanding "” An Introduction to the Statistical Framework (2005) (61)
- Automated directory assistance system - from theory to practice (2007) (59)
- Distributed speech processing in miPad's multimodal user interface (2002) (58)
- Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model (2007) (57)
- Mipad: a next generation PDA prototype (2000) (55)
- Speech utterance classification (2003) (55)
- Tracking Vocal Tract Resonances Using a Quantized Nonlinear Function Embedded in a Temporal Constraint (2006) (52)
- A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances (2004) (51)
- Semantic Frame‐Based Spoken Language Understanding (2011) (51)
- Maximum mutual information SPLICE transform for seen and unseen conditions (2005) (48)
- Environment normalization for robust speech recognition using direct cepstral comparison (1994) (48)
- Analysis and comparison of two speech feature extraction/compensation algorithms (2005) (48)
- Maximum Entropy Confidence Estimation for Speech Recognition (2007) (48)
- MiPad: a multimodal interaction prototype (2001) (47)
- Using continuous features in the maximum entropy model (2009) (47)
- Efficient joint compensation of speech for the effects of additive noise and linear filtering (1992) (46)
- Direct filtering for air- and bone-conductive microphones (2004) (46)
- A Bayesian approach to speech feature enhancement using the dynamic cepstral prior (2002) (46)
- Multiple Approaches to Robust Speech Recognition (1992) (43)
- A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models (2009) (42)
- Large-margin minimum classification error training: A theoretical risk minimization perspective (2008) (40)
- A comparison of three non-linear observation models for noisy speech features (2003) (39)
- Combination of CFG and n-gram modeling in semantic grammar learning (2003) (38)
- Robust bandwidth extension of noise-corrupted narrowband speech (2005) (38)
- Training Algorithms for Hidden Conditional Random Fields (2006) (37)
- Exploiting variances in robust feature extraction based on a parametric model of speech distortion (2002) (37)
- HMM-based smoothing for concatenative speech synthesis (1998) (36)
- Improvements on speech recognition for fast talkers (1999) (36)
- The VESTEL telephone speech database (1994) (35)
- Voicepedia: towards speech-based access to unstructured information (2007) (35)
- Speaker and gender normalization for continuous-density hidden Markov models (1996) (35)
- Rapid development of spoken language understanding grammars (2006) (34)
- Hidden conditional random field with distribution constraints for phone classification (2009) (34)
- Commute UX: Voice Enabled In-car Infotainment System (2009) (34)
- Use of incrementally regulated discriminative margins in MCE training for speech recognition (2006) (34)
- Statistical Modeling of the Speech Signal (2010) (34)
- A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise (2001) (33)
- ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition (2001) (33)
- Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint (2003) (31)
- Learning Dynamic Noise Models from Noisy Speech for Robust Speech Recognition (2001) (31)
- Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion (2009) (31)
- Separating Speaker and Environmental Variability Using Factored Transforms (2011) (31)
- Maximum a posteriori pitch tracking (1998) (30)
- Combining Statistical and Knowledge-Based Spoken Language Understanding in Conditional Models (2006) (30)
- Grammar learning for spoken language understanding (2001) (30)
- HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition (2008) (29)
- A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition (2006) (28)
- Robust HMM-based endpoint detector (1993) (28)
- Leakage model and teeth clack removal for air- and bone-conductive integrated microphones (2005) (28)
- Noise from corrupted speech log mel-spectral energies (2002) (26)
- Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling (2008) (26)
- Voice search of structured media data (2009) (26)
- Context dependent phonetic string edit distance for automatic speech recognition (2010) (26)
- A noise-robust ASR front-end using Wiener filter constructed from MMSE estimation of clean speech and noise (2003) (25)
- AUGMENTED CEPSTRAL NORMALIZATION FOR ROBUST SPEECH RECOGNITION (2000) (24)
- An expectation maximization approach for formant tracking using a parameter-free non-linear predictor (2003) (24)
- N-Gram Based Filler Model for Robust Grammar Authoring (2006) (24)
- A novel decision function and the associated decision-feedback learning for speech translation (2011) (24)
- From Sphinx-II to Whisper — Making Speech Recognition Usable (1996) (24)
- Learning with click graph for query intent classification (2010) (23)
- Adapting grapheme-to-phoneme conversion for name recognition (2007) (23)
- Unified framework for single channel speech enhancement (2009) (23)
- A lattice search technique for a long-contextual-span hidden trajectory model of speech (2006) (22)
- MICROPHONE ARRAY POST-PROCESSOR USING INSTANTANEOUS DIRECTION OF ARRIVAL (2006) (22)
- Joint estimation of noise and channel distortion in a generalized EM framework (2001) (22)
- Estimating speech recognition error rate without acoustic test data (2003) (22)
- Voice-Rate: A Dialog System for Consumer Ratings (2007) (22)
- Cross-lingual speech recognition under runtime resource constraints (2009) (22)
- Source-filter models for time-scale pitch-scale modification of speech (1998) (21)
- Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition (2007) (21)
- A harmonic-model-based front end for robust speech recognition (2003) (21)
- Robust Adaptive Beamforming Algorithm using Instantaneous Direction of Arrival with Enhanced Noise Suppression Capability (2007) (20)
- Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition (2010) (20)
- Towards non-stationary model-based noise adaptation for large vocabulary speech recognition (2001) (20)
- Joint Discriminative Front End and Back End Training for Improved Speech Recognition Accuracy (2006) (20)
- Language modeling for voice search: A machine translation approach (2008) (19)
- Automatic children's reading tutor on hand-held devices (2008) (19)
- Automatic Removal of Typed Keystrokes From Speech Signals (2007) (19)
- A Bidirectional Target Filtering Model of Speech Coarticulation: two-stage Implementation for Phonetic Recognition (2006) (19)
- Evaluation of spoken language grammar learning in the ATIS domain (2002) (18)
- Rejection techniques for digit recognition in telecommunication applications (1993) (17)
- Maximizing global entropy reduction for active learning in speech recognition (2009) (17)
- Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition (2001) (17)
- Efficient and Robust Language Modeling in an Automatic Children's Reading Tutor System (2007) (17)
- Lexicon modeling for query understanding (2011) (16)
- Microphone Array for Headset with Spatial Noise Suppressor (2005) (16)
- Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using a* lattice search (2005) (16)
- A graphical model for multi-sensory speech processing in air-and-bone conductive microphones (2005) (16)
- Factored adaptation for separable compensation of speaker and environmental variability (2011) (16)
- Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system (2001) (16)
- A Generative-Discriminative Framework using Ensemble Methods for Text-Dependent Speaker Verification (2007) (16)
- Speech/noise separation using two microphones and a VQ model of speech signals (2000) (16)
- A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech (2004) (15)
- Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment (2002) (15)
- A hidden trajectory model with bi-directional target filtering: cascaded vs. integrated implementation for phonetic recognition (2005) (15)
- Indexing uncertainty for spoken document search (2005) (15)
- Concept acquisition in example-based grammar authoring (2003) (14)
- Discriminative training of variable-parameter HMMs for noise robust speech recognition (2008) (14)
- Training wideband acoustic models using mixed-bandwidth training data via feature bandwidth extension (2005) (14)
- Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation (2007) (13)
- Speech Utterance Classification Model Training without Manual Transcriptions (2006) (13)
- Speech enhancement using a pitch predictive model (2008) (13)
- A new speaker identification algorithm for gaming scenarios (2011) (13)
- Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data (2003) (13)
- Unsupervised learning from users' error correction in speech dictation (2004) (12)
- Using collective information in semi-supervised learning for speech recognition (2009) (12)
- Acoustical Pre-Processing for Robust Speech Recognition (1989) (12)
- Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation (2008) (12)
- Sound capture system and spatial filter for small devices (2008) (12)
- Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement (2006) (12)
- Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone (2004) (11)
- Discriminative training methods for language models using conditional entropy criteria (2010) (11)
- How to train a discriminative front end with stochastic gradient descent and maximum mutual information (2005) (11)
- A long-contextual-span model of resonance dynamics for speech recognition: parameter learning and recognizer evaluation (2005) (10)
- Microphone Array Post-Filter using Incremental Bayes Learning to Track the Spatial Distributions of Speech and Noise (2007) (10)
- Factored adaptation using a combination of feature-space and model-space transforms (2012) (10)
- Robust design of wideband loudspeaker arrays (2008) (10)
- Improved name recognition with user modeling (2003) (10)
- Information retrieval methods for automatic speech recognition (2010) (10)
- Robust speech recognition using cepstral minimum-mean-square-error noise suppressor (2008) (10)
- A Semantically Structured Language Model (2004) (10)
- An overview of text-to-speech synthesis (2000) (9)
- Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition (2008) (9)
- Speech and Language Processing for Multimodal Human-Computer Interaction (2004) (9)
- Speaker adaptation with an Exponential Transform (2011) (8)
- SGStudio: rapid semantic grammar development for spoken language understanding (2005) (8)
- A fine pitch model for speech (2007) (8)
- A Robust HMM-Based Endpoint Detector for Telecommunication Applications (1993) (8)
- Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction (2005) (8)
- Towards Environment-Independent Spoken Language Systems (1990) (8)
- Noise robust model adaptation using linear spline interpolation (2009) (8)
- Discriminative training of garbage model for non-vocabulary utterance rejection (1994) (8)
- Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition (2008) (8)
- Reverberated speech signal separation based on regularized subband feedforward ICA and instantaneous direction of arrival (2010) (7)
- Discriminative training of n-gram classifiers for speech and text routing (2003) (7)
- Continuous speech recognition with a TF-IDF acoustic model (2010) (7)
- INTEGRATION OF METADATA IN SPOKEN DOCUMENT SEARCH USING POSITION SPECIFIC POSTERIOR LATICES (2006) (6)
- Maximum entropy based generic filter for language model adaptation (2005) (6)
- Maximum entropy model parameterization with TF∗IDF weighted vector space model (2007) (6)
- Robust location understanding in spoken dialog systems using intersections (2007) (6)
- Cross-Pollination in Signal Processing Technical Areas (2009) (6)
- The MSR system for IWSLT 2011 evaluation (2011) (6)
- Dual stage probabilistic voice activity detector. (2010) (6)
- Statistical Spoken Language Understanding: from Generative Model to Conditional Model (2005) (6)
- Unsupervised semantic intent discovery from call log acoustics (2005) (5)
- Speech Recognition and Understanding (2003) (5)
- Call analysis with classification using speech and non-speech features (2006) (5)
- Confidence measures for voice search applications (2007) (5)
- Adapting acoustic models to new domains and conditions using untranscribed data (2003) (4)
- A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification (2007) (4)
- Suppression Rule for Speech Recognition Friendly Noise Suppressors (2006) (4)
- New methods and evaluation experiments on translating TED talks in the IWSLT benchmark (2012) (4)
- A speech-centric perspective for human-computer interface (2002) (4)
- Conditional Maximum Likelihood Estimation of Naive Bayes Probability Models Using Rational Function Growth Transform (2004) (4)
- Media Search in Mobile Devices [From the Guest Editors] (2011) (4)
- A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech (1998) (3)
- Embracing a New Golden Age of Signal Processing (2009) (3)
- SPEECH OGLE: Indexing Uncertainty for Spoken Document Search (2005) (3)
- IMPROVEMENTS ON SPEECH RECOGNITON FOR FAST TALKERS (1999) (3)
- Pruning Analysis for the Position Specific Posterior Lattices for Spoken Document Search (2006) (3)
- Inductive and example-based learning for text classification (2008) (3)
- Joint encoding of the waveform and speech recognition features using a transform codec (2011) (3)
- Robust Multichannel Linear Prediction for Online Speech Dereverberation Using Weighted Householder Least Squares Lattice Adaptive Filter (2020) (3)
- Use and Acquisition of Semantic Language Model (2004) (3)
- Maximum a posteriori ICA: Applying prior knowledge to the separation of acoustic sources (2008) (3)
- Experimental investigation of delayed instantaneous demixer for speech enhancement (2001) (2)
- Experimenting with a global decision tree for state clustering in automatic speech recognition systems (2009) (2)
- Automatic Head-size Equalization in Panorama Images for Video Conferencing (2005) (2)
- Media Search in Mobile Devices (2011) (2)
- UNCERTAINTY DECODING WITH SPUCE FOR NOISE ROBUST SPEECH RECOGNITION (2002) (2)
- An EM algorithm for training wideband acoustic models from mixed-bandwidth training data (2005) (2)
- Adaptation of compressed HMM parameters for resource-constrained speech recognition (2008) (2)
- An effective and efficient utterance verification technology using word n-gram filler models (2006) (2)
- 5. Voice Search (2011) (1)
- HMM adaptation using linear spline interpolation with integrated spline parameter training for robust speech recognition (2010) (1)
- Commute UX: Telephone Dialog System for Location-based Services (2007) (1)
- Should We Experiment with New Peer-Review Models? [President's Message] (2015) (1)
- Separating colorred signals distorted by convolutive channels using diagonal constrained decorrelation (2002) (1)
- A time-synchronous phonetic decoder for a long-contextual-Span hidden trajectory model (2006) (1)
- At the Forefront in Technical Publications [President's Message] (2014) (1)
- Chapters? Role in Networking and Continuing Education [President's Message] (2014) (1)
- AN EM-based probabilistic approach for Acoustic Echo Suppression (2008) (1)
- Curiosity in Science and Technology (2009) (1)
- The IEEE Signal Processing Cup: A Competition for Undergraduate Students [President's Message] (2015) (1)
- Voice Search - An Introduction (2008) (1)
- 2006 Workshop on Spoken Language Technology (2006) (1)
- Building Voice User Interfaces (2006) (1)
- Spoken Languge Understanding (2010) (0)
- A TIME-SYNCHRONOUS PHONETIC DECODE HIDDEN TRAJECTO (2006) (0)
- Semiautomatic improvements of system-initiative spoken dialog applications using interactive clustering (2005) (0)
- Speech and Language Processing for Multimodal Human-Computer Interaction (Invited Article) (2004) (0)
- Phone Classification Using Hidden Conditional Random Fields (0)
- Speech Research: Near and Not-so-near Results and What They Might Mean for IUI (Panel). (1998) (0)
- Perspectives on Special Issues (2009) (0)
- DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants (2021) (0)
- Handling phonetic context and speaker variation in a structure-based speech recognizer (2007) (0)
- Siri's voice gets deep learning (2016) (0)
- Impact of Signal Processing and of Our Work (2010) (0)
- Acoustical pre-processing for robust spoken language systems (1990) (0)
- SigView: Video Tutorials in Emerging Signal Processing Topics [President's Message] (2015) (0)
- Removal of Typed Keystrokes from Speech Signals (2007) (0)
- Author ' s personal copy Using continuous features in the maximum entropy model q (2009) (0)
- Signal Processing: The Science Behind Our Digital Life [President's Message] (2015) (0)
- Conditional Maximum Likelihood Estimation Using Rational Function Growth Transform (2004) (0)
- Speech research (panel): near and not-so-near results and what they might mean for IUI (1998) (0)
- A DISCRIMINATIVETRAININGFRAMEWORK USINGN-BESTSPEECHRECOGNITION TRANSCRIPTIONSAND SCORESFOR SPOKENUTTERANCE CLASSIFICATION (2007) (0)
- We Need Your Help to Take the Society to New Heights [President's Message] (2016) (0)
- Conditional ML Estimation Using Rational Function Growth Transform (2004) (0)
- SigPort: A Paper Repository for Signal Processing [President's Message] (2015) (0)
- Towards Microphone-Independent Spoken Language Systems (1990) (0)
- The Only Constant Is Change [President's Message] (2014) (0)
- Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated (2008) (0)
- TRANSACTIONS ASSOCIATE EDITORS (2008) (0)
- The IEEE Gives Our Society the "Thumbs Up" [President's Message] (2015) (0)
- Computers, Robotics, and the Human Brain (2008) (0)
- GlobalSIP and ChinaSIP: New Conferences Developed by the IEEE Signal Processing Society [President's Message] (2014) (0)
- Impacto Global da Signal Processing Magazine (2011) (0)
- Endpoint Detection For Speech Under Noisy Environments (2014) (0)
- Report on the NSF-sponsored Human Language Technology Workshop on Industrial Centers (2007) (0)
- Novel Acoustic Modeling with Structured Hidden Dynamics for Speech Coarticulation and Reduction (2004) (0)
- Where Does Your Conference Registration Fee Go? [President's Message] (2014) (0)
This paper list is powered by the following services: