Alan W. Black
#93,999
Most Influential Person Now
British computer scientist
Alan W. Black's AcademicInfluence.com Rankings
Alan W. Black's Computer Science Degrees
Computer Science
#3798
World Rank
#3993
Historical Rank
Computational Linguistics
#467
World Rank
#475
Historical Rank
Database
#1837
World Rank
#1926
Historical Rank

Why Is Alan W. Black Influential?
According to Wikipedia, Alan W Black is a Scottish computer scientist, known for his research on speech synthesis. He is a professor in the Language Technologies Institute at Carnegie Mellon University in Pittsburgh, Pennsylvania.
Alan W. Black's Published Works
Published Works
- Statistical Parametric Speech Synthesis (2007) (1485)
- Unit selection in a concatenative speech synthesis system using a large speech database (1996) (1437)
- Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory (2007) (1005)
- The CMU Arctic speech databases (2004) (646)
- Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation (2015) (616)
- The HMM-based speech synthesis system (HTS) version 2.0 (2007) (613)
- Festival Speech Synthesis System (1998) (507)
- Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices (2006) (455)
- The architecture of the Festival speech synthesis system (1998) (412)
- Two/Too Simple Adaptations of Word2Vec for Syntax Problems (2015) (369)
- Normalization of non-standard words (2001) (361)
- Automatically clustering similar units for unit selection in speech synthesis (1997) (350)
- The Dialog State Tracking Challenge (2013) (340)
- Style Transfer Through Back-Translation (2018) (316)
- Issues in building general letter to sound rules (1998) (292)
- Assigning phrase breaks from part-of-speech sequences (1997) (289)
- Let's go public! taking a spoken dialog system to the real world (2005) (281)
- Spectral Mapping Using Artificial Neural Networks for Voice Conversion (2010) (262)
- The Second Conversational Intelligence Challenge (ConvAI2) (2019) (250)
- Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model (2008) (249)
- Voice conversion using Artificial Neural Networks (2009) (233)
- Measuring Bias in Contextualized Word Representations (2019) (233)
- The blizzard challenge - 2005: evaluating corpus-based speech synthesis on common datasets (2005) (223)
- Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings (2019) (200)
- Optimising selection of units from speech databases for concatenative synthesis (1995) (196)
- A Dataset for Document Grounded Conversations (2018) (175)
- Letter to sound rules for accented lexicon compression (1998) (167)
- Flite: a small fast run-time synthesis engine (2001) (158)
- Prosody and the Selection of Source Units for Concatenative Synthesis (1997) (158)
- Doing research on a deployed spoken dialogue system: one year of let's go! experience (2006) (151)
- Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter (2005) (145)
- Text-Independent Voice Conversion Based on Unit Selection (2006) (141)
- CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling (2006) (137)
- Not All Contexts Are Created Equal: Better Word Representations with Variable Attention (2015) (132)
- Unit size in unit selection speech synthesis (2003) (128)
- Limited domain synthesis (2000) (125)
- CHATR: a generic speech synthesis system (1994) (108)
- The Zero Resource Speech Challenge 2019: TTS without T (2019) (101)
- An HMM-Based Speech Synthesis System Applied to English (2003) (99)
- Politeness Transfer: A Tag and Generate Approach (2020) (97)
- Generating F0 contours from ToBI labels using linear regression (1996) (90)
- Character-based Neural Machine Translation (2015) (90)
- Strategy and Policy Learning for Non-Task-Oriented Conversational Systems (2016) (89)
- Acoustic-to-articulatory inversion mapping with Gaussian mixture model (2004) (87)
- Sub-Phonetic Modeling For Capturing Pronunciation Variations For Conversational Speech Synthesis (2006) (81)
- Let's Go: improving spoken dialog systems for the elderly and non-natives (2003) (79)
- Universal Phone Recognition with a Multilingual Allophone System (2020) (79)
- The IIIT-H Indic Speech Databases (2012) (78)
- Perfect synthesis for all of the people all of the time (2002) (78)
- Microblogs as Parallel Corpora (2013) (77)
- A unit selection approach to F0 modeling and its application to emphasis (2003) (76)
- Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion (2008) (76)
- A Survey of Code-switched Speech and Language Processing (2019) (75)
- Using decision trees within the tilt intonation model to predict F0 contours (1999) (74)
- Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis (2004) (74)
- Socially-Aware Virtual Agents: Automatically Assessing Dyadic Rapport from Temporal Patterns of Behavior (2016) (74)
- Sequence-Based Multi-Lingual Low Resource Speech Recognition (2018) (73)
- Knowledge of language origin improves pronunciation accuracy of proper names (2001) (72)
- Unit selection and emotional speech (2003) (72)
- Spoken Dialog Challenge 2010: Comparison of Live and Control Test Results (2011) (70)
- Heterogeneous relation graphs as a formalism for representing linguistic information (2001) (69)
- Generating F0 contours from ToBI labels using linear regression (1996) (66)
- Recent development of the HMM-based speech synthesis system (HTS) (2009) (65)
- Task and domain specific modelling in the Carnegie Mellon communicator system (2000) (64)
- Multilingual text-to-speech synthesis (2004) (63)
- Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning (2016) (63)
- Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (62)
- SPICE: web-based tools for rapid language adaptation in speech processing systems (2007) (62)
- The Dialog State Tracking Challenge Series (2014) (60)
- Optimal data selection for unit selection synthesis (2001) (58)
- Is voice transformation a threat to speaker identification? (2008) (56)
- SABLE: a standard for TTS markup (1998) (56)
- CMU Wilderness Multilingual Speech Dataset (2019) (56)
- Text processing for text-to-speech systems in Indian languages (2007) (55)
- Paraphrasing 4 Microblog Normalization (2013) (55)
- Evaluating and correcting phoneme segmentation for unit selection synthesis (2003) (54)
- Speaker de-identification via voice transformation (2009) (53)
- Automatic Keyword Extraction on Twitter (2015) (52)
- Speech synthesis by phonological structure matching (1999) (52)
- Improving the understandability of speech synthesis by modeling speech in noise (2005) (51)
- Speechalator: two-way speech-to-speech translation on a consumer PDA (2003) (48)
- Question Answering for Privacy Policies: Combining Computational and Legal Perspectives (2019) (48)
- Learning Conversational Systems that Interleave Task and Non-Task Content (2017) (48)
- Should You Fine-Tune BERT for Automated Essay Scoring? (2020) (47)
- Automatic building of synthetic voices from large multi-paragraph speech databases (2007) (47)
- Identifying speakers in children's stories for speech synthesis (2003) (47)
- Generating F0 contours for speech synthesis using the tilt intonation theory (1997) (46)
- Bootstrapping phonetic lexicons for new languages (2004) (45)
- Segmentation of Monologues in Audio Books for Building Synthetic Voices (2011) (44)
- Voice convergin: Speaker de-identification by voice transformation (2009) (42)
- Exploring Controllable Text Generation Techniques (2020) (42)
- Spoken Dialog Challenge 2010 (2010) (42)
- A Wizard-of-Oz Study on A Non-Task-Oriented Dialog Systems That Reacts to User Engagement (2016) (42)
- Experiments with Unit Selection Speech Databases for Indian Languages (2003) (40)
- On the use of automatically generated discourse-level information in a concept-to-speech synthesis system (1998) (39)
- Chatbot Evaluation and Database Expansion via Crowdsourcing (2016) (38)
- Assigning intonation elements and prosodic phrasing for English speech synthesis from high level linguistic input (1994) (37)
- The Blizzard Challenge 2006 (2006) (36)
- A Computational Framework for Lexical Description (1987) (36)
- Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion (2015) (35)
- Articulatory features for expressive speech synthesis (2012) (35)
- Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the “Speaking Rosetta” JSALT 2017 Workshop (2018) (35)
- Thai automatic speech recognition (2005) (35)
- Towards a universal speech interface (2000) (34)
- Topological Sort for Sentence Ordering (2020) (34)
- Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies (2006) (33)
- Random forests for statistical speech synthesis (2015) (33)
- Synthesizing conversational intonation from a linguistically rich input (1994) (33)
- A Dictionary and Morphological Analyser for English (1986) (32)
- Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques (2018) (32)
- Automatic Recognition of Conversational Strategies in the Service of a Socially-Aware Dialog System (2016) (32)
- Intent transfer in speech-to-speech machine translation (2012) (31)
- Optimizing segment label boundaries for statistical speech synthesis (2009) (31)
- Statistically trained orthographic to sound models for Thai (2000) (31)
- “Love ya, jerkface”: Using Sparse Log-Linear Models to Build Positive and Impolite Relationships with Teens (2012) (30)
- Arabic in my hand: small-footprint synthesis of Egyptian Arabic (2003) (30)
- Finite State Machines from Feature Grammars (1989) (30)
- The Blizzard Challenge 2013 - Indian Language Tasks (2013) (30)
- Online Supervised Learning of Non-Understanding Recovery Policies (2006) (30)
- Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text (2016) (30)
- Equity Beyond Bias in Language Technologies for Education (2019) (29)
- Predicting the Intonation of Discourse Segments from Examples in Dialogue Speech (1997) (29)
- Data-driven phrasing for speech synthesis in low-resource languages (2012) (29)
- Unit selection voice for Amharic using Festvox (2004) (29)
- Diphone collection and synthesis (2000) (29)
- Formalisms for Morphographemic Description (1987) (29)
- Using articulatory position data in voice transformation (2007) (28)
- Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation (2019) (28)
- Automatic discovery of a phonetic inventory for unwritten languages for statistical speech synthesis (2014) (28)
- Crowdsourcing High-Quality Parallel Data Extraction from Twitter (2014) (28)
- Storyboarding of Recipes: Grounded Contextual Generation (2019) (27)
- Speech Synthesis of Code-Mixed Text (2016) (27)
- TONGUES: rapid development of a speech-to-speech translation system (2002) (27)
- Automatic Prediction of Friendship via Multi-model Dyadic Features (2013) (26)
- An Empirical Study of Self-Disclosure in Spoken Dialogue Systems (2018) (26)
- Three methods of intonation modeling (1998) (26)
- Unit selection without a phoneme set (2002) (25)
- The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System (2007) (25)
- Speechalator: Two-Way Speech-to-Speech Translation in Your Hand (2003) (25)
- WebShodh: A Code Mixed Factoid Question Answering System for Web (2017) (25)
- Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs (2006) (25)
- Creating Multi-Modal, User-Centric Records of Meetings with the Carnegie Mellon Meeting Recorder Architecture (2004) (24)
- A Grammar Based Approach to Style Specific Phrase Prediction (2011) (24)
- Global syllable set for building speech synthesis in Indian languages (2008) (24)
- ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet (2021) (23)
- A family-of-models approach to HMM-based segmentation for unit selection speech synthesis (2004) (23)
- “My Way of Telling a Story”: Persona based Grounded Story Generation (2019) (23)
- A Thai Speech Translation System for Medical Dialogs (2004) (23)
- Boosting Dialog Response Generation (2019) (22)
- Towards Zero-shot Learning for Automatic Phonemic Transcription (2020) (22)
- An annotation scheme for concept-to-speech synthesis (1999) (22)
- Grounding ‘Grounding’ in NLP (2021) (22)
- Phone Features Improve Speech Translation (2020) (22)
- ClarQ: A large-scale and diverse dataset for Clarification Question Generation (2020) (22)
- Using articulatory features and inferred phonological segments in zero resource speech processing (2015) (21)
- Festvox: Tools for Creation and Analyses of Large Speech Corpora (2010) (21)
- Non-standard word and homograph resolution for Asian language text analysis (2000) (21)
- Pronunciation modeling for dialectal Arabic speech recognition (2009) (21)
- NoiseQA: Challenge Set Evaluation for User-Centric Question Answering (2021) (21)
- Bootstrapping Text-to-Speech for speech processing in languages without an orthography (2013) (21)
- Quantifying Social Biases in Contextual Word Representations (2019) (21)
- The Spoken Dialogue Challenge (2009) (21)
- Dialog State Tracking Challenge Handbook (2012) (21)
- Building an ASR System for a Low-resource Language Through the Adaptation of a High-resource Language ASR System: Preliminary Results (2017) (20)
- Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system (2006) (20)
- Normalization of Non-Standard Words: WS '99 Final Report (1999) (20)
- Entropy-based Pruning for Phrase-based Machine Translation (2012) (20)
- Flexible Speech Translation Systems (2006) (20)
- Foreign accents in synthetic speech: development and evaluation (2005) (20)
- Speaker Clustering for Multilingual Synthesis (2006) (20)
- Focused Attention Improves Document-Grounded Generation (2021) (20)
- A Statistical Phrase/Accent Model for Intonation Modeling (2011) (19)
- Prediction of pronunciation variations for speech synthesis: a data-driven approach (2005) (19)
- Let's go lab: a platform for evaluation of spoken dialog systems with real world users (2008) (19)
- Impact of durational outlier removal from unit selection catalogs (2004) (19)
- Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems (2012) (19)
- Using speech in noise to improve understandability for elderly listeners (2005) (19)
- Modulation spectrum-based post-filter for GMM-based Voice Conversion (2014) (18)
- Speech synthesis for educational technology (2007) (18)
- Analysis of Unknown Words through Morphological Decomposition (1991) (18)
- Significance of early tagged contextual graphemes in grapheme based speech synthesis and recognition systems (2008) (18)
- The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion (2016) (18)
- On Building Mixed Lingual Speech Synthesis Systems (2017) (18)
- Improving speech synthesis of machine translation output (2010) (18)
- Emotion Identification for Evaluation of Synthesized Emotional Speech (2012) (17)
- Learning speaker-specific phrase breaks for text-to-speech systems (2010) (17)
- A research platform for multi-agent dialogue dynamics (2004) (17)
- Field Testing the Tongues Speech-to-Speech Machine Translation System (2002) (17)
- A situation theoretic approach to computational semantics (1993) (17)
- Speech Synthesis for Mixed-Language Navigation Instructions (2017) (17)
- Visualizing Topical Quotations Over Time to Understand News Discourse (2010) (16)
- Case Study: Deontological Ethics in NLP (2020) (16)
- Data Augmentation for Neural Online Chats Response Selection (2018) (16)
- Language Informed Modeling of Code-Switched Text (2018) (16)
- Image2speech: Automatically generating audio descriptions of images (2017) (16)
- Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens (2016) (16)
- The Blizzard Challenge 2014 (2014) (15)
- DialCrowd: A toolkit for easy dialog system assessment (2018) (15)
- WriterForcing: Generating more interesting story endings (2019) (15)
- Towards Improving the Naturalness of Social Conversations with Dialogue Systems (2010) (15)
- CMU Blizzard 2007: A Hybrid Acoustic Unit Selection System from Statistically Predicted Parameters (2007) (15)
- Challenges in Speech Synthesis (2010) (15)
- What Code-Switching Strategies are Effective in Dialog Systems? (2020) (15)
- Creating a database of speech in noise for unit selection synthesis (2004) (15)
- Multilingual Speech Recognition with Corpus Relatedness Sampling (2019) (15)
- Building VoiceXML-based applications (2002) (14)
- Text to speech in new languages without a standardized orthography (2013) (14)
- Prominence prediction for supersentential prosodic modeling based on a new database (2004) (14)
- An Incremental Turn-Taking Model with Active System Barge-in for Spoken Dialog Systems (2015) (14)
- Multilingual Speech Synthesis (2006) (14)
- The First Conversational Intelligence Challenge (2018) (14)
- Evaluation and collection of proper name pronunciations online (2002) (14)
- Utterance Selection Techniques for TTS Systems Using Found Speech (2016) (14)
- Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis (2015) (14)
- A Dynamic Strategy Coach for Effective Negotiation (2019) (13)
- Incremental Adaptation of Speech-to-Speech Translation (2009) (13)
- Named entity translation using anchor texts (2011) (13)
- Modeling Pause-Duration for Style-Specific Speech Synthesis (2012) (13)
- Incorporating durational modification in voice transformation (2008) (13)
- Generating F0 contours from ToBI labels using linear regression (2021) (13)
- Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis (2016) (12)
- Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History (2019) (12)
- Domain Robust Feature Extraction for Rapid Low Resource ASR Development (2018) (12)
- Intelligibility of machine translation output in speech synthesis (2006) (12)
- Audio signals in speech interfaces (2000) (12)
- A study on speech over the telephone and aging (2001) (12)
- Optimal Utterance Selection for Unit Selection Speech Synthesis Databases (2003) (11)
- Post-Filters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (11)
- Adaptation techniques for speech synthesis in under-resourced languages (2010) (11)
- Optimizations and fitting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis (2013) (11)
- Rapid development of speech-to-speech translation systems (2002) (11)
- The ARIEL-CMU situation frame detection pipeline for LoReHLT16: a model translation approach (2018) (11)
- Speech Technology for Unwritten Languages (2020) (11)
- Modelling a Noisy-channel for Voice Conversion Using Articulatory Features (2012) (11)
- KLATTSTAT: knowledge-based parametric speech synthesis (2010) (11)
- An Investigation of Convolution Attention Based Models for Multilingual Speech Synthesis of Indian Languages (2018) (11)
- DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues (2021) (10)
- Mining Parallel Corpora from Sina Weibo and Twitter (2016) (10)
- Multimodal Polynomial Fusion for Detecting Driver Distraction (2018) (10)
- Modified post-filter to recover modulation spectrum for HMM-based speech synthesis (2014) (10)
- A Corpus for Large-Scale Phonetic Typology (2020) (10)
- Open-Source Consumer-Grade Indic Text To Speech (2016) (10)
- Utterance classification in speech-to-speech translation for zero-resource languages in the hospital administration domain (2015) (10)
- Automatic Detection of Code-switching Style from Acoustics (2018) (10)
- Ordinal Triplet Loss: Investigating Sleepiness Detection from Speech (2019) (10)
- Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages (2020) (9)
- Hierarchical Phone Recognition with Compositional Phonetics (2021) (9)
- Evaluating a dialog language generation system: comparing the mountain system to other NLG approaches (2010) (9)
- Handling large audio files in audio books for building synthetic voices (2010) (9)
- Text-dependent pathological voice detection (2012) (9)
- Voice building from insufficient data - classroom experiences with web-based language development tools (2007) (9)
- AlloVera: A Multilingual Allophone Database (2020) (9)
- A Review of Personality in Voice-Based Man Machine Interaction (2011) (9)
- Building a better Indian English voice using "more data" (2007) (8)
- Deriving Phonetic Transcriptions and Discovering Word Segmentations for Speech-to-Speech Translation in Low-Resource Settings (2016) (8)
- Helping Users Understand Privacy Notices with Automated Query Answering Functionality: An Exploratory Study (2018) (8)
- Proceedings of the 7th European Workshop on Natural Language Generation (1999) (8)
- Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs (2019) (8)
- Accent Group modeling for improved prosody in statistical parametric speech synthesis (2013) (8)
- Multimodal HALEF: An Open-Source Modular Web-Based Multimodal Dialog Framework (2016) (8)
- Semi-Supervised Learning of Acoustic Driven Prosodic Phrase Breaks for Text-to-Speech Systems (2010) (8)
- Minimum error rate training for phrasing in speech synthesis (2013) (8)
- Improving speech synthesis for noisy environments (2010) (8)
- MOUNTAIN: A Translation-based Approach to Natural Language Generation for Dialog Systems (2009) (8)
- Cross-speaker articulatory position data for phonetic feature prediction (2005) (8)
- Principled Frameworks for Evaluating Ethics in NLP Systems (2019) (7)
- Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Feedback Utterances (2016) (7)
- CTC Alignments Improve Autoregressive Translation (2022) (7)
- Analyzing Wikipedia Deletion Debates with a Group Decision-Making Forecast Model (2019) (7)
- On data driven parametric backchannel synthesis for expressing attentiveness in conversational agents (2016) (7)
- A Deep Learning Approach to Data-driven Parameterizations for Statistical Parametric Speech Synthesis (2014) (7)
- Building sleek synthesizers for multi-lingual screen reader (2008) (7)
- Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data (2021) (7)
- Tackling Code-Switched NER: Participation of CMU (2018) (7)
- Comparison of algorithms for predicting accent placement in English speech synthesis (1995) (7)
- User Engagement Study with Virtual Agents Under Different Cultural Contexts (2016) (7)
- Detecting Entailment in Code-Mixed Hindi-English Conversations (2020) (6)
- Analysis and modeling of "focus" in context (2013) (6)
- Discriminative Phrase-based Lexicalized Reordering Models using Weighted Reordering Graphs (2011) (6)
- Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora (2019) (6)
- End-to-End Speech Summarization Using Restricted Self-Attention (2022) (6)
- Style Transfer Through Multilingual and Feedback-Based Back-Translation (2018) (6)
- Formality Style Transfer for Noisy, User-generated Conversations: Extracting Labeled, Parallel Data from Unlabeled Corpora (2019) (6)
- Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching (2021) (6)
- Using acoustic models to choose pronunciation variations for synthetic voices (2003) (6)
- Mere account mein kitna balance hai? - On building voice enabled Banking Services for Multilingual Communities (2020) (6)
- ASR2K: Speech Recognition for Around 2000 Languages without Audio (2022) (6)
- Improving ASR by integrating lecture audio and slides (2013) (6)
- A Dataset of Topic-Oriented Human-to-Chatbot Dialogues (2018) (6)
- Towards Minimal Supervision BERT-based Grammar Error Correction (2020) (6)
- Multilingual Phonetic Dataset for Low Resource Speech Recognition (2021) (6)
- Introduction to the Issue on Statistical Parametric Speech Synthesis (2014) (6)
- Generating time-constrained audio presentations of structured information (2006) (5)
- Two-Pass Low Latency End-to-End Spoken Language Understanding (2022) (5)
- Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding (2021) (5)
- The ARIEL-CMU Systems for LoReHLT18 (2019) (5)
- NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls (2008) (5)
- Top-Down Structurally-Constrained Neural Response Generation with Lexicalized Probabilistic Context-Free Grammar (2019) (5)
- Improving Relative-Entropy Pruning using Statistical Significance (2012) (5)
- Text-To-Speech for Languages without an Orthography (2012) (5)
- Multimodal, Multilingual Grapheme-to-Phoneme Conversion for Low-Resource Languages (2019) (5)
- Task-Specific Pre-Training and Cross Lingual Transfer for Code-Switched Data (2021) (5)
- Building African Voices (2022) (5)
- Understanding Linguistic Accommodation in Code-Switched Human-Machine Dialogues (2020) (5)
- Zero-shot Learning for Speech Recognition with Universal Phonetic Model (2018) (5)
- Blizzard 2008: Experiments on Unit Size for Unit Selection Speech Synthesis (2008) (4)
- Submission from CMU for Blizzard Challenge 2019 (2018) (4)
- Speech Parameter Generation Algorithm Considering Modulation Spectrum for Statistical Parametric Speech Synthesis (2015) (4)
- Recovery of acronyms, out-of-lattice words and pronunciations from parallel multilingual speech (2012) (4)
- Parallel combination of multilingual speech streams for improved ASR (2012) (4)
- Learning to Order Graph Elements with Application to Multilingual Surface Realization (2019) (4)
- Segment Level Voice Conversion with Recurrent Neural Networks (2017) (4)
- SANTLR: Speech Annotation Toolkit for Low Resource Languages (2019) (4)
- LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification (2020) (4)
- The Blizzard Challenge 2006 CMU Entry introducing hybrid trajectory-selection synthesis (2006) (4)
- Multimodal Speech Summarization Through Semantic Concept Learning (2021) (4)
- Ugloss: a Framework for Improving Spoken Language Generation Understandability (2007) (4)
- Parallel combination of speech streams for improved ASR (2012) (4)
- Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data (2019) (4)
- Phoneme Level Language Models for Sequence Based Low Resource ASR (2019) (4)
- Elderly perception of speech from a computer (2002) (3)
- Using a Computational Situation Theoretic Language to investigate Contemporary Semantic Theories (1993) (3)
- A Resource for Computational Experiments on Mapudungun (2019) (3)
- Speech Summarization using Restricted Self-Attention (2021) (3)
- Universal grapheme-based speech synthesis (2015) (3)
- Stance Classification, Outcome Prediction, and Impact Assessment: NLP Tasks for Studying Group Decision-Making (2019) (3)
- Dataset Analysis and Augmentation for Emoji-Sensitive Irony Detection (2019) (3)
- Learning Disentangled Representation in Latent Stochastic Models: A Case Study with Image Captioning (2019) (3)
- Embedding DRT in a Situation Theoretic Framework (1992) (3)
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Mixing (2021) (3)
- Style Variation as a Vantage Point for Code-Switching (2020) (3)
- Intent Classification Using Pre-Trained Embeddings For Low Resource Languages (2021) (3)
- Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios (2020) (3)
- Investigating Utterance Level Representations for Detecting Intent from Acoustics (2018) (3)
- Intent Recognition and Unsupervised Slot Identification for Low-Resourced Spoken Dialog Systems (2021) (3)
- Improved punctuation recovery through combination of multiple speech streams (2013) (3)
- Induction and Reference of Entities in a Visual Story (2019) (3)
- Describing Spoken Dialogue Systems Differences (2008) (3)
- Language Technologies for Humanitarian Aid (2006) (3)
- Deep Speech Synthesis from Articulatory Representations (2022) (3)
- Rapid Prototyping of a German TTS System (1998) (3)
- Reading between the Lines: Exploring Infilling in Visual Narratives (2020) (3)
- Using acoustics to improve pronunciation for synthesis of low resource languages (2015) (3)
- Unsupervised Phonetic and Word Level Discovery for Speech to Speech Translation for Unwritten Languages (2019) (3)
- Building Practical Spoken Dialog Systems (2008) (3)
- NAACL-HLT Workshop on Future directions and needs in the Spoken Dialog Community: Tools and Data (SDCTD 2012) (2012) (3)
- Linguistic Markers of Influence in Informal Interactions (2017) (3)
- Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble (2022) (2)
- A style capturing approach to F0 transformation in voice conversion (2013) (2)
- The blizzard machine learning challenge 2017 (2017) (2)
- On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization (2022) (2)
- Cross-Lingual Transfer for Speech Processing Using Acoustic Language Similarity (2021) (2)
- CMU Blizzard 2008: Optimally using a large database for unit selection synthesis (2008) (2)
- Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition (2022) (2)
- CMU GetGoing: An Understandable and Memorable Dialog System for Seniors (2019) (2)
- Measuring unsupervised acoustic clustering through phoneme pair merge-and-split tests (2005) (2)
- Visual Evaluation of Voice Transformation Based on Knowledge of Speaker (2006) (2)
- International Speech Communication Association (ISCA) Microsoft Research International Speech Communication Association (ISCA) Special Interest Group on Discourse and Dialogue (SIGDIAL) Dialogs on Dialogs Student Reading Group Organizing Committee: Advisory Committee: Workshop Program 10:30-12:00 M (2005) (1)
- Using the Tilt Intonation Model: A Data-Driven Approach (2001) (1)
- This Table is Different: A WordNet-Based Approach to Identifying References to Document Entities (2016) (1)
- Bag-of-Acoustic-Words for Mental Health Assessment: A Deep Autoencoding Approach (2019) (1)
- Introduction to NIPS 2017 Competition Track (2018) (1)
- Real Users and Real Dialog Systems: The Hard Challenge for SDS (2012) (1)
- Doing Research in a Deployed Spoken Dialog System: One Year of Let’s Go! Public Experience (2017) (1)
- Dialog State Tracking Challenge: Information For Prospective Participants (2012) (1)
- The CMU entry to blizzard machine learning challenge (2017) (1)
- Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages (2020) (1)
- Computational morphology of English (1988) (1)
- Distributed representation-based spoken word sense induction (2015) (1)
- Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis (2015) (1)
- Initiations and Interruptions in a Spoken Dialog System (2016) (1)
- Evaluating Gender Bias Transfer from Film Data (2022) (1)
- Speaker-Independent Acoustic-to-Articulatory Speech Inversion (2023) (1)
- Unconventional Approaches to Gathering and Sharing Resources for Spoken Dialog Research (2017) (1)
- Data-driven intonational phonology (2013) (1)
- Mixed-mode Multilinguality in TTS: The Case of Canadian French (2006) (1)
- Nonlinear ISA with Auxiliary Variables for Learning Speech Representations (2020) (1)
- Submission from CMU towards 1st MultiTarget Speaker Detection and Identification Challenge (2018) (1)
- Improving speech systems built from very little data (2008) (1)
- Challenges in Automated Question Answering for Privacy Policies (2019) (1)
- Phone Distribution Estimation for Low Resource Languages (2021) (1)
- Future Directions in Spoken Dialog Systems: A Community of Possibilities (2012) (0)
- Phone Inventories and Recognition for Every Language (2022) (0)
- Intent classification using pre-trained language agnostic embeddings for low resource languages (2021) (0)
- Multimodal Detection of Driver Distraction FINAL RESEARCH REPORT (2018) (0)
- Applause: A Learning Tool for Low-Resource Languages (2014) (0)
- Proc. 2009 Asia-Pacific Signal and Information Processing Association (APSIPA) (2009) (0)
- Dissecting the components and factors of Neural Text Generation (2020) (0)
- Some Different Approaches to DRT (1997) (0)
- Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization (2022) (0)
- Using Speaker ID to Discover Repeat Callers of a Spoken Dialog System (2011) (0)
- Proceedings of The 8th International Global WordNet Conference (2016) (0)
- Towards Using Heterogeneous Relation Graphs for End-to-End TTS (2021) (0)
- 2 Articulatory Features 2 . 1 Types of Articulatory Representations (2011) (0)
- Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs (2018) (0)
- Speech Translation for Triage of Emergency Phonecalls in Minority Languages (2008) (0)
- Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units (2021) (0)
- Text Normalization for Speech Systems for All Languages (2022) (0)
- The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998 (1998) (0)
- Formal Properties of Feature Grammars (2007) (0)
- RE-WOCHAT : Workshop on Collecting and Generating Resources for Chatbots and Conversational Agents-Development and Evaluation Workshop Programme ( May 28 (2016) (0)
- Quality Improvement Approaches Based on the Modulation Spectrum to Statistical Parametric Speech Synthesis (2015) (0)
- The Blizzard Challenge: evaluating corpus-based speech synthesis techniques (2007) (0)
- Integrating Verbal and Nonverbal Input into a Dynamic Response Spoken Dialogue System (2017) (0)
- AUGMENTING NON-COLLABORATIVE DIALOG SYS- (2019) (0)
- Detecting Driver Distraction (2018) (0)
- Chapter 2 Challenges in Speech Synthesis (2010) (0)
- Towards Automatic Route Description Unification in Spoken Dialog Systems (2021) (0)
- DUALGRAM: An Efficient Method for Representing Limited-Domain Language Models (1992) (0)
- Towards Improving Intelligibility of Black-Box Speech Synthesizers in Noise (2018) (0)
- Optionality in evaluating prosody (2004) (0)
- Dialogue Context Encoder Structure Encoder Graph Encoding ( GAT ) Structure Encoder u 1 u 2 u 3 u 4 Graph Pooling Graph Pooling Graph Encoding ( GAT ) GCN-ASAPGCN-ASAP Utterance Embedding Utterance Generation (2021) (0)
- Incorporating Dialectal Features in Synthesized Speech using Voice Conversion Techniques (2018) (0)
- Entity Skeletons for Visual Storytelling (2020) (0)
- The Real Challenge 2014: Progress and Prospects (2015) (0)
- A Survey of Code-switched Speech and Language Processing (2019) (0)
- A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution (2022) (0)
- Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models (2022) (0)
- Understandable production of massive synthesis (2007) (0)
- Multimodal Detection of Driver Distraction (2017) (0)
Other Resources About Alan W. Black
What Schools Are Affiliated With Alan W. Black?
Alan W. Black is affiliated with the following schools: