Alan W. Black

Q: What Schools Are Affiliated With Alan W. Black?

Alan W. Black is affiliated with the following schools: Carnegie Mellon University, University of Edinburgh, and University of Washington.

Alan W. Black's AcademicInfluence.com Rankings

Computer Science: World Rank #3798, Historical Rank #3993

Computational Linguistics: World Rank #467, Historical Rank #475

Database: World Rank #1837, Historical Rank #1926

Why Is Alan W. Black Influential?

According to Wikipedia, Alan W Black is a Scottish computer scientist, known for his research on speech synthesis. He is a professor in the Language Technologies Institute at Carnegie Mellon University in Pittsburgh, Pennsylvania.

Alan W. Black's Published Works

[Chart: number of citations in a given year to any of this author's works]

[Chart: total number of citations to the works this author published in a given year, highlighting the author's most important publications]

Published Works

Statistical Parametric Speech Synthesis (2007) (1485)
Unit selection in a concatenative speech synthesis system using a large speech database (1996) (1437)
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory (2007) (1005)
The CMU Arctic speech databases (2004) (646)
Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation (2015) (616)
The HMM-based speech synthesis system (HTS) version 2.0 (2007) (613)
Festival Speech Synthesis System (1998) (507)
Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices (2006) (455)
The architecture of the Festival speech synthesis system (1998) (412)
Two/Too Simple Adaptations of Word2Vec for Syntax Problems (2015) (369)
Normalization of non-standard words (2001) (361)
Automatically clustering similar units for unit selection in speech synthesis (1997) (350)
The Dialog State Tracking Challenge (2013) (340)
Style Transfer Through Back-Translation (2018) (316)
Issues in building general letter to sound rules (1998) (292)
Assigning phrase breaks from part-of-speech sequences (1997) (289)
Let's go public! taking a spoken dialog system to the real world (2005) (281)
Spectral Mapping Using Artificial Neural Networks for Voice Conversion (2010) (262)
The Second Conversational Intelligence Challenge (ConvAI2) (2019) (250)
Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model (2008) (249)
Voice conversion using Artificial Neural Networks (2009) (233)
Measuring Bias in Contextualized Word Representations (2019) (233)
The blizzard challenge - 2005: evaluating corpus-based speech synthesis on common datasets (2005) (223)
Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings (2019) (200)
Optimising selection of units from speech databases for concatenative synthesis (1995) (196)
A Dataset for Document Grounded Conversations (2018) (175)
Letter to sound rules for accented lexicon compression (1998) (167)
Flite: a small fast run-time synthesis engine (2001) (158)
Prosody and the Selection of Source Units for Concatenative Synthesis (1997) (158)
Doing research on a deployed spoken dialogue system: one year of let's go! experience (2006) (151)
Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter (2005) (145)
Text-Independent Voice Conversion Based on Unit Selection (2006) (141)
CLUSTERGEN: a statistical parametric synthesizer using trajectory modeling (2006) (137)
Not All Contexts Are Created Equal: Better Word Representations with Variable Attention (2015) (132)
Unit size in unit selection speech synthesis (2003) (128)
Limited domain synthesis (2000) (125)
CHATR: a generic speech synthesis system (1994) (108)
The Zero Resource Speech Challenge 2019: TTS without T (2019) (101)
AN HMM-BASED SPEECH SYNTHESIS SYSTEM APPLIED TO ENGLISH (2003) (99)
Politeness Transfer: A Tag and Generate Approach (2020) (97)
Generating F0 contours from ToBI labels using linear regression (1996) (90)
Character-based Neural Machine Translation (2015) (90)
Strategy and Policy Learning for Non-Task-Oriented Conversational Systems (2016) (89)
Acoustic-to-articulatory inversion mapping with Gaussian mixture model (2004) (87)
Sub-Phonetic Modeling For Capturing Pronunciation Variations For Conversational Speech Synthesis (2006) (81)
LET's GO: improving spoken dialog systems for the elderly and non-natives (2003) (79)
Universal Phone Recognition with a Multilingual Allophone System (2020) (79)
The IIIT-H Indic Speech Databases (2012) (78)
Perfect synthesis for all of the people all of the time (2002) (78)
Microblogs as Parallel Corpora (2013) (77)
A unit selection approach to F0 modeling and its application to emphasis (2003) (76)
Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion (2008) (76)
A Survey of Code-switched Speech and Language Processing (2019) (75)
Using decision trees within the tilt intonation model to predict F0 contours (1999) (74)
Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis (2004) (74)
Socially-Aware Virtual Agents: Automatically Assessing Dyadic Rapport from Temporal Patterns of Behavior (2016) (74)
Sequence-Based Multi-Lingual Low Resource Speech Recognition (2018) (73)
Knowledge of language origin improves pronunciation accuracy of proper names (2001) (72)
Unit selection and emotional speech (2003) (72)
Spoken Dialog Challenge 2010: Comparison of Live and Control Test Results (2011) (70)
Heterogeneous relation graphs as a formalism for representing linguistic information (2001) (69)
Generating F0 contours from ToBI labels using linear regression (1996) (66)
Recent development of the HMM-based speech synthesis system (HTS) (2009) (65)
Task and domain specific modelling in the Carnegie Mellon communicator system (2000) (64)
Multilingual text-to-speech synthesis (2004) (63)
Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning (2016) (63)
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (62)
SPICE: web-based tools for rapid language adaptation in speech processing systems (2007) (62)
The Dialog State Tracking Challenge Series (2014) (60)
Optimal data selection for unit selection synthesis (2001) (58)
Is voice transformation a threat to speaker identification? (2008) (56)
SABLE: a standard for TTS markup (1998) (56)
CMU Wilderness Multilingual Speech Dataset (2019) (56)
Text processing for text-to-speech systems in Indian languages (2007) (55)
Paraphrasing 4 Microblog Normalization (2013) (55)
Evaluating and correcting phoneme segmentation for unit selection synthesis (2003) (54)
Speaker de-identification via voice transformation (2009) (53)
Automatic Keyword Extraction on Twitter (2015) (52)
Speech synthesis by phonological structure matching (1999) (52)
Improving the understandability of speech synthesis by modeling speech in noise (2005) (51)
Speechalator: two-way speech-to-speech translation on a consumer PDA (2003) (48)
Question Answering for Privacy Policies: Combining Computational and Legal Perspectives (2019) (48)
Learning Conversational Systems that Interleave Task and Non-Task Content (2017) (48)
Should You Fine-Tune BERT for Automated Essay Scoring? (2020) (47)
Automatic building of synthetic voices from large multi-paragraph speech databases (2007) (47)
Identifying speakers in children's stories for speech synthesis (2003) (47)
Generating f0 contours for speech synthesis using the tilt intonation theory. (1997) (46)
Bootstrapping phonetic lexicons for new languages (2004) (45)
Segmentation of Monologues in Audio Books for Building Synthetic Voices (2011) (44)
Voice convergin: Speaker de-identification by voice transformation (2009) (42)
Exploring Controllable Text Generation Techniques (2020) (42)
Spoken Dialog Challenge 2010 (2010) (42)
A Wizard-of-Oz Study on A Non-Task-Oriented Dialog Systems That Reacts to User Engagement (2016) (42)
Experiments with Unit Selection Speech Databases for Indian Languages (2003) (40)
On the use of automatically generated discourse-level information in a concept-to-speech synthesis system (1998) (39)
Chatbot Evaluation and Database Expansion via Crowdsourcing (2016) (38)
Assigning intonation elements and prosodic phrasing for English speech synthesis from high level linguistic input (1994) (37)
The Blizzard Challenge 2006 (2006) (36)
A Computational Framework for Lexical Description (1987) (36)
Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion (2015) (35)
Articulatory features for expressive speech synthesis (2012) (35)
Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the “Speaking Rosetta” JSALT 2017 Workshop (2018) (35)
Thai automatic speech recognition (2005) (35)
Towards a universal speech interface (2000) (34)
Topological Sort for Sentence Ordering (2020) (34)
Learning Pronunciation Dictionaries: Language Complexity and Word Selection Strategies (2006) (33)
Random forests for statistical speech synthesis (2015) (33)
Synthesizing conversational intonation from a linguistically rich input (1994) (33)
A Dictionary and Morphological Analyser for English (1986) (32)
Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques (2018) (32)
Automatic Recognition of Conversational Strategies in the Service of a Socially-Aware Dialog System (2016) (32)
Intent transfer in speech-to-speech machine translation (2012) (31)
Optimizing segment label boundaries for statistical speech synthesis (2009) (31)
Statistically trained orthographic to sound models for Thai (2000) (31)
“Love ya, jerkface”: Using Sparse Log-Linear Models to Build Positive and Impolite Relationships with Teens (2012) (30)
Arabic in my hand: small-footprint synthesis of egyptian arabic (2003) (30)
Finite State Machines from Feature Grammars (1989) (30)
The Blizzard Challenge 2013 - Indian Language Tasks (2013) (30)
ONLINE SUPERVISED LEARNING OF NON-UNDERSTANDING RECOVERY POLICIES (2006) (30)
Experiments with Cross-lingual Systems for Synthesis of Code-Mixed Text (2016) (30)
Equity Beyond Bias in Language Technologies for Education (2019) (29)
Predicting the Intonation of Discourse Segments from Examples in Dialogue Speech (1997) (29)
Data-driven phrasing for speech synthesis in low-resource languages (2012) (29)
Unit selection voice for Amharic using Festvox (2004) (29)
Diphone collection and synthesis (2000) (29)
Formalisms for Morphographemic Description (1987) (29)
Using articulatory position data in voice transformation (2007) (28)
Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation (2019) (28)
Automatic discovery of a phonetic inventory for unwritten languages for statistical speech synthesis (2014) (28)
Crowdsourcing High-Quality Parallel Data Extraction from Twitter (2014) (28)
Storyboarding of Recipes: Grounded Contextual Generation (2019) (27)
Speech Synthesis of Code-Mixed Text (2016) (27)
TONGUES: rapid development of a speech-to-speech translation system (2002) (27)
Automatic Prediction of Friendship via Multi-model Dyadic Features (2013) (26)
An Empirical Study of Self-Disclosure in Spoken Dialogue Systems (2018) (26)
Three methods of intonation modeling (1998) (26)
Unit selection without a phoneme set (2002) (25)
The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System (2007) (25)
Speechalator: Two-Way Speech-to-Speech Translation in Your Hand (2003) (25)
WebShodh: A Code Mixed Factoid Question Answering System for Web (2017) (25)
Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs (2006) (25)
Creating Multi-Modal, User-Centric Records of Meetings with the Carnegie Mellon Meeting Recorder Architecture (2004) (24)
A Grammar Based Approach to Style Specific Phrase Prediction (2011) (24)
Global syllable set for building speech synthesis in Indian languages (2008) (24)
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet (2021) (23)
A family-of-models approach to HMM-based segmentation for unit selection speech synthesis (2004) (23)
“My Way of Telling a Story”: Persona based Grounded Story Generation (2019) (23)
A Thai Speech Translation System for Medical Dialogs (2004) (23)
Boosting Dialog Response Generation (2019) (22)
Towards Zero-shot Learning for Automatic Phonemic Transcription (2020) (22)
An annotation scheme for concept-to-speech synthesis. (1999) (22)
Grounding ‘Grounding’ in NLP (2021) (22)
Phone Features Improve Speech Translation (2020) (22)
ClarQ: A large-scale and diverse dataset for Clarification Question Generation (2020) (22)
Using articulatory features and inferred phonological segments in zero resource speech processing (2015) (21)
Festvox: Tools for Creation and Analyses of Large Speech Corpora (2010) (21)
Non-standard word and homograph resolution for asian language text analysis (2000) (21)
Pronunciation modeling for dialectal arabic speech recognition (2009) (21)
NoiseQA: Challenge Set Evaluation for User-Centric Question Answering (2021) (21)
Bootstrapping Text-to-Speech for speech processing in languages without an orthography (2013) (21)
Quantifying Social Biases in Contextual Word Representations (2019) (21)
The Spoken Dialogue Challenge (2009) (21)
Dialog State Tracking Challenge Handbook (2012) (21)
Building an ASR System for a Low-resource Language Through the Adaptation of a High-resource Language ASR System: Preliminary Results (2017) (20)
Optimizing components for handheld two-way speech translation for an English-iraqi Arabic system (2006) (20)
Normalization of Non-Standard Words: WS '99 Final Report (1999) (20)
Entropy-based Pruning for Phrase-based Machine Translation (2012) (20)
Flexible Speech Translation Systems (2006) (20)
Foreign accents in synthetic speech: development and evaluation (2005) (20)
Speaker Clustering for Multilingual Synthesis (2006) (20)
Focused Attention Improves Document-Grounded Generation (2021) (20)
A Statistical Phrase/Accent Model for Intonation Modeling (2011) (19)
Prediction of pronunciation variations for speech synthesis: a data-driven approach (2005) (19)
Let's go lab: a platform for evaluation of spoken dialog systems with real world users (2008) (19)
Impact of durational outlier removal from unit selection catalogs (2004) (19)
Practical Evaluation of Human and Synthesized Speech for Virtual Human Dialogue Systems (2012) (19)
Using speech in noise to improve understandability for elderly listeners (2005) (19)
Modulation spectrum-based post-filter for GMM-based Voice Conversion (2014) (18)
Speech synthesis for educational technology (2007) (18)
Analysis of Unknown Words through Morphological Decomposition (1991) (18)
Significance of early tagged contextual graphemes in grapheme based speech synthesis and recognition systems (2008) (18)
The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion (2016) (18)
On Building Mixed Lingual Speech Synthesis Systems (2017) (18)
Improving speech synthesis of machine translation output (2010) (18)
Emotion Identification for Evaluation of Synthesized Emotional Speech (2012) (17)
Learning speaker-specific phrase breaks for text-to-speech systems (2010) (17)
A research platform for multi-agent dialogue dynamics (2004) (17)
Field Testing the Tongues Speech-to-Speech Machine Translation System (2002) (17)
A situation theoretic approach to computational semantics (1993) (17)
Speech Synthesis for Mixed-Language Navigation Instructions (2017) (17)
Visualizing Topical Quotations Over Time to Understand News Discourse (2010) (16)
Case Study: Deontological Ethics in NLP (2020) (16)
Data Augmentation for Neural Online Chats Response Selection (2018) (16)
Language Informed Modeling of Code-Switched Text (2018) (16)
Image2speech: Automatically generating audio descriptions of images (2017) (16)
Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens (2016) (16)
The Blizzard Challenge 2014 (2014) (15)
DialCrowd: A toolkit for easy dialog system assessment (2018) (15)
WriterForcing: Generating more interesting story endings (2019) (15)
Towards Improving the Naturalness of Social Conversations with Dialogue Systems (2010) (15)
CMU Blizzard 2007: A Hybrid Acoustic Unit Selection System from Statistically Predicted Parameters (2007) (15)
Challenges in Speech Synthesis (2010) (15)
What Code-Switching Strategies are Effective in Dialog Systems? (2020) (15)
Creating a database of speech in noise for unit selection synthesis (2004) (15)
Multilingual Speech Recognition with Corpus Relatedness Sampling (2019) (15)
Building voiceXML-based applications (2002) (14)
Text to speech in new languages without a standardized orthography (2013) (14)
Prominence prediction for supersentential prosodic modeling based on a new database (2004) (14)
An Incremental Turn-Taking Model with Active System Barge-in for Spoken Dialog Systems (2015) (14)
Multilingual Speech Synthesis (2006) (14)
The First Conversational Intelligence Challenge (2018) (14)
Evaluation and collection of proper name pronunciations online (2002) (14)
Utterance Selection Techniques for TTS Systems Using Found Speech (2016) (14)
Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis (2015) (14)
A Dynamic Strategy Coach for Effective Negotiation (2019) (13)
Incremental Adaptation of Speech-to-Speech Translation (2009) (13)
Named entity translation using anchor texts (2011) (13)
Modeling Pause-Duration for Style-Specific Speech Synthesis (2012) (13)
Incorporating durational modification in voice transformation (2008) (13)
Generating F0 contours from ToBI labels using linear regression (2021) (13)
Recurrent Neural Network Postfilters for Statistical Parametric Speech Synthesis (2016) (12)
Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History (2019) (12)
Domain Robust Feature Extraction for Rapid Low Resource ASR Development (2018) (12)
Intelligibility of machine translation output in speech synthesis (2006) (12)
Audio signals in speech interfaces (2000) (12)
A study on speech over the telephone and aging (2001) (12)
Optimal Utterance Selection for Unit Selection Speech Synthesis Databases (2003) (11)
Post-Filters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (11)
Adaptation techniques for speech synthesis in under-resourced languages (2010) (11)
Optimizations and fitting procedures for the liljencrants-fant model for statistical parametric speech synthesis (2013) (11)
Rapid development of speech-to-speech translation systems (2002) (11)
The ARIEL-CMU situation frame detection pipeline for LoReHLT16: a model translation approach (2018) (11)
Speech Technology for Unwritten Languages (2020) (11)
Modelling a Noisy-channel for Voice Conversion Using Articulatory Features (2012) (11)
KLATTSTAT: knowledge-based parametric speech synthesis (2010) (11)
An Investigation of Convolution Attention Based Models for Multilingual Speech Synthesis of Indian Languages (2018) (11)
DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues (2021) (10)
Mining Parallel Corpora from Sina Weibo and Twitter (2016) (10)
Multimodal Polynomial Fusion for Detecting Driver Distraction (2018) (10)
Modified post-filter to recover modulation spectrum for HMM-based speech synthesis (2014) (10)
A Corpus for Large-Scale Phonetic Typology (2020) (10)
Open-Source Consumer-Grade Indic Text To Speech (2016) (10)
Utterance classification in speech-to-speech translation for zero-resource languages in the hospital administration domain (2015) (10)
Automatic Detection of Code-switching Style from Acoustics (2018) (10)
Ordinal Triplet Loss: Investigating Sleepiness Detection from Speech (2019) (10)
Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages (2020) (9)
Hierarchical Phone Recognition with Compositional Phonetics (2021) (9)
Evaluating a dialog language generation system: comparing the mountain system to other NLG approaches (2010) (9)
Handling large audio files in audio books for building synthetic voices (2010) (9)
Text-dependent pathological voice detection (2012) (9)
Voice building from insufficient data - classroom experiences with web-based language development tools (2007) (9)
AlloVera: A Multilingual Allophone Database (2020) (9)
A Review of Personality in Voice-Based Man Machine Interaction (2011) (9)
Building a better Indian English voice using "more data" (2007) (8)
Deriving Phonetic Transcriptions and Discovering Word Segmentations for Speech-to-Speech Translation in Low-Resource Settings (2016) (8)
Helping Users Understand Privacy Notices with Automated Query Answering Functionality: An Exploratory Study (2018) (8)
Proceedings of the 7th European Workshop on Natural Language Generation (1999) (8)
Linguistic Versus Latent Relations for Modeling Coherent Flow in Paragraphs (2019) (8)
Accent Group modeling for improved prosody in statistical parameteric speech synthesis (2013) (8)
Multimodal HALEF: An Open-Source Modular Web-Based Multimodal Dialog Framework (2016) (8)
Semi-Supervised Learning of Acoustic Driven Prosodic Phrase Breaks for Text-to-Speech Systems (2010) (8)
Minimum error rate training for phrasing in speech synthesis (2013) (8)
Improving speech synthesis for noisy environments (2010) (8)
MOUNTAIN: A Translation-based Approach to Natural Language Generation for Dialog Systems (2009) (8)
Cross-speaker articulatory position data for phonetic feature prediction (2005) (8)
Principled Frameworks for Evaluating Ethics in NLP Systems (2019) (7)
Towards Building an Attentive Artificial Listener: On the Perception of Attentiveness in Feedback Utterances (2016) (7)
CTC Alignments Improve Autoregressive Translation (2022) (7)
Analyzing Wikipedia Deletion Debates with a Group Decision-Making Forecast Model (2019) (7)
On data driven parametric backchannel synthesis for expressing attentiveness in conversational agents (2016) (7)
A Deep Learning Approach to Data-driven Parameterizations for Statistical Parametric Speech Synthesis (2014) (7)
Building sleek synthesizers for multi-lingual screen reader (2008) (7)
Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data (2021) (7)
Tackling Code-Switched NER: Participation of CMU (2018) (7)
Comparison of algorithms for predicting accent placement in English speech synthesis. (1995) (7)
User Engagement Study with Virtual Agents Under Different Cultural Contexts (2016) (7)
Detecting Entailment in Code-Mixed Hindi-English Conversations (2020) (6)
Analysis and modeling of "focus" in context (2013) (6)
Discriminative Phrase-based Lexicalized Reordering Models using Weighted Reordering Graphs (2011) (6)
Variational Attention Using Articulatory Priors for Generating Code Mixed Speech Using Monolingual Corpora (2019) (6)
End-to-End Speech Summarization Using Restricted Self-Attention (2022) (6)
Style Transfer Through Multilingual and Feedback-Based Back-Translation (2018) (6)
Formality Style Transfer for Noisy, User-generated Conversations: Extracting Labeled, Parallel Data from Unlabeled Corpora (2019) (6)
Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching (2021) (6)
Using acoustic models to choose pronunciation variations for synthetic voices (2003) (6)
Mere account mein kitna balance hai? - On building voice enabled Banking Services for Multilingual Communities (2020) (6)
ASR2K: Speech Recognition for Around 2000 Languages without Audio (2022) (6)
Improving ASR by integrating lecture audio and slides (2013) (6)
A Dataset of Topic-Oriented Human-to-Chatbot Dialogues (2018) (6)
Towards Minimal Supervision BERT-based Grammar Error Correction (2020) (6)
Multilingual Phonetic Dataset for Low Resource Speech Recognition (2021) (6)
Introduction to the Issue on Statistical Parametric Speech Synthesis (2014) (6)
Generating time-constrained audio presentations of structured information (2006) (5)
Two-Pass Low Latency End-to-End Spoken Language Understanding (2022) (5)
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding (2021) (5)
The ARIEL-CMU Systems for LoReHLT18 (2019) (5)
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls (2008) (5)
Top-Down Structurally-Constrained Neural Response Generation with Lexicalized Probabilistic Context-Free Grammar (2019) (5)
Improving Relative-Entropy Pruning using Statistical Significance (2012) (5)
Text-To-Speech for Languages without an Orthography (2012) (5)
Multimodal, Multilingual Grapheme-to-Phoneme Conversion for Low-Resource Languages (2019) (5)
Task-Specific Pre-Training and Cross Lingual Transfer for Code-Switched Data (2021) (5)
Building African Voices (2022) (5)
Understanding Linguistic Accommodation in Code-Switched Human-Machine Dialogues (2020) (5)
Zero-shot Learning for Speech Recognition with Universal Phonetic Model (2018) (5)
Blizzard 2008: Experiments on Unit Size for Unit Selection Speech Synthesis (2008) (4)
Submission from CMU for Blizzard Challenge 2019 (2018) (4)
Speech Parameter Generation Algorithm Considering Modulation Spectrum for Statistical Parametric Speech Synthesis (2015) (4)
Recovery of acronyms, out-of-lattice words and pronunciations from parallel multilingual speech (2012) (4)
Parallel combination of multilingual speech streams for improved ASR (2012) (4)
Learning to Order Graph Elements with Application to Multilingual Surface Realization (2019) (4)
Segment Level Voice Conversion with Recurrent Neural Networks (2017) (4)
SANTLR: Speech Annotation Toolkit for Low Resource Languages (2019) (4)
LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification (2020) (4)
The Blizzard Challenge 2006 CMU Entry introducing hybrid trajectory-selection synthesis (2006) (4)
Multimodal Speech Summarization Through Semantic Concept Learning (2021) (4)
Ugloss: a Framework for Improving Spoken Language Generation Understandability (2007) (4)
Parallel combination of speech streams for improved ASR (2012) (4)
Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data (2019) (4)
Phoneme Level Language Models for Sequence Based Low Resource ASR (2019) (4)
Elderly perception of speech from a computer (2002) (3)
Using a Computational Situation Theoretic Language to investigate Contemporary Semantic Theories (1993) (3)
A Resource for Computational Experiments on Mapudungun (2019) (3)
Speech Summarization using Restricted Self-Attention (2021) (3)
Universal grapheme-based speech synthesis (2015) (3)
Stance Classification, Outcome Prediction, and Impact Assessment: NLP Tasks for Studying Group Decision-Making (2019) (3)
Dataset Analysis and Augmentation for Emoji-Sensitive Irony Detection (2019) (3)
Learning Disentangled Representation in Latent Stochastic Models: A Case Study with Image Captioning (2019) (3)
Embedding DRT in a Situation Theoretic Framework (1992) (3)
CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Mixing (2021) (3)
Style Variation as a Vantage Point for Code-Switching (2020) (3)
Intent Classification Using Pre-Trained Embeddings For Low Resource Languages (2021) (3)
Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios (2020) (3)
Investigating Utterance Level Representations for Detecting Intent from Acoustics (2018) (3)
Intent Recognition and Unsupervised Slot Identification for Low-Resourced Spoken Dialog Systems (2021) (3)
Improved punctuation recovery through combination of multiple speech streams (2013) (3)
Induction and Reference of Entities in a Visual Story (2019) (3)
Describing Spoken Dialogue Systems Differences (2008) (3)
Language Technologies for Humanitarian Aid (2006) (3)
Deep Speech Synthesis from Articulatory Representations (2022) (3)
Rapid Prototyping of a German TTS System (1998) (3)
Reading between the Lines: Exploring Infilling in Visual Narratives (2020) (3)
Using acoustics to improve pronunciation for synthesis of low resource languages (2015) (3)
Unsupervised Phonetic and Word Level Discovery for Speech to Speech Translation for Unwritten Languages (2019) (3)
Building Practical Spoken Dialog Systems (2008) (3)
NAACL-HLT Workshop on Future directions and needs in the Spoken Dialog Community: Tools and Data (SDCTD 2012) (2012) (3)
Linguistic Markers of Influence in Informal Interactions (2017) (3)
Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble (2022) (2)
A style capturing approach to F0 transformation in voice conversion (2013) (2)
The blizzard machine learning challenge 2017 (2017) (2)
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization (2022) (2)
Cross-Lingual Transfer for Speech Processing Using Acoustic Language Similarity (2021) (2)
CMU Blizzard 2008: Optimally using a large database for unit selection synthesis. (2008) (2)
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition (2022) (2)
CMU GetGoing: An Understandable and Memorable Dialog System for Seniors (2019) (2)
Measuring unsupervised acoustic clustering through phoneme pair merge-and-split tests (2005) (2)
Visual Evaluation of Voice Transformation Based on Knowledge of Speaker (2006) (2)
International Speech Communication Association (isca) Microsoft Research International Speech Communication Association (isca) Special Interest Group on Discourse and Dialogue (sigdial) Dialogs on Dialogs Student Reading Group Organizing Committee: Advisory Committee: Workshop Program 10:30 -12:00 M (2005) (1)
Using the Tilt Intonation Model: A Data-Driven Approach (2001) (1)
This Table is Different: A WordNet-Based Approach to Identifying References to Document Entities (2016) (1)
Bag-of-Acoustic-Words for Mental Health Assessment: A Deep Autoencoding Approach (2019) (1)
Introduction to NIPS 2017 Competition Track (2018) (1)
Real Users and Real Dialog Systems: The Hard Challenge for SDS (2012) (1)
Doing Research in a Deployed Spoken Dialog System: One Year of Let’s Go! Public Experience (2017) (1)
Dialog State Tracking Challenge: Information For Prospective Participants (2012) (1)
The CMU entry to blizzard machine learning challenge (2017) (1)
Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages (2020) (1)
Computational morphology of English (1988) (1)
Distributed representation-based spoken word sense induction (2015) (1)
Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis (2015) (1)
Initiations and Interruptions in a Spoken Dialog System (2016) (1)
Evaluating Gender Bias Transfer from Film Data (2022) (1)
Speaker-Independent Acoustic-to-Articulatory Speech Inversion (2023) (1)
Unconventional Approaches to Gathering and Sharing Resources for Spoken Dialog Research (2017) (1)
Data-driven intonational phonology (2013) (1)
Mixed-mode Multilinguality in TTS: The Case of Canadian French (2006) (1)
Nonlinear ISA with Auxiliary Variables for Learning Speech Representations (2020) (1)
Submission from CMU towards 1st MultiTarget Speaker Detection and Identification Challenge (2018) (1)
Improving speech systems built from very little data (2008) (1)
Challenges in Automated Question Answering for Privacy Policies (2019) (1)
Phone Distribution Estimation for Low Resource Languages (2021) (1)
Future Directions in Spoken Dialog Systems: A Community of Possibilities (2012) (0)
Phone Inventories and Recognition for Every Language (2022) (0)
Intent classification using pre-trained language agnostic embeddings for low resource languages (2021) (0)
Multimodal Detection of Driver Distraction FINAL RESEARCH REPORT (2018) (0)
Applause: A Learning Tool for Low-Resource Languages (2014) (0)
Proc. 2009 Asia-Pacific Signal and Information Processing Association (APSIPA) (2009) (0)
Dissecting the components and factors of Neural Text Generation (2020) (0)
Some Different Approaches to DRT (1997) (0)
Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization (2022) (0)
Using Speaker ID to Discover Repeat Callers of a Spoken Dialog System (2011) (0)
Proceedings of The 8th International Global WordNet Conference (2016) (0)
Towards Using Heterogeneous Relation Graphs for End-to-End TTS (2021) (0)
2 Articulatory Features 2 . 1 Types of Articulatory Representations (2011) (0)
Generating Mandarin and Cantonese F0 Contours with Decision Trees and BLSTMs (2018) (0)
Speech Translation for Triage of Emergency Phonecalls in Minority Languages (2008) (0)
Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units (2021) (0)
Text Normalization for Speech Systems for All Languages (2022) (0)
The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998 (1998) (0)
Formal Properties of Feature Grammars (2007) (0)
RE-WOCHAT : Workshop on Collecting and Generating Resources for Chatbots and Conversational Agents-Development and Evaluation Workshop Programme ( May 28 (2016) (0)
Quality Improvement Approaches Based on the Modulation Spectrum to Statistical Parametric Speech Synthesis (2015) (0)
The Blizzard Challenge: evaluating corpus-based speech synthesis techniques (2007) (0)
Integrating Verbal and Nonverbal Input into a Dynamic Response Spoken Dialogue System (2017) (0)
AUGMENTING NON-COLLABORATIVE DIALOG SYS- (2019) (0)
Detecting Driver Distraction (2018) (0)
Chapter 2 Challenges in Speech Synthesis (2010) (0)
Towards Automatic Route Description Unification in Spoken Dialog Systems (2021) (0)
DUALGRAM: An Efficient Method for Representing Limited-Domain Language Models (1992) (0)
Towards Improving Intelligibility of Black-Box Speech Synthesizers in Noise (2018) (0)
Optionality in evaluating prosody (2004) (0)
Dialogue Context Encoder Structure Encoder Graph Encoding ( GAT ) Structure Encoder u 1 u 2 u 3 u 4 Graph Pooling Graph Pooling Graph Encoding ( GAT ) GCN-ASAPGCN-ASAP Utterance Embedding Utterance Generation (2021) (0)
Incorporating Dialectal Features in Synthesized Speech using Voice Conversion Techniques (2018) (0)
Entity Skeletons for Visual Storytelling (2020) (0)
The Real Challenge 2014: Progress and Prospects (2015) (0)
A Survey of Code-switched Speech and Language Processing (2019) (0)
A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution (2022) (0)
Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models (2022) (0)
Understandable production of massive synthesis (2007) (0)
Multimodal Detection of Driver Distraction (2017) (0)

Other Resources About Alan W. Black

en.wikipedia.org