Junichi Yamagishi
#122,770
Most Influential Person Now
Junichi Yamagishi's AcademicInfluence.com Rankings
Junichi Yamagishicomputer-science Degrees
Computer Science
#5017
World Rank
#5302
Historical Rank
Artificial Intelligence
#1407
World Rank
#1436
Historical Rank
Database
#2185
World Rank
#2298
Historical Rank

Download Badge
Computer Science
Why Is Junichi Yamagishi Influential?
(Suggest an Edit or Addition)Junichi Yamagishi's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- MesoNet: a Compact Facial Video Forgery Detection Network (2018) (676)
- The HMM-based speech synthesis system (HTS) version 2.0 (2007) (613)
- SUPERSEDED - CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit (2016) (586)
- Spoofing and countermeasures for speaker verification: A survey (2015) (521)
- Speech Synthesis Based on Hidden Markov Models (2013) (426)
- Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm (2009) (388)
- ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge (2015) (386)
- The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection (2017) (379)
- ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection (2019) (326)
- Capsule-forensics: Using Capsule Networks to Detect Forged Images and Videos (2018) (284)
- The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods (2018) (249)
- Multi-task Learning for Detecting and Segmenting Manipulated Facial Images and Videos (2019) (247)
- HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering (2011) (245)
- CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit (version 0.92) (2019) (226)
- Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech (2016) (224)
- The voice bank corpus: Design, collection and data analysis of a large regional accent speech database (2013) (221)
- Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training (2007) (213)
- Evaluation of Speaker Verification Security and Detection of HMM-Based Synthetic Speech (2012) (212)
- Distinguishing computer graphics from natural images using convolution neural networks (2017) (206)
- Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (205)
- ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge (2017) (167)
- ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech (2019) (163)
- Detection of synthetic speech for the problem of imposture (2011) (159)
- The Voice Conversion Challenge 2016 (2016) (157)
- Acoustic Modeling of Speaking Styles and Emotional Expressions in HMM-Based Speech Synthesis (2005) (151)
- A Style Control Technique for HMM-Based Expressive Speech Synthesis (2007) (151)
- Spoofing and countermeasures for automatic speaker verification (2013) (150)
- ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements (2018) (139)
- t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification (2018) (135)
- An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning (2020) (130)
- MOSNet: Deep Learning based Objective Assessment for Voice Conversion (2019) (128)
- ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan (2021) (122)
- High-Quality Nonparallel Voice Conversion Based on Cycle-Consistent Adversarial Network (2018) (118)
- Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis (2009) (116)
- Speech Synthesis with Various Emotional Expressions and Speaking Styles by Style Interpolation and Morphing (2005) (114)
- Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion (2020) (114)
- ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection (2021) (106)
- Use of a Capsule Network to Detect Fake Images and Videos (2019) (102)
- Thousands of Voices for HMM-Based Speech Synthesis–Analysis and Application of TTS Systems Built on Various ASR Corpora (2009) (95)
- Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings (2019) (95)
- Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction (2012) (90)
- Neural Source-filter-based Waveform Model for Statistical Parametric Speech Synthesis (2018) (86)
- Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech (2010) (83)
- Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance (2016) (82)
- Modeling of various speaking styles and emotions for HMM-based speech synthesis (2003) (82)
- Neural Source-Filter Waveform Models for Statistical Parametric Speech Synthesis (2019) (78)
- Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks (2016) (77)
- Introducing the VoicePrivacy Initiative (2020) (77)
- Towards an improved modeling of the glottal source in statistical parametric speech synthesis (2007) (76)
- A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection (2021) (74)
- Speaker Anonymization Using X-vector and Neural Waveform Models (2019) (71)
- Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation (2017) (71)
- The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate (2011) (70)
- Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language (2018) (70)
- The HTS-2008 System: Yet Another Evaluation of the Speaker-Adaptive HMM-based Speech Synthesis System in The 2008 Blizzard Challenge (2008) (68)
- Adapting and controlling DNN-based speech synthesis using input codes (2017) (68)
- Average-Voice-Based Speech Synthesis (2006) (68)
- Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis (2018) (68)
- SAS: A speaker verification spoofing database containing diverse attacks (2015) (66)
- Attentive Filtering Networks for Audio Replay Attack Detection (2018) (66)
- Synthetic Speech Discrimination using Pitch Pattern Statistics Derived from Image Analysis (2012) (66)
- Recent development of the HMM-based speech synthesis system (HTS) (2009) (65)
- Glottal spectral separation for parametric speech synthesis (2008) (64)
- A Training Method of Average Voice Model for HMM-Based Speech Synthesis (2003) (64)
- A Comparison of Recent Waveform Generation and Acoustic Modeling Methods for Neural-Network-Based Speech Synthesis (2018) (63)
- Robustness of HMM-based speech synthesis (2008) (63)
- Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals (2020) (63)
- Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data (2018) (63)
- Evaluation of the Vulnerability of Speaker Verification to Synthetic Speech (2010) (62)
- Analysis of the Voice Conversion Challenge 2016 Evaluation Results (2016) (60)
- Generative Adversarial Network-Based Postfilter for STFT Spectrograms (2017) (60)
- Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression (2013) (59)
- Speaker-Independent HMM-based Speech Synthesis System: HTS-2007 System for the Blizzard Challenge 2007 (2007) (59)
- Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis (2006) (58)
- ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech (2021) (56)
- Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis (2018) (56)
- Introduction to Voice Presentation Attack Detection and Recent Advances (2019) (54)
- Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder (2018) (54)
- Voice liveness detection algorithms based on pop noise caused by human breath for automatic speaker verification (2015) (54)
- An experimental comparison of multiple vocoder types (2013) (52)
- Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection (2019) (52)
- An autoregressive recurrent mixture density network for parametric speech synthesis (2017) (52)
- Deep neural network-guided unit selection synthesis (2016) (52)
- HMM-based speech synthesiser using the LF-model of the glottal source (2011) (50)
- The CSTR/EMIME HTS system for Blizzard Challenge 2010 (2010) (49)
- A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis (2015) (48)
- Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis (2010) (47)
- Speaker Recognition Anti-spoofing (2014) (46)
- Revisiting the security of speaker verification systems against imposture using synthetic speech (2010) (46)
- An Analysis of HMM-based prediction of articulatory movements (2010) (46)
- A deep auto-encoder based low-dimensional feature extraction from FFT spectral envelopes for statistical parametric speech synthesis (2016) (45)
- The VoicePrivacy 2022 Challenge Evaluation Plan (2022) (44)
- TUNDRA: a multilingual corpus of found data for TTS research created with light supervision (2013) (44)
- A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features (2006) (44)
- Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet (2019) (42)
- Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis (2004) (42)
- A Context Clustering Technique for Average Voice Models (2002) (41)
- Speech-driven lip motion generation with a trajectory HMM (2008) (41)
- Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks (2018) (41)
- Can Objective Measures Predict the Intelligibility of Modified HMM-Based Synthetic Speech in Noise? (2011) (41)
- Modular Convolutional Neural Network for Discriminating between Computer-Generated Images and Photographic Images (2018) (40)
- Model Adaptation Approach to Speech Synthesis with Diverse Voices and Styles (2007) (39)
- Generalization Ability of MOS Prediction Networks (2021) (39)
- Measuring the Gap Between HMM-Based ASR and TTS (2010) (38)
- Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning (2015) (38)
- Speaker-independent raw waveform model for glottal excitation (2018) (38)
- DNN-based stochastic postfilter for HMM-based speech synthesis (2014) (38)
- HSMM-Based Model Adaptation Algorithms for Average-Voice-Based Speech Synthesis (2006) (37)
- Direct Modeling of Frequency Spectra and Waveform Generation Based on Phase Recovery for DNN-Based Speech Synthesis (2017) (37)
- The VoicePrivacy 2020 Challenge: Results and findings (2021) (37)
- Design Choices for X-vector Based Speaker Anonymization (2020) (37)
- Phone duration modeling using gradient tree boosting (2008) (37)
- A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis (2018) (36)
- Emotion transplantation through adaptation in HMM-based speech synthesis (2015) (36)
- Human Walking Motion Synthesis with Desired Pace and Stride Length Based on HSMM (2005) (36)
- Unsupervised adaptation for HMM-based speech synthesis (2008) (35)
- Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions (2020) (35)
- An RNN-Based Quantized F0 Model with Multi-Tier Feedback Links for Text-to-Speech Synthesis (2017) (34)
- An Introduction to HMM-Based Speech Synthesis (2006) (34)
- Glottal Spectral Separation for Speech Synthesis (2014) (34)
- Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion (2018) (33)
- HMM-BASED EXPRESSIVE SPEECH SYNTHESIS — TOWARDS TTS WITH ARBITRARY SPEAKING STYLES AND EMOTIONS (2003) (32)
- Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion (2014) (32)
- Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis (2013) (30)
- Statistical parametric speech synthesis for Ibibio (2014) (29)
- Voice Liveness Detection for Speaker Verification based on a Tandem Single/Double-channel Pop Noise Detector (2016) (29)
- ASVspoof 2019: The 3rd Automatic Speaker Verification Spoofing and Countermeasures Challenge database (2019) (29)
- Mel cepstral coefficient modification based on the Glimpse Proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise (2012) (28)
- Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis (2012) (28)
- Using HMM-based Speech Synthesis to Reconstruct the Voice of Individuals with Degenerative Speech Disorders (2012) (28)
- Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation (2012) (27)
- ALISA: An automatic lightly supervised speech segmentation and alignment tool (2016) (27)
- NAUTILUS: A Versatile Voice Cloning System (2020) (27)
- Towards speaking style transplantation in speech synthesis (2013) (27)
- Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis (2018) (27)
- Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project (2010) (26)
- Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise (2011) (26)
- Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge (2008) (26)
- Privacy-preserving sound to degrade automatic speaker verification performance (2016) (26)
- HMM-Based Speech Synthesis with Various Speaking Styles Using Model Interpolation (2004) (25)
- Investigating self-supervised front ends for speech spoofing countermeasures (2021) (25)
- The VoiceMOS Challenge 2022 (2022) (25)
- Human vs machine spoofing detection on wideband and narrowband data (2015) (25)
- Rating Naturalness in Speech Synthesis: The Effect of Style and Expectation (2014) (25)
- GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram (2019) (24)
- A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural $F_0$ Model for Statistical Parametric Speech Synthesis (2020) (24)
- A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks (2016) (24)
- Roles of the average voice in speaker-adaptive HMM-based speech synthesis (2010) (24)
- Investigating very deep highway networks for parametric speech synthesis (2018) (23)
- Adaptive training for hidden semi-Markov model [speech synthesis applications] (2005) (23)
- Cepstral analysis based on the glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise (2012) (23)
- Unsupervised Continuous-Valued Word Features for Phrase-Break Prediction without a Part-of-Speech Tagger (2011) (23)
- Identification of contrast and its emphatic realization in HMM based speech synthesis (2009) (22)
- Performance evaluation of the speaker-independent HMM-based speech synthesis system “HTS 2007” for the Blizzard Challenge 2007 (2008) (22)
- Analysis of speaker clustering strategies for HMM-based speech synthesis (2012) (21)
- The role of higher-level linguistic features in HMM-based speech synthesis (2010) (21)
- Towards Personalised Synthesised Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction (2013) (21)
- Lightly supervised GMM VAD to use audiobook for speech synthesiser (2013) (21)
- Towards Glottal Source Controllability in Expressive Speech Synthesis (2012) (21)
- MLLR adaptation for hidden semi-Markov model based speech synthesis (2004) (21)
- How do Voices from Past Speech Synthesis Challenges Compare Today? (2021) (21)
- Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis (2019) (21)
- LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech (2021) (21)
- Festival multisyn voices for the 2007 blizzard challenge. (2007) (21)
- OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild (2021) (20)
- Speech driven head motion synthesis based on a trajectory model (2007) (20)
- Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis (2006) (20)
- A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment (2018) (19)
- Speaker-Independent HMM-based Speech Synthesis System (2007) (19)
- A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis (2015) (19)
- Human walking motion synthesis based on multiple regression hidden semi-Markov model (2005) (19)
- A training method for average voice model based on shared decision tree context clustering and speaker adaptive training (2003) (19)
- The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment (2020) (18)
- Building personalised synthetic voices for individuals with severe speech impairment (2013) (18)
- Principles for Learning Controllable TTS from Annotated and Latent Variation (2017) (18)
- Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora (2019) (18)
- Simple methods for improving speaker-similarity of HMM-based speech synthesis (2010) (18)
- STFT Spectral Loss for Training a Neural Speech Waveform Model (2018) (18)
- Neural net word representations for phrase-break prediction without a part of speech tagger (2014) (18)
- Synthesis of Child Speech With HMM Adaptation and Voice Conversion (2010) (17)
- Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis (2010) (17)
- HMM-based synthesis of child speech (2008) (17)
- The use of articulatory movement data in speech synthesis applications: An overview — Application of articulatory movements using machine learning algorithms — (2015) (17)
- The SIWIS French Speech Synthesis Database ? Design and recording of a high quality French database for speech synthesis (2017) (17)
- A style control technique for speech synthesis using multiple regression HSMM (2006) (17)
- A technique for controlling voice quality of synthetic speech using multiple regression HSMM (2006) (16)
- Simple4All proposals for the Albayzin Evaluations in Speech Synthesis (2012) (16)
- Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping (2012) (16)
- Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation (2022) (16)
- An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis (2014) (16)
- Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects (2018) (16)
- Bootstrapping Non-Parallel Voice Conversion from Speaker-Adaptive Text-to-Speech (2019) (16)
- Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech (2018) (16)
- Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction (2020) (16)
- Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis (2020) (16)
- Combining Statistical Parameteric Speech Synthesis and Unit-Selection for Automatic Voice Cloning (2008) (16)
- Introduction to the Issue on Spoofing and Countermeasures for Automatic Speaker Verification (2017) (15)
- A Comparative Study of the Performance of HMM, DNN, and RNN based Speech Synthesis Systems Trained on Very Large Speaker-Dependent Corpora (2016) (15)
- Utilization of an HMM-based feature generation module in 5 ms segment concatenative speech synthesis (2007) (15)
- Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks (2018) (15)
- Synthesis of fast speech with interpolation of adapted HSMMs and its evaluation by blind and sighted listeners (2010) (15)
- An Initial Investigation for Detecting Partially Spoofed Audio (2021) (15)
- Identifying computer-generated text using statistical analysis (2017) (15)
- Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data (2013) (15)
- Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis (2016) (14)
- Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise (2013) (14)
- The SIWIS Database: A Multilingual Speech Database with Acted Emphasis (2016) (14)
- Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm (2020) (13)
- Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? (2020) (13)
- Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems (2020) (13)
- Transferring Neural Speech Waveform Synthesizers to Musical Instrument Sounds Generation (2019) (13)
- Multiple feed-forward deep neural networks for statistical parametric speech synthesis (2015) (13)
- Speech synthesis without a phone inventory (2009) (13)
- An approach for gait anonymization using deep learning (2017) (13)
- Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise (2012) (13)
- A context clustering technique for average voice model in HMM-based speech synthesis (2002) (13)
- Utilising spontaneous conversational speech in HMM-based speech synthesis (2010) (13)
- Towards Cross-Lingual Emotion Transplantation (2014) (12)
- Generating segmental foreign accent (2014) (12)
- HMM-based text-to-articulatory-movement prediction and analysis of critical articulators (2010) (12)
- Spatio-temporal generative adversarial network for gait anonymization (2019) (12)
- Noise Tokens: Learning Neural Noise Templates for Environment-Aware Speech Enhancement (2020) (12)
- Impacts of machine translation and speech synthesis on speech-to-speech translation (2012) (12)
- Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis (2013) (12)
- Translation and Prosody in Swiss Languages (2014) (12)
- Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods (2013) (11)
- Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV (2007) (11)
- Privacy and Utility of X-Vector Based Speaker Anonymization (2022) (11)
- iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning (2020) (11)
- Multimodal speech synthesis architecture for unsupervised speaker adaptation (2018) (10)
- A Multi-Level Attention Model for Evidence-Based Fact Checking (2021) (10)
- Letter-based speech synthesis (2010) (10)
- Complex-Valued Restricted Boltzmann Machine for Direct Learning of Frequency Spectra (2017) (10)
- Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (2016) (10)
- Proc. Blizzard Challenge Workshop (in Proc. SSW6) (2007) (10)
- Deep neural network context embeddings for model selection in rich-context HMM synthesis (2015) (10)
- Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances (2021) (10)
- Fashion-Guided Adversarial Attack on Person Segmentation (2021) (10)
- Initial investigation of speech synthesis based on complex-valued neural networks (2016) (10)
- Optimizing Tandem Speaker Verification and Anti-Spoofing Systems (2022) (9)
- A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation (2019) (9)
- The NII speech synthesis entry for Blizzard Challenge 2016 (2016) (9)
- The ASVspoof 2019 database (2019) (9)
- Using Cyclic Noise as the Source Signal for Neural Source-Filter-Based Speech Waveform Model (2020) (9)
- 8th ISCA Workshop on Speech Synthesis (2004) (9)
- An analysis of machine translation and speech synthesis in speech-to-speech translation system (2011) (9)
- Building personalised synthesised voices for individuals with dysarthia using the HTS toolkit (2010) (9)
- Building Personalized Synthetic Voices for Individuals with Dysarthria using the HTS Toolkit (2010) (9)
- Wavelet-based decomposition of F0 as a secondary task for DNN-based speech synthesis with multi-task learning (2016) (8)
- Combining vocal tract length normalization with hierarchial linear transformations (2012) (8)
- A fixed dimension and perceptually based dynamic sinusoidal model of speech (2014) (8)
- An RGB Gait Anonymization Model for Low-Quality Silhouettes (2019) (8)
- Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System (2016) (8)
- Intelligibility analysis of fast synthesized speech (2014) (8)
- Multidimensional scaling of systems in the Voice Conversion Challenge 2016 (2016) (8)
- Speaker Adaptation of Various Components in Deep Neural Network based Speech Synthesis (2016) (8)
- Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis (2005) (8)
- End-to-End Text-to-Speech Using Latent Duration Based on VQ-VAE (2020) (8)
- The SIWIS French Speech Synthesis Database (2017) (8)
- A Function-wise Pre-training Technique for Constructing a Deep Neural Network based Spectral Model in Statistical Parametric Speech Synthesis (2015) (7)
- Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection (2021) (7)
- The Voice Conversion Challenge 2018: database and results (2018) (7)
- Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems (2018) (7)
- Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis (2005) (7)
- Methods for applying dynamic sinusoidal models to statistical parametric speech synthesis (2015) (7)
- Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks (2016) (7)
- Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion (2019) (7)
- Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments (2019) (7)
- Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis (2011) (7)
- Analysis of unsupervised and noise-robust speaker-adaptive HMM-based speech synthesis systems toward a unified ASR and TTS framework (2009) (7)
- Initial investigation of encoder-decoder end-to-end TTS using marginalization of monotonic hard alignments (2019) (7)
- Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models (2022) (7)
- Parallel and cascaded deep neural networks for text-to-speech synthesis (2016) (7)
- Benchmarking and challenges in security and privacy for voice biometrics (2021) (7)
- Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis (2012) (7)
- Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis (2016) (7)
- Mage - reactive articulatory feature control of HMM-based parametric speech synthesis (2013) (7)
- ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild (2022) (7)
- Cyborg Speech: Deep Multilingual Speech Synthesis for Generating Segmental Foreign Accent with Natural Prosody (2018) (6)
- Constructing a Deep Neural Network Based Spectral Model for Statistical Speech Synthesis (2016) (6)
- Formant-Controlled HMM-Based Speech Synthesis (2011) (6)
- Reactive accent interpolation through an interactive map application (2013) (6)
- The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance (2022) (6)
- Master Face Attacks on Face Recognition Systems (2021) (6)
- Deep Denoising Auto-encoder for Statistical Speech Synthesis (2015) (6)
- Efficient Pitch Estimation on Natural Opera-Singing by a Spectral Correlation based Strategy (2015) (6)
- Rakugo speech synthesis using segment-to-segment neural transduction and style tokens — toward speech synthesis for entertaining audiences (2019) (6)
- Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System (2017) (6)
- On the evaluation of inversion mapping performance in the acoustic domain (2013) (5)
- Scaling and Bias Codes for Modeling Speaker-Adaptive DNN-Based Speech Synthesis Systems (2018) (5)
- Exploring Disentanglement with Multilingual and Monolingual VQ-VAE (2021) (5)
- On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis (2021) (5)
- Transformation on Computer-Generated Facial Image to Avoid Detection by Spoofing Detector (2018) (5)
- Multiple-average-voice-based speech synthesis (2014) (5)
- Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise (2013) (5)
- Learning Word Vector Representations Based on Acoustic Counts (2017) (5)
- Combining Vocal Tract Length Normalization With Hierarchical Linear Transformations (2014) (5)
- A Practical Guide to Logical Access Voice Presentation Attack Detection (2022) (5)
- Audiovisual Speaker Conversion: Jointly and Simultaneously Transforming Facial Expression and Acoustic Characteristics (2018) (5)
- Revisiting Speech Content Privacy (2021) (5)
- Glottal Source and Prosodic Prominence Modelling in HMM-based Speech Synthesis for the Blizzard Challenge 2009 (2009) (5)
- The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database, Version 2 (2018) (5)
- Speech Intelligibility in Cars: The Effect of Speaking Style, Noise and Listener Age (2017) (5)
- HMM adaptation and voice conversion for the synthesis of child speech: a comparison (2009) (5)
- Intelligibility of time-compressed synthetic speech: Compression method and speaking style (2015) (5)
- Capsule-Forensics Networks for Deepfake Detection (2022) (4)
- Multi-Metric Optimization Using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement (2021) (4)
- Identifying Computer-Translated Paragraphs using Coherence Features (2018) (4)
- Evaluation of a Transplantation Algorithm for Expressive Speech Synthesis (2013) (4)
- SUPERSEDED - The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database (2017) (4)
- Post-evaluation analysis for the VoicePrivacy 2020Challenge: Using anonymized speech data to train attackmodels and ASR (2020) (4)
- Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences (2019) (4)
- Influence of speaker familiarity on blind and visually impaired children's and young adults' perception of synthetic voices (2017) (4)
- Voice banking and voice reconstruction for MND patients (2011) (4)
- Latent linguistic embedding for cross-lingual text-to-speech and voice conversion (2020) (4)
- Towards an Unsupervised Speaking Style Voice Building Framework: Multi-Style Speaker Diarization (2012) (4)
- Unsupervised speaker adaptation for speech-to-speech translation system (言語理解とコミュニケーション) (2009) (4)
- Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model (2019) (4)
- Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis (2021) (4)
- Vocal attractiveness of statistical speech synthesisers (2011) (3)
- Influence of speaker familiarity on blind and visually impaired children's perception of synthetic voices in audio games (2015) (3)
- Anti-spoofing, Voice Databases (2015) (3)
- Detecting and Correcting Adversarial Images Using Image Processing Operations and Convolutional Neural Networks (2019) (3)
- Expressive Speech Synthesis Using Sentiment Embeddings (2018) (3)
- Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis (2020) (3)
- Security of Facial Forensics Models Against Adversarial Attacks (2019) (3)
- Device Recorded VCTK (Small subset version) (2018) (3)
- A simple RNN-plus-highway network for statistical parametric speech synthesis (2017) (3)
- HMM-based Speech Synthesis with an Acoustic Glottal Source Model (2009) (3)
- "Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006 for the Blizzard Challenge 2006" (2006) (3)
- Reconstructing voices within the multiple-average-voice-model framework (2015) (3)
- Does the Lombard Effect Improve Emotional Communication in Noise? - Analysis of Emotional Speech Acted in Noise - (2019) (3)
- Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings (2021) (3)
- Proceedings of Interspeech 2015 (2015) (3)
- LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example (2021) (3)
- A Method for Identifying Origin of Digital Images Using a Convolutional Neural Network (2019) (3)
- Estimating the Confidence of Speech Spoofing Countermeasure (2021) (3)
- Complex-Valued Restricted Boltzmann Machine for Speaker-Dependent Speech Parameterization From Complex Spectra (2019) (3)
- Speaker adaptation using context clustering decision tree for HMM-based speech synthesis (2003) (3)
- Investigation of Using Continuous Representation of Various Linguistic Units in Neural Network Based Text-to-Speech Synthesis (2016) (3)
- Investigating Active-Learning-Based Training Data Selection for Speech Spoofing Countermeasure (2022) (3)
- SVSNet: An End-to-End Speaker Voice Similarity Assessment Model (2021) (2)
- Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation (2020) (2)
- Effects of Image Processing Operations on Adversarial Noise and Their Use in Detecting and Correcting Adversarial Images (2022) (2)
- Investigating Very Deep Highway Networks for Parametric Speech Synthesis (2016) (2)
- ASVspoof 2021 Challenge - Speech Deepfake Database (2021) (2)
- Using an intelligibility measure to create noise robust cepstral coefficients for HMM-based speech synthesis (2012) (2)
- Unsupervised Speaker Adaptation for DNN-based Speech Synthesis using Input Codes (2018) (2)
- Spoofing and Anti-Spoofing (SAS) corpus v1.0 (2015) (2)
- Intelligibility Enhancement of Speech in Noise (2014) (2)
- An HMM-based speech synthesiser using glottal post-filtering (2010) (2)
- Proc. of The 1st Workshop on Child, Computer and Interaction (ICMI'08 post-conference workshop) (2008) (2)
- Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions (2022) (2)
- Alba speech corpus (2019) (2)
- Lip Motion synthesis using a context dependent trajectory Hidden Markov Model (2007) (2)
- Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform (2019) (2)
- Listening test materials for "Deep neural network-guided unit selection synthesis" (2016) (2)
- Effectiveness of Detection-based and Regression-based Approaches for Estimating Mask-Wearing Ratio (2021) (2)
- Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): open discussion and future plans (2015) (2)
- Reverberation Modeling for Source-Filter-based Neural Vocoder (2020) (2)
- A Study on Context Clustering Techniques and Speaker Adaptive Training for Average Voice Model (2002) (2)
- ASVspoof 2021 Challenge - Physical Access Database (2021) (2)
- An unified and automatic approach of Mandarin HTS system (2010) (2)
- DDS: A new device-degraded speech dataset for speech enhancement (2021) (2)
- An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning (2020) (2)
- The GTH-CSTR Entries for the Speech Synthesis Albayzin 2010 Evaluation: HMM-based Speech Synthesis Systems considering morphosyntactic features and Speaker Adaptation Techniques (2010) (2)
- Mage - HMM-based speech synthesis reactively controlled by the articulators (2013) (1)
- Gesture Control of HMM-Based Singing Voice Synthesis (2013) (1)
- User Generated Dialogue Systems: uDialogue (2017) (1)
- Enhancing Low-Quality Voice Recordings Using Disentangled Channel Factor and Neural Waveform Model (2020) (1)
- Robust Deepfake On Unrestricted Media: Generation And Detection (2022) (1)
- Lessons Learned from ASVSpoof and Remaining Challenges (2022) (1)
- Speech Prosody 2014 (2014) (1)
- Unsupervised English-to-Japanese speaker adaptation for HMM-based speech synthesis. (2009) (1)
- Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022 (2022) (1)
- Voice banking and reconstruction 解説 — Speech synthesis technologies for individuals with vocal disabilities — (2011) (1)
- Introduction to the special issue "Speaker and language characterization and recognition: Voice modeling, conversion, synthesis and ethical aspects" (2020) (1)
- Color Transfer to Anonymized Gait Images While Maintaining Anonymization (2020) (1)
- Using adaptation to improve speech transcription alignment in noisy and reverberant environments (2013) (1)
- Development of a statistical parametric synthesis system for operatic singing in German (2016) (1)
- Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds (2021) (1)
- Voice Conversion Challenge 2020 Listening Test Data (2020) (1)
- Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016 (2016) (1)
- A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities (2015) (1)
- Advanced speech synthesis technologies for vocal disabilities (2015) (1)
- Spoofing and Anti-Spoofing: A Shared View of Speaker Verification, Speech Synthesis and Voice Conversion (2015) (1)
- Misperceptions of the Emotional Content of Natural and Vocoded Speech in a Car (2017) (1)
- Performance evaluation of HMM-based style classification with a small amount of training data (2007) (1)
- Preventing Fake Information Generation Against Media Clone Attacks (2021) (1)
- Proceedings of INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association (2012) (1)
- Viable Threat on News Reading: Generating Biased News Using Natural Language Models (2020) (1)
- Human vs Machine Spoofing (2015) (1)
- Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing (2021) (1)
- Generation and Detection of Media Clones (2021) (1)
- SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies (2013) (1)
- Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement (2022) (0)
- Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing (2021) (0)
- Key files for Spoofing and Anti-Spoofing (SAS) corpus v1.0 (2017) (0)
- Cyber Vaccine for Deepfake Immunity (2023) (0)
- J2330102 Improvement of Stress Evaluation System by Onomatopeias (2015) (0)
- DDS (Device-Degraded Speech) Dataset - VCTK portion - Part 1 (2021) (0)
- International Conference on NONLINEAR SPEECH PROCESSING 2015 (2015) (0)
- 2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014) (2014) (0)
- Evaluation of the new speech signal model in conjunction with HMM-based acoustic models (2013) (0)
- Future Trends in Digital Face Manipulation and Detection (2022) (0)
- Edinburgh Research Explorer Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis (2008) (0)
- Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances (2022) (0)
- Detecting and Correcting Adversarial Images Using Image Processing Operations (2019) (0)
- Non linear time compression of clear and normal speech at high rates (2019) (0)
- Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders (2022) (0)
- Identifying Machine-Generated Text Using Statistical Analysis (2018) (0)
- Development of a genre-dependent TTS system with cross-speaker speaking-style transplantation (2014) (0)
- Explorer Phone Duration Modeling Using Gradient Tree Boosting (2008) (0)
- Number 41 (2019) (0)
- Interspeech 2009 Edinburgh. (2009) (0)
- Effect of Choice of Probability Distribution, Randomness, and Search Methods for Alignment Modeling in Sequence-to-Sequence Text-to-Speech Synthesis Using Hard Alignment (2019) (0)
- Outlier-Aware Training for Improving Group Accuracy Disparities (2022) (0)
- Proc. The First Young Researchers Workshop in Speech Technology (2009) (0)
- How Similar or Different is Rakugo Speech Synthesizer to Professional Performers? (2020) (0)
- Mitigating the Diminishing Effect of Elastic Weight Consolidation (2022) (0)
- The Voice Conversion Challenge, 2016: multidimensional scaling (MDS) listening test results (2016) (0)
- Edinburgh Explorer Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks (2016) (0)
- Listening test results of the Voice Conversion Challenge 2018 (2019) (0)
- Autoregressive quantized F0 modeling using a recurrent neural network with feedback links (音声) (2017) (0)
- Fifty-Storms 2017 : Team Description Paper (2017) (0)
- INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008 (2008) (0)
- An overview of the current speech and language resources (2015) (0)
- A Message from the Co-Chairs (2007) (0)
- Explorer The Voice Conversion Challenge 2016 (2016) (0)
- Edinburgh Research Explorer Multiple-average-voice-based speech synthesis (2017) (0)
- Analysis of Master Vein Attacks on Finger Vein Recognition Systems (2022) (0)
- Proc. Speech Synthesis Workshop 2010 (2010) (0)
- Explorer Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks (2018) (0)
- PAID I BIAS ADAPTATION FOR VOCAL TRACT LENGTH NORMALIZATION (2013) (0)
- Proc. Interspeech 2012 (2012) (0)
- 1F33 Threshold of Multilayer Softness for Tactile Display (2016) (0)
- Investigating different representations for modeling multiple emotions in DNN-based speech synthesis (2017) (0)
- SUPERSEDED - The 2nd Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof 2017) Database, Version 2 (2018) (0)
- Modeling of Rakugo Speech and Its Various Speaking Styles: Toward Speech Synthesis That Entertains Audiences (2019) (0)
- Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems? (2022) (0)
- Bias Adaptation for Vocal Tract Length Normalization (2013) (0)
- A Combination of Speech Synthesis and Speech Recognition Creates an Affluent Society (2014) (0)
- Proc. 2009 Asia-Pacific Signal and Information Processing Association (APSIPA) (2009) (0)
- Microphone Feature Extraction Classifier Decision Speaker Template Storage Logic 1 2 3 4 5 8 9 6 7 (2019) (0)
- Grapheme or phoneme? An Analysis of Tacotron's Embedded Representations (2020) (0)
- Explorer Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (0)
- Proceedings of 9th ISCA Speech Synthesis Workshop (2016) (0)
- Explorer An Autoregressive Recurrent Mixture density Network For Parametric Speech Synthesis (2017) (0)
- Reactive Control of Expressive Speech Synthesis Using Kinect Skeleton Tracking (音声・第14回音声言語シンポジウム) (2012) (0)
- The Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, Barcelona, Spain, August 31-September 2, 2013 (2013) (0)
- Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance (2021) (0)
- Edinburgh Research Explorer Wavelet-based decomposition of F0 as a secondary task for DNN-based speech synthesis with multi-task learning (2015) (0)
- Proc. SAPA-SCALE Workshop on Statistical and Perceptual Audition (SAPA-SCALE 2012) (2012) (0)
- Proc. 7th ISCA Speech Synthesis Workshop (SSW7) (2010) (0)
- COMBINING VOCAL TRACT LENGTH NORMALIZATION WITH LINEAR TRANSFORMATIONS IN A BAYESIAN FRAMEWORK (2012) (0)
- SUPERSEDED - The Voice Conversion Challenge 2016 (2016) (0)
- 8th ISCA Workshop on Speech Synthesis - Barcelona, Spain (2013) (0)
- Proceedings SLP 2009 (2009) (0)
- 1012 Physical Threshold of the Multi-layer Sense (2016) (0)
- PartialSpoof Database - Partially Spoofed Audio Dataset for Anti-spoofing (2021) (0)
- Investigation on an autoregressive recurrent mixture density network for parametric speech synthesis (2017) (0)
- Edinburgh Research Explorer Reducing mismatch in training of DNN-based glottal excitation models in a statistical parametric text-to-speech system (2017) (0)
- Tires with flame retardant rubber (1992) (0)
- Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra (2018) (0)
- J1630304 Quantification of Physiological Multilayer Softness for Tactile Display (2015) (0)
- HMM-based speech synthesis adapted to listeners’ and talkers’ conditions (2012) (0)
- Listening test materials for "Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis" (2015) (0)
- Transforming Voice Source Parameters in a HMM-based Speech Synthesiser with Glottal Post-Filtering (2010) (0)
- Analyzing the impact of including listener perception annotations in RNN-based emotional speech synthesis (2017) (0)
- The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (2016) (0)
- A Study on Coded Parallel Combinatorial Spread Spectrum Systems Using Multiphase Modulation (2000) (0)
- Real-time control of expressive speech synthesis using kinect body tracking (2013) (0)
- 96kHz version of the CSTR VCTK Corpus (2017) (0)
- The HMM-basedSpeechSynthesisSystem(HTS)Version2.0 (2007) (0)
- Voice Conversion Challenge 2020 database v1.0 (2020) (0)
- Investigation of using the highway network to predict the F0 trajectory for text-to-speech synthesis (2016) (0)
- Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering (2016) (0)
- Proc. ICASSP - Vancouver, Canada (2013) (0)
- Explorer A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks (2017) (0)
- The VoicePrivacy 2020 Challenge: Post-evaluation analysis (2020) (0)
- The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance (2022) (0)
- Improving Spanish speech synthesis intelligibility under noisy environments (2016) (0)
- Explorer Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis (2008) (0)
- Edinburgh Research Explorer A deep auto-encoder based low-dimensional feature extraction from FFT spectral envelopes for statistical parametric speech synthesis (2015) (0)
- Spoofing and countermeasures for automatic speaker verification (2021) (0)
- Proceedings of the ACL 2010 System Demonstrations (2010) (0)
- INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 26-30, 2010 (2010) (0)
- Proc. Iberspeech 2012 (2012) (0)
- Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms (2021) (0)
- Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM (2016) (0)
- Superseded - Human vs Machine Spoofing (2015) (0)
- Edinburgh Research Explorer Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis (2017) (0)
- Edinburgh Research Explorer An Approach for Gait Anonymization Using Deep Learning (2018) (0)
- Proceedings of the Institute of Acoustics 2014 (2014) (0)
- Comparative Evaluation of Synthetic and Voice-Converted Speech for Speaker Verification Spoofing and Countermeasures. (2014) (0)
- Explorer Analysis of the Voice Conversion Challenge 2016 Evaluation (2017) (0)
- Voice Conversion Challenge 2020 -- submitted waveforms v1.0.0 (2021) (0)
- 9th ISCA Speech Synthesis Workshop (2016) (0)
- Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline (2022) (0)
- PAPER Special Section on Recent Advances in Machine Learning for Spoken Language Processing Investigation of Using Continuous Representation of Various Linguistic Units in Neural Network Based Text-to-Speech Synthesis (2016) (0)
- Generating segmental foreign accent Marı́a (2014) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Junichi Yamagishi?
Junichi Yamagishi is affiliated with the following schools: