Tomoki Toda

Q: What Schools Are Affiliated With Tomoki Toda?

Tomoki Toda is affiliated with the following schools: Nara Institute of Science and Technology and Nagoya University.

Tomoki Toda's AcademicInfluence.com Rankings

Engineering: World Rank #5878, Historical Rank #7165
Electrical Engineering: World Rank #1687, Historical Rank #1785
Applied Physics: World Rank #1791, Historical Rank #1823


Tomoki Toda's Degrees

PhD, Electrical and Computer Engineering, Nagoya Institute of Technology
Masters, Electrical and Computer Engineering, Nagoya Institute of Technology
Bachelors, Electrical and Computer Engineering, Nagoya Institute of Technology


Tomoki Toda's Published Works


Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory (2007) (1005)
A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis (2007) (508)
Speech Synthesis Based on Hidden Markov Models (2013) (426)
Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005 (2007) (265)
Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model (2008) (249)
Speaker-Dependent WaveNet Vocoder (2017) (249)
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods (2018) (249)
Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T) (2015) (229)
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (205)
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech (2012) (196)
Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement (2012) (176)
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum (2001) (172)
The Voice Conversion Challenge 2016 (2016) (157)
XIMERA: a new TTS from ATR based on corpus-based technologies (2004) (145)
Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter (2005) (145)
Eigenvoice conversion based on Gaussian mixture model (2006) (135)
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit (2019) (135)
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation (2006) (121)
An overview of Nitech HMM-based speech synthesis system for Blizzard Challenge 2005 (2005) (115)
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion (2020) (114)
GMM-based voice conversion applied to emotional speech synthesis (2003) (103)
One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices (2007) (101)
An excitation model for HMM-based speech synthesis based on residual modeling (2007) (97)
An investigation of multi-speaker training for WaveNet vocoder (2017) (96)
Statistical Voice Conversion with WaveNet-Based Waveform Generation (2017) (91)
Acoustic-to-articulatory inversion mapping with Gaussian mixture model (2004) (87)
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance (2016) (82)
Back-Translation-Style Data Augmentation for end-to-end ASR (2018) (78)
The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006 (2008) (77)
A postfilter to modify the modulation spectrum in HMM-based speech synthesis (2014) (76)
Statistical singing voice conversion with direct waveform modification based on the spectrum differential (2014) (76)
Implementation of Computationally Efficient Real-Time Voice Conversion (2012) (75)
Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis (2004) (74)
Evaluation of cross-language voice conversion based on GMM and STRAIGHT (2001) (72)
Optimizing Segmentation Strategies for Simultaneous Speech Translation (2014) (70)
NAM-to-speech conversion with Gaussian mixture models (2005) (70)
The HTS-2008 System: Yet Another Evaluation of the Speaker-Adaptive HMM-based Speech Synthesis System in The 2008 Blizzard Challenge (2008) (68)
SAS: A speaker verification spoofing database containing diverse attacks (2015) (66)
Duration-Controlled LSTM for Polyphonic Sound Event Detection (2017) (66)
Recent development of the HMM-based speech synthesis system (HTS) (2009) (65)
Improvement to a NAM-captured whisper-to-speech system (2008) (64)
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining (2019) (63)
Silent-speech enhancement using body-conducted vocal-tract resonance signals (2010) (63)
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (62)
Speaker-Independent HMM-based Speech Synthesis System: HTS-2007 System for the Blizzard Challenge 2007 (2007) (59)
Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion (2014) (56)
Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech (2006) (52)
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder (2019) (50)
Weakly-Supervised Sound Event Detection with Self-Attention (2020) (47)
Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models (2010) (46)
An evaluation of automatic phone segmentation for concatenative speech synthesis (2004) (46)
sprocket: Open-Source Voice Conversion Software (2018) (45)
Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis (2019) (45)
Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system (2012) (44)
Automated Social Skills Trainer (2015) (43)
Simple, lexicalized choice of translation timing for simultaneous speech translation (2013) (41)
An Investigation of Noise Shaping with Perceptual Weighting for WaveNet-Based Speech Generation (2018) (39)
Generalization Ability of MOS Prediction Networks (2021) (39)
Developing Non-goal Dialog System Based on Examples of Drama Television (2012) (38)
Many-to-many eigenvoice conversion with reference voice (2009) (37)
Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion (2015) (35)
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions (2020) (35)
Conformer-Based Sound Event Detection with Semi-Supervised Learning and Data Augmentation (2020) (34)
A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation (2014) (34)
Improving body transmitted unvoiced speech with statistical voice conversion (2006) (33)
Technologies for processing body-conducted speech detected with non-audible murmur microphone (2009) (33)
Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit (2002) (32)
Convolution-Augmented Transformer for Semi-Supervised Sound Event Detection Technical Report (2020) (31)
Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis (2004) (31)
Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection (2016) (31)
Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents (2015) (31)
Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM (2008) (31)
The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018 (2018) (31)
Preserving Word-Level Emphasis in Speech-to-Speech Translation (2017) (31)
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis (2014) (31)
The NAIST Text-to-Speech System for the Blizzard Challenge 2015 (2015) (31)
Statistical singing voice conversion based on direct waveform modification with global variance (2015) (30)
Voice conversion for various types of body transmitted speech (2009) (30)
Cross-language Voice Conversion Evaluation Using Bilingual Databases (2002) (30)
Intra-gender statistical singing voice conversion with direct waveform modification using log-spectral differential (2018) (30)
Trajectory training considering global variance for HMM-based speech synthesis (2009) (30)
Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-Oriented Dialog System (2014) (29)
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS (2020) (29)
Collection of a Simultaneous Translation Corpus for Comparative Analysis (2014) (29)
Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing (2014) (28)
Collapsed speech segment detection and suppression for WaveNet vocoder (2018) (27)
High quality voice conversion based on Gaussian mixture model with dynamic frequency warping (2001) (27)
Cross-language voice conversion based on eigenvoices (2009) (27)
A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis (2017) (26)
Probabilistic modelling of F0 in unvoiced regions in HMM based speech synthesis (2009) (26)
Acquiring a Dictionary of Emotion-Provoking Events (2014) (25)
The VoiceMOS Challenge 2022 (2022) (25)
EEG signal enhancement using multi-channel Wiener filter with a spatial correlation prior (2015) (24)
The NU-NAIST Voice Conversion System for the Voice Conversion Challenge 2016 (2016) (24)
Statistical approach to enhancing esophageal speech based on Gaussian mixture models (2010) (23)
Teaching Social Communication Skills Through Human-Agent Interaction (2016) (23)
An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques (2011) (23)
Performance evaluation of the speaker-independent HMM-based speech synthesis system “HTS 2007” for the Blizzard Challenge 2007 (2008) (22)
An Investigation of Subband WaveNet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features (2018) (21)
Emphasized speech synthesis based on hidden Markov models (2009) (21)
A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion (2013) (21)
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech (2021) (21)
Learning Novel Objects for Extended Mobile Manipulation (2012) (21)
Anomalous Sound Event Detection Based on WaveNet (2018) (21)
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion (2018) (20)
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders (2019) (20)
Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children’s Narrative (2014) (20)
Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned WaveNet Vocoder (2019) (20)
Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model (2007) (20)
An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis (2006) (20)
Non-parallel training for many-to-many eigenvoice conversion (2010) (20)
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment (2018) (19)
Speaker-Independent HMM-based Speech Synthesis System (2007) (19)
NU Voice Conversion System for the Voice Conversion Challenge 2018 (2018) (19)
The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion (2009) (19)
Pretraining Techniques for Sequence-to-Sequence Voice Conversion (2020) (19)
Voice Timbre Control Based on Perceived Age in Singing Voice Conversion (2014) (18)
Modulation spectrum-based post-filter for GMM-based Voice Conversion (2014) (18)
On the Use of Phonetic Information for Mapping from Articulatory Movements to Vocal Tract Spectrum (2006) (18)
A trainable excitation model for HMM-based speech synthesis (2007) (18)
Many-to-Many Voice Transformer Network (2020) (18)
A method for translation of paralinguistic information (2012) (18)
Straight-based voice conversion algorithm based on Gaussian mixture model (2000) (17)
Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR (2015) (17)
Augmented speech production based on real-time statistical voice conversion (2014) (17)
Adaptive Training for Voice Conversion Based on Eigenvoices (2010) (17)
Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation (2018) (17)
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation (2019) (17)
Regression approaches to voice quality control based on one-to-many eigenvoice conversion (2007) (17)
F0 transformation techniques for statistical voice conversion with direct waveform modification with spectral differential (2016) (17)
ATRECSS: ATR English Speech Corpus for Speech Synthesis (2007) (17)
Evaluation of cross-language voice conversion using bilingual and non-bilingual databases (2002) (17)
Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation (2015) (16)
BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection (2017) (16)
Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations (2020) (16)
Ckylark: A More Robust PCFG-LA Parser (2015) (16)
Bandwidth Extension of Cellular Phone Speech Based on Maximum Likelihood Estimation with GMM (2008) (16)
Adaptive voice-quality control based on one-to-many eigenvoice conversion (2010) (16)
Development of preschool children subsystem for ASR and Q&A in a real-environment speech-oriented guidance task (2007) (16)
Generalizing continuous-space translation of paralinguistic information (2013) (16)
Tacotron-Based Acoustic Model Using Phoneme Alignment for Practical Neural Text-to-Speech Systems (2019) (16)
Pseudogen: A Tool to Automatically Generate Pseudo-Code from Source Code (2015) (15)
Speed or accuracy? a study in evaluation of simultaneous speech translation (2015) (15)
Acoustic model training for non-audible murmur recognition using transformed normal speech data (2011) (15)
Spectral conversion based on statistical models including time-sequence matching (2007) (15)
Multi-Head Decoder for End-to-End Speech Recognition (2018) (14)
A Speech Communication Aid System for Total Laryngectomees Using Voice Conversion of Body Transmitted Artificial Speech (2006) (14)
Improving FFTNet Vocoder with Noise Shaping and Subband Approaches (2018) (14)
Constructing a speech translation system using simultaneous interpretation data (2013) (14)
Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder (2021) (14)
Emotion and Its Triggers in Human Spoken Dialogue: Recognition and Analysis (2014) (14)
Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling (2017) (14)
Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis (2015) (14)
S3PRL-VC: Open-Source Voice Conversion Framework with Self-Supervised Speech Representations (2021) (13)
Transformer-Based Text-to-Speech with Weighted Forced Attention (2020) (13)
Linguistic Individuality Transformation for Spoken Language (2015) (13)
Multimodal HMM-based NAM-to-speech conversion (2009) (13)
Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation (2011) (13)
Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder (2019) (12)
Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments (2008) (12)
Improving Rapid Unsupervised Speaker Adaptation Based On HMM Sufficient Statistics (2006) (12)
Subband WaveNet with overlapped single-sideband filterbanks (2017) (12)
Modeling of Speech Parameter Sequence Considering Global Variance for HMM-Based Speech Synthesis (2011) (12)
Building a free, general-domain paraphrase database for Japanese (2014) (12)
Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network (2020) (12)
Speech-to-Singing Voice Conversion: The Challenges and Strategies for Improving Vocal Conversion Processes (2019) (12)
Selective EM training of acoustic models based on sufficient statistics of single utterances (2005) (11)
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV (2007) (11)
Non-verbal cognitive skills and autistic conditions: An analysis and training tool (2012) (11)
Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN (2020) (11)
An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion (2018) (11)
Post-Filters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (11)
Daily Activity Recognition with Large-Scaled Real-Life Recording Datasets Based on Deep Neural Network Using Multi-Modal Signals (2018) (11)
Speech Recognition by Simply Fine-Tuning BERT (2021) (11)
Predicting F0 and voicing from NAM-captured whispered speech (2008) (11)
Optimizing integrated cost function for segment selection in concatenative speech synthesis based on perceptual evaluations (2003) (10)
Non-Autoregressive Sequence-To-Sequence Voice Conversion (2021) (10)
Acoustic compensation methods for body transmitted speech conversion (2009) (10)
Designing speech database with prosodic variety for expressive TTS system (2002) (10)
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion (2019) (10)
Regression approaches to perceptual age control in singing voice conversion (2014) (10)
Anomalous Sound Detection with Ensemble of Autoencoder and Binary Classification Approaches Technical Report (2021) (10)
Dialogue management for leading the conversation in persuasive dialogue systems (2013) (10)
Modified post-filter to recover modulation spectrum for HMM-based speech synthesis (2014) (10)
Customer Satisfaction Estimation in Contact Center Calls Based on a Hierarchical Multi-Task Model (2020) (10)
Active Learning for Example-Based Dialog Systems (2016) (10)
Electrolaryngeal Speech Enhancement with Statistical Voice Conversion based on CLDNN (2018) (10)
NOCOA+: Multimodal Computer-Based Training for Social and Communication Skills (2015) (10)
An investigation of recurrent neural network for daily activity recognition using multi-modal signals (2017) (10)
Grapheme-to-phoneme conversion based on adaptive regularization of weight vectors (2013) (9)
Combination of Example-based and SMT-based Approaches in a Chat-oriented Dialog System (2013) (9)
The ASVspoof 2019 database (2019) (9)
Collection and analysis of a Japanese-English emphasized speech corpora (2014) (9)
Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices (2018) (9)
Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTS Conversion Systems (2008) (9)
Maximum a posteriori adaptation for many-to-one eigenvoice conversion (2008) (9)
Construction and Analysis of a Persuasive Dialogue Corpus (2014) (9)
Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics (2009) (9)
Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models (2006) (9)
An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement (2014) (9)
Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching (2008) (9)
Semantic Parsing of Ambiguous Input through Paraphrasing and Verification (2015) (8)
Voice conversion based on mixtures of factor analyzers (2006) (8)
Deep neural network-based power spectrum reconstruction to improve quality of vocoded speech with limited acoustic parameters (2018) (8)
High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC (2021) (8)
Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion (2010) (8)
Enhancement of Esophageal Speech Using Statistical Voice Conversion (2009) (8)
High-quality and flexible speech synthesis with segment selection and voice conversion (2003) (8)
Implementation of F0 transformation for statistical singing voice conversion based on direct waveform modification (2016) (8)
Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments (2015) (8)
An investigation of acoustic features for singing voice conversion based on perceptual age (2013) (8)
The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation (2012) (8)
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network (2020) (8)
Learning cooperative persuasive dialogue policies using framing (2016) (8)
Conversation dialog corpora from television and movie scripts (2014) (7)
Articulatory Controllable Speech Modification Based on Statistical Inversion and Production Mappings (2017) (7)
An empirical comparison of joint optimization techniques for speech translation (2013) (7)
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders (2020) (7)
Towards High-Reliability Speech Translation in the Medical Domain (2013) (7)
A noise suppression method for body-conducted soft speech based on non-negative tensor factorization of air- and body-conducted signals (2017) (7)
Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder (2019) (7)
Statistical approaches to enhancement of body-conducted speech detected with non-audible murmur microphone (2012) (7)
Improving translation of emphasis with pause prediction in speech-to-speech translation systems (2015) (7)
Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition (2022) (7)
Efficient Shallow WaveNet Vocoder Using Multiple Samples Output Based on Laplacian Distribution and Linear Prediction (2020) (7)
Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification (2014) (7)
Incremental sentence compression using LSTM recurrent networks (2015) (7)
The Voice Conversion Challenge 2018: database and results (2018) (7)
Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling (2020) (7)
The use of semantic and acoustic features for open-domain TED talk summarization (2014) (7)
A latent variable model for joint pause prediction and dependency parsing (2015) (7)
Investigation of training data size for real-time neural vocoders on CPUs (2021) (7)
Statistical approach to voice quality control in esophageal speech enhancement (2012) (6)
An end-to-end model for cross-lingual transformation of paralinguistic information (2018) (6)
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion (2013) (6)
Direct F0 control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation (2014) (6)
The KIT-NAIST (contrastive) English ASR system for IWSLT 2012 (2012) (6)
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN (2021) (6)
Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives (2017) (6)
A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows (2015) (6)
Construction and analysis of social-affective interaction corpus in English and Indonesian (2015) (6)
Discriminative Language Models as a Tool for Machine Translation Error Analysis (2014) (6)
On the state definition for a trainable excitation model in HMM-based speech synthesis (2008) (6)
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training (2016) (6)
Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing (2011) (6)
Speech Emotion Recognition Based on Listener Adaptive Models (2021) (5)
Noise Level Limited Sub-Modeling for Diffusion Probabilistic Vocoders (2021) (5)
Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation (2014) (5)
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion (2021) (5)
An Analysis Towards Dialogue-Based Deception Detection (2015) (5)
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework (2016) (5)
A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and fo Estimation (2017) (5)
Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU (2021) (5)
Eigenvoice-Based Approach to Voice Conversion and Voice Quality Control (2009) (5)
A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems (2020) (5)
Improving Pivot Translation by Remembering the Pivot (2015) (5)
Environmental sound processing and its applications (2019) (5)
Improvements of the One-to-Many Eigenvoice Conversion System (2010) (5)
Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models (2006) (5)
Noisy-to-Noisy Voice Conversion Framework with Denoising Model (2021) (5)
An Extended Mobile Manipulation Robot Learning Novel Objects (2012) (5)
A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis (2009) (5)
Investigations of Real-time Gaussian FFTNet and Parallel WaveNet Neural Vocoders with Simple Acoustic Features (2019) (5)
Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method (2008) (5)
Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics (2015) (5)
Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models (2014) (5)
Daily activity recognition based on recurrent neural network using multi-modal signals (2018) (5)
Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion (2014) (5)
An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering (2015) (5)
Segment selection considering local degradation of naturalness in concatenative speech synthesis (2003) (5)
NAIST at the CLEF 2013 QA4MRE Pilot Task (2013) (5)
Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN (2021) (5)
Towards Identity Preserving Normal to Dysarthric Voice Conversion (2021) (5)
HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network (2021) (4)
Vowel Recognition Based on Surface Electromyography with Electrode Grid on Submental Region (2012) (4)
The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 (2008) (4)
Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds (2013) (4)
Scene-dependent Anomalous Acoustic-event Detection Based on Conditional WaveNet and I-vector (2019) (4)
Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics (2016) (4)
Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds (2019) (4)
Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion (2021) (4)
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation (2020) (4)
Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees (2007) (4)
Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation (2022) (4)
The NAIST Simultaneous Translation Corpus (2018) (4)
A Segment Selection Algorithm for Japanese Concatenative Speech Synthesis Based on Both Phoneme Unit and Diphone Unit (2002) (4)
Speech Parameter Generation Algorithm Considering Modulation Spectrum for Statistical Parametric Speech Synthesis (2015) (4)
Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training (2006) (4)
An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis (2012) (4)
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System (2014) (4)
An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction (2015) (4)
Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential (2015) (4)
Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion (2013) (3)
On Prosody Modeling for ASR+TTS Based Voice Conversion (2021) (3)
Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer (2002) (3)
Blind speech extraction for Non-Audible Murmur speech with speaker's movement noise (2012) (3)
Speech emotion recognition based on listener-dependent emotion perception models (2021) (3)
Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion (2019) (3)
The NAIST machine translation system for IWSLT2012 (2012) (3)
Anomalous Sound Detection Using a Binary Classification Model and Class Centroids (2021) (3)
Narrow Adaptive Regularization of weights for grapheme-to-phoneme conversion (2014) (3)
Low-Latency Real-Time Non-Parallel Voice Conversion based on Cyclic Variational Autoencoder and Multiband WaveRNN with Data-Driven Linear Prediction (2021) (3)
Designing a Pneumatic Bionic Voice Prosthesis - A Statistical Approach for Source Excitation Generation (2018) (3)
Modality and contextual differences in computer based non-verbal communication training (2013) (3)
An improved one-to-many eigenvoice conversion system (2008) (3)
An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets (2007) (3)
Memorable spoken quote corpora of TED public speaking (2014) (3)
Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006 (2006) (3)
Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training (2008) (3)
Adaptive selection from multiple response candidates in example-based dialogue (2015) (3)
Audio-visual Voice Conversion Using Deep Canonical Correlation Analysis for Deep Bottleneck Features (2018) (3)
Development of a Real-time Bionic Voice Generation System based on Statistical Excitation Prediction (2019) (3)
Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model (2014) (3)
Noise suppression method for body-conducted soft speech enhancement based on external noise monitoring (2016) (3)
Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion (2014) (3)
An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction (2014) (3)
Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression (2020) (3)
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function (2015) (3)
Evaluation of a Fully Automatic Cooperative Persuasive Dialogue System (2015) (3)
Electrolaryngeal speech modification towards singing aid system for laryngectomees (2017) (3)
Multi-Stream HiFi-GAN with Data-Driven Waveform Decomposition (2021) (3)
Neural speech-rate conversion with multispeaker WaveNet vocoder (2022) (3)
Voice Conversion Algorithm Based on Gaussian Mixture Model Applied to STRAIGHT (2000) (2)
Evaluation of electrolarynx controlled by real-time statistical F0 prediction (2016) (2)
Evaluation of eigenvoice conversion based on Gaussian mixture model (2006) (2)
Model training using parallel data with mismatched pause positions in statistical esophageal speech enhancement (2012) (2)
Rule-based Syntactic Preprocessing for Syntax-based Machine Translation (2014) (2)
Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models (2010) (2)
Semi-Supervised Self-Produced Speech Enhancement and Suppression Based on Joint Source Modeling of Air- and Body-Conducted Signals Using Variational Autoencoder (2020) (2)
Self-Produced Speech Enhancement and Suppression Method using Air- and Body-Conductive Microphones (2018) (2)
Introduction to the Special Section on Voice Transformation (2010) (2)
Spoofing and Anti-Spoofing (SAS) corpus v1.0 (2015) (2)
Speaking-Aid Systems Based on One-to-Many Eigenvoice Conversion for Total Laryngectomees (2010) (2)
An event-related brain potential study on the impact of speech recognition errors (2014) (2)
PROSODY-CONTROLLABLE HMM-BASED SPEECH SYNTHESIS USING SPEECH INPUT (2015) (2)
Speaker Adaptive Training for Voice Conversion based on Eigenvoice (2006) (2)
Communicative speech synthesis with XIMERA: a first step (2007) (2)
Statistical Voice Conversion with Quasi-Periodic WaveNet Vocoder (2019) (2)
A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction (2017) (2)
Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection (2007) (2)
Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE (2013) (2)
The NICT Entry for the Blizzard Challenge 2009: an Enhanced HMM-based Speech Synthesis System with Trajectory Training Considering Global Variance and State-Dependent Mixed Excitation (2009) (2)
Acoustic Compensation Method for Accepting Different Recording Devices in Body-Conducted Voice Conversion (2010) (2)
Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework (2016) (2)
High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling (2021) (2)
An Investigation of Streaming Non-Autoregressive sequence-to-sequence Voice Conversion (2022) (2)
X-Ray Structure Determination and NMR Characterization of Some Fused Heterocycles with a 1,3,5-Triazine-2,4(1H,3H)-dione Ring. Reaction of 2-Amino-4(3H)-pyrimidinone with Chloroformyl Isocyanate (1988) (2)
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language (2022) (2)
Real-time vibration control of an electrolarynx based on statistical F0 contour prediction (2016) (2)
Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus (2014) (2)
Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric (2013) (2)
Improved training of excitation for HMM-based parametric speech synthesis (2010) (2)
An Investigation of Features for Fundamental Frequency Pattern Prediction in Electrolaryngeal Speech Enhancement (2019) (2)
NICT Blizzard Challenge 2010 Entry (2010) (1)
NOCOA: A Computer-Based Training Tool for Social and Communication Skills That Exploits Non-verbal Behaviors (2013) (1)
Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder (2021) (1)
Study on Word-Level Emphasis Across English and Japanese (2015) (1)
A Study on the Speech Synthesis Method by Using Database with Variety of Speech-Rate (2002) (1)
Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model (2016) (1)
Comparison of real-time multi-speaker neural vocoders on CPUs (2022) (1)
Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification (2004) (1)
Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals (2021) (1)
Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion (2016) (1)
An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder (2020) (1)
A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion (2022) (1)
Designing Target Cost Function Based on Prosody of Speech Database (2005) (1)
Voice Conversion Challenge 2020 Listening Test Data (2020) (1)
Computationally efficient body-conducted voice conversion with original excitation signals (2011) (1)
Voice conversion for enhancing various types of body‐conducted speech detected with non‐audible murmur microphone (2010) (1)
Speech Enhancement Using Non-Negative Spectrogram Models with Mel-Generalized Cepstral Regularization (2017) (1)
Improving Body Transmitted Unvoiced Spee (2006) (1)
Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder (2020) (1)
An Ensemble Approach to Anomalous Sound Detection Based on Conformer-Based Autoencoder and Binary Classifier Incorporated with Metric Learning (2021) (1)
An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments (2014) (1)
Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition (2022) (1)
Note-level Automatic Guitar Transcription Using Attention Mechanism (2022) (1)
An Evaluation through Simulation of Electrolarynx Control based on Statistical F0 Prediction for Multiple Speakers (2014) (1)
Ongaku Symposium 2014 : The 2nd Symposium on All Topics Related to Acoustics, Audition and Natural Language (2014) (1)
Anaphora Resolution for Transforming Regular Expressions into Honorifics in Japanese (2014) (1)
Investigation of Shallow Wavenet Vocoder with Laplacian Distribution Output (2019) (1)
An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer (2016) (1)
Improvements to HMM-based speech synthesis based on parameter generation with rich context models (2013) (1)
An evaluation of EEG ocular artifact removal with a multi-channel wiener filter based on probabilistic generative model (2015) (1)
An end-to-end model for cross-lingual transformation of paralinguistic information (2018) (1)
Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior (2016) (1)
A hearing impairment simulation method using audiogram-based approximation of auditory characteristics (2014) (1)
Investigation of Text-to-Speech-based Synthetic Parallel Data for Sequence-to-Sequence Non-Parallel Voice Conversion (2021) (1)
Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization (2017) (1)
Two-Stage Training Method for Japanese Electrolaryngeal Speech Enhancement Based on Sequence-to-Sequence Voice Conversion (2022) (1)
An evaluation of acoustic-to-articulatory inversion mapping with latent trajectory Gaussian mixture model (Signal Processing) (2016) (1)
A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models (2016) (1)
Non-verbal Communication Training with an Interactive Multimedia Application (2014) (1)
Study on conversion-accuracy on speaker individuality of voice conversion algorithm with dynamic frequency warping (2001) (1)
Improving quality of small body transmitted ordinary speech with statistical voice conversion (2006) (1)
Convolutional bidirectional long short-term memory hidden Markov model hybrid system for polyphonic sound event detection (2016) (1)
Stereo channel music signal separation based on non-negative tensor factorization with cepstrum regularization (2016) (1)
Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices (2016) (1)
Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics (2007) (1)
Physically Constrained Statistical F0 Prediction for Electrolaryngeal Speech Enhancement (2017) (1)
Example Based Dialogue System Based on Satisfaction Prediction (2016) (1)
Designing and evaluation of speech database with prosodic variety (2002) (1)
Simple designing methods of corpus-based visual speech synthesis (2003) (1)
The AS-NU System for the M2VoC Challenge (2021) (1)
Low delay statistical singing voice conversion with direct waveform modification based on spectral differential considering global variance (2016) (1)
Adaptive Approach to Varying Recording Conditions in Body Transmitted Voice Conversion Based on Acoustic Compensation (2009) (1)
Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals (2014) (1)
Connectionist Temporal Classification-based Sound Event Encoder for Converting Sound Events into Onomatopoeic Representations (2018) (1)
Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling (2021) (1)
Improving Singing Aid System for Laryngectomees With Statistical Voice Conversion and VAE-SPACE (2019) (1)
Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis (2015) (1)
Representation of Vocal Tract Length Transformation Based on Group Theory (2023) (0)
A Joint Model for Pause Prediction and Dependency Parsing using Latent Variables (2016) (0)
Analysis of Noisy-target Training for DNN-based speech enhancement (2022) (0)
Modified Sound Field Interpolation Method for Rotation-robust Beamforming with Unequally Spaced Circular Microphone Array (2022) (0)
Analysis of Emphasis on Japanese-English Bilingual Corpora (2014) (0)
Voice Conversion Challenge 2020 -- submitted waveforms v1.0.0 (2021) (0)
Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring (2016) (0)
Real-time Cepstrum Mean Normalization Using Codebook (2006) (0)
Voice Conversion Based on Mixtu (2006) (0)
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (0)
Sequence-wise Optimization for Quasi-Harmonic Speech Waveform Modeling (2022) (0)
Acoustic Modeling for Spoke Based on Unsupervised Utterance (2006) (0)
Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000 (2020) (0)
Reaction of Isocytosine with N-Chlorocarbonyl Isocyanate. (1986) (0)
F0 Contour Generation Using Rich Context Models in HMM-Based Speech Synthesis (2013) (0)
An Evaluation of Articulatory Controllable Speech Modification based on Gaussian Mixture Models with Direct Waveform Modification (2015) (0)
SUPERSEDED - The Voice Conversion Challenge 2016 (2016) (0)
Paper Template for INTERSPEECH 2015 (2019) (0)
Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder (2022) (0)
Maximum Likelihood Voice Con with STRAIGHT Mix (2006) (0)
Acoustic modeling of spontaneous speech of Japanese preschool children (2006) (0)
The NAIST English speech recognition system for IWSLT 2013 (2013) (0)
A Dialog System with Human-to-Human Conversation Example (2014) (0)
Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices. (2016) (0)
Voice Conversion Challenge 2020 database v1.0 (2020) (0)
NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit (2022) (0)
Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage (2022) (0)
Mandarin Electro-Laryngeal Speech Enhancement based on Statistical Voice Conversion and Manual Tone Control (2021) (0)
A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection (2015) (0)
An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions (2022) (0)
Phoneme Embeddings on Predicting Fundamental Frequency Pattern for Electrolaryngeal Speech (2020) (0)
The Voice Conversion Challenge 2016 (2016) (0)
Singing Fundamental Frequency Contour Generation Using Generalized Command-Response Model and Score-Conditional Variational Autoencoder (2021) (0)
Bottleneck Features for Emotional Speech Recognition (2015) (0)
The Network-based Multilingual ASR System Towards Multilingual Conversations in Medical Domain (2014) (0)
An Evaluation of Discriminative Training for Hidden Markov Models in a Real-Environment Speech-Oriented Guidance System (2010) (0)
E-Society Software Development Project for Speech Recognition and Synthesis(Technical Report) (2003) (0)
Linear transformation approaches to many-to-one voice conversion (2010) (0)
PREDICTION FOR ELECTROLARYNGEAL SPEECH ENHANCEMENT CONSIDERING GENERATIVE PROCESS OF F0 CONTOURS WITHIN PRODUCT OF EXPERTS FRAMEWORK (2016) (0)
Quality Improvement Approaches Based on the Modulation Spectrum to Statistical Parametric Speech Synthesis (2015) (0)
Comparison of Effective Features and Analysis of Questions Towards Dialogue-based Deception Detection (2014) (0)
Statistical approach to perceived age control of singing voice (2014) (0)
Key files for Spoofing and Anti-Spoofing (SAS) corpus v1.0 (2017) (0)
Statistical conversion of speech parameter trajectory for mapping between features of different modalities (2008) (0)
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System (2022) (0)
Statistical voice conversion techniques for alaryngeal speech enhancement (2013) (0)
Multiple‐prosody speech databases and their effectiveness in high‐quality speech synthesis at arbitrary rates (2005) (0)
Speaking Aid System for Total Laryngect Body Transmitted Art (2006) (0)
Error Selection Methods for Machine Translation Error Analysis (2016) (0)
ON THE USE OF PHONETIC INFORMATION FOR MAPPING FROM ARTICULATORY MOVEMENTS TO VOCAL TRACT SPECTRUM (2012) (0)
Language Model Adaptation and Analysis for Individuality Transforming (2014) (0)
Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment (2019) (0)
An investigation of how to design control parameters for statistical voice timbre control (2017) (0)
Interpretable Control for Emotional Text-to-Speech System toward Development of Sympathetic Educational-Support Robots (2022) (0)
Probabilistic Enhancement of EEG Component Using Prior Distribution of Correlations Between Channels (2014) (0)
VOICE CONVERSION FOR VARIOUS TYPES OF BODY TRANSMITTED SPEECH (2012) (0)
Learning Novel Objects for Extended Mobile Manipulation (2011) (0)
Articulatory Controllable Speech Modification using Sequential Inversion and Production Mapping with Gaussian Mixture Models (Speech) -- (16th Spoken Language Symposium) (2014) (0)
Word-level Emphasis Transfer in Speech-to-speech Translation (2016) (0)
Unnecessary utterance detection for avoiding digressions in discussion (2014) (0)
Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning (2022) (0)
Unknown Word Detection Based on Event-Related Brain Desynchronization Responses (2015) (0)
Low-Latency Electrolaryngeal Speech Enhancement Based on Fastspeech2-Based Voice Conversion and Self-Supervised Speech Representation (2023) (0)
Proc. 2009 Asia-Pacific Signal and Information Processing Association (APSIPA) (2009) (0)
Transcription cost reduction for Acoustic model construction by speech data selection based on acoustic likelihoods (2005) (0)
Recursive neural network paraphrase identification for example-based dialog retrieval (2014) (0)
A Dialog System to Detect Deception (2015) (0)
UTTERANCE-BASED SELECTIVE TRAINING FOR COST-EFFECTIVE TASK-ADAPTATION OF ACOUSTIC MODELS (2006) (0)
Recognition and Analysis of Emotion in Indonesian Conversational Speech (Speech) -- (16th Spoken Language Symposium) (2014) (0)
Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization (2018) (0)
Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure (2022) (0)
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion (2021) (0)
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder (2022) (0)
Development of "KamiRepo" system with automatic student identification to handle handwritten assignments on LMS (2018) (0)
Missing component restoration for masked speech signals based on time-domain spectrogram factorization (2017) (0)
Direction-aware target speaker extraction with a dual-channel system based on conditional variational autoencoders under underdetermined conditions (2022) (0)
Proposed Voice Conversion System with Quasi-periodic WaveNet Vocoder (2019) (0)
Total LaryngectomeesLaryngeal speaker Tracheostoma Expired air Nasal cavity Vocal folds Oral cavity Trachea Expired air Esophagus (2011) (0)
ICSLP 2006 Summary -Acoustic Modeling and Speech Synthesis- (2006) (0)
English-Read-By-Japanese Speech Synthesis Preserving Speaker Individuality Based on Partial Correction of Prosody and Phonetic Sounds and Effects of English Proficiency Level on Its Performance (2015) (0)
The Voice Conversion Challenge, 2016: multidimensional scaling (MDS) listening test results (2016) (0)
