Tomoki Toda
#148,269
Most Influential Person Now
Tomoki Toda's AcademicInfluence.com Rankings
Tomoki Todaengineering Degrees
Engineering
#5878
World Rank
#7165
Historical Rank
Electrical Engineering
#1687
World Rank
#1785
Historical Rank
Applied Physics
#1791
World Rank
#1823
Historical Rank

Download Badge
Engineering
Tomoki Toda's Degrees
- PhD Electrical and Computer Engineering Nagoya Institute of Technology
- Masters Electrical and Computer Engineering Nagoya Institute of Technology
- Bachelors Electrical and Computer Engineering Nagoya Institute of Technology
Why Is Tomoki Toda Influential?
(Suggest an Edit or Addition)Tomoki Toda's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory (2007) (1005)
- A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis (2007) (508)
- Speech Synthesis Based on Hidden Markov Models (2013) (426)
- Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005 (2007) (265)
- Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model (2008) (249)
- Speaker-Dependent WaveNet Vocoder (2017) (249)
- The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods (2018) (249)
- Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T) (2015) (229)
- Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (205)
- Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech (2012) (196)
- Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement (2012) (176)
- Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum (2001) (172)
- The Voice Conversion Challenge 2016 (2016) (157)
- XIMERA: a new TTS from ATR based on corpus-based technologies (2004) (145)
- Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter (2005) (145)
- Eigenvoice conversion based on Gaussian mixture model (2006) (135)
- Espnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit (2019) (135)
- Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation (2006) (121)
- An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005 (2005) (115)
- Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion (2020) (114)
- GMM-based voice conversion applied to emotional speech synthesis (2003) (103)
- One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices (2007) (101)
- An excitation model for HMM-based speech synthesis based on residual modeling (2007) (97)
- An investigation of multi-speaker training for wavenet vocoder (2017) (96)
- Statistical Voice Conversion with WaveNet-Based Waveform Generation (2017) (91)
- Acoustic-to-articulatory inversion mapping with Gaussian mixture model (2004) (87)
- Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance (2016) (82)
- Back-Translation-Style Data Augmentation for end-to-end ASR (2018) (78)
- The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006 (2008) (77)
- A postfilter to modify the modulation spectrum in HMM-based speech synthesis (2014) (76)
- Statistical singing voice conversion with direct waveform modification based on the spectrum differential (2014) (76)
- Implementation of Computationally Efficient Real-Time Voice Conversion (2012) (75)
- Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis (2004) (74)
- Evaluation of cross-language voice conversion based on GMM and straight (2001) (72)
- Optimizing Segmentation Strategies for Simultaneous Speech Translation (2014) (70)
- NAM-to-speech conversion with Gaussian mixture models (2005) (70)
- The HTS-2008 System: Yet Another Evaluation of the Speaker-Adaptive HMM-based Speech Synthesis System in The 2008 Blizzard Challenge (2008) (68)
- SAS: A speaker verification spoofing database containing diverse attacks (2015) (66)
- Duration-Controlled LSTM for Polyphonic Sound Event Detection (2017) (66)
- Recent development of the HMM-based speech synthesis system (HTS) (2009) (65)
- Improvement to a NAM-captured whisper-to-speech system (2008) (64)
- Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining (2019) (63)
- Silent-speech enhancement using body-conducted vocal-tract resonance signals (2010) (63)
- Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (62)
- Speaker-Independent HMM-based Speech Synthesis System: HTS-2007 System for the Blizzard Challenge 2007 (2007) (59)
- Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion (2014) (56)
- Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech (2006) (52)
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder (2019) (50)
- Weakly-Supervised Sound Event Detection with Self-Attention (2020) (47)
- Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models (2010) (46)
- An evaluation of automatic phone segmentation for concatenative speech synthesis (2004) (46)
- sprocket: Open-Source Voice Conversion Software (2018) (45)
- Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis (2019) (45)
- Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system (2012) (44)
- Automated Social Skills Trainer (2015) (43)
- Simple, lexicalized choice of translation timing for simultaneous speech translation (2013) (41)
- An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation (2018) (39)
- Generalization Ability of MOS Prediction Networks (2021) (39)
- Developing Non-goal Dialog System Based on Examples of Drama Television (2012) (38)
- Many-to-many eigenvoice conversion with reference voice (2009) (37)
- Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion (2015) (35)
- Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions (2020) (35)
- CONFORMER-BASED SOUND EVENT DETECTION WITH SEMI-SUPERVISED LEARNING AND DATA AUGMENTATION (2020) (34)
- A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation (2014) (34)
- Improving body transmitted unvoiced speech with statistical voice conversion (2006) (33)
- Technologies for processing body-conducted speech detected with non-audible murmur microphone (2009) (33)
- Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit (2002) (32)
- CONVOLUTION-AUGMENTED TRANSFORMER FOR SEMI-SUPERVISED SOUND EVENT DETECTION Technical Report (2020) (31)
- Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis (2004) (31)
- Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection (2016) (31)
- Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents (2015) (31)
- Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM (2008) (31)
- The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018 (2018) (31)
- Preserving Word-Level Emphasis in Speech-to-Speech Translation (2017) (31)
- Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis (2014) (31)
- The NAIST Text-to-Speech System for the Blizzard Challenge 2015 (2015) (31)
- Statistical singing voice conversion based on direct waveform modification with global variance (2015) (30)
- Voice conversion for various types of body transmitted speech (2009) (30)
- Cross-language Voice Conversion Evaluation Using Bilingual Databases (2002) (30)
- Intra-gender statistical singing voice conversion with direct waveform modification using log-spectral differential (2018) (30)
- Trajectory training considering global variance for HMM-based speech synthesis (2009) (30)
- Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-Oriented Dialog System (2014) (29)
- The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS (2020) (29)
- Collection of a Simultaneous Translation Corpus for Comparative Analysis (2014) (29)
- Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing (2014) (28)
- Collapsed speech segment detection and suppression for WaveNet vocoder (2018) (27)
- High quality voice conversion based on Gaussian mixture model with dynamic frequency warping (2001) (27)
- Cross-language voice conversion based on eigenvoices (2009) (27)
- A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis (2017) (26)
- Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis (2009) (26)
- Acquiring a Dictionary of Emotion-Provoking Events (2014) (25)
- The VoiceMOS Challenge 2022 (2022) (25)
- EEG signal enhancement using multi-channel wiener filter with a spatial correlation prior (2015) (24)
- The NU-NAIST Voice Conversion System for the Voice Conversion Challenge 2016 (2016) (24)
- Statistical approach to enhancing esophageal speech based on Gaussian mixture models (2010) (23)
- Teaching Social Communication Skills Through Human-Agent Interaction (2016) (23)
- An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques (2011) (23)
- Performance evaluation of the speaker-independent HMM-based speech synthesis system “HTS 2007” for the Blizzard Challenge 2007 (2008) (22)
- An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features (2018) (21)
- Emphasized speech synthesis based on hidden Markov models (2009) (21)
- A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion (2013) (21)
- LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech (2021) (21)
- Learning Novel Objects for Extended Mobile Manipulation (2012) (21)
- Anomalous Sound Event Detection Based on WaveNet (2018) (21)
- Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion (2018) (20)
- Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders (2019) (20)
- Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children’s Narrative (2014) (20)
- Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder (2019) (20)
- Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model (2007) (20)
- An evaluation of cost functions sensitively capturing local degradation of naturalness for segment selection in concatenative speech synthesis (2006) (20)
- Non-parallel training for many-to-many eigenvoice conversion (2010) (20)
- A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment (2018) (19)
- Speaker-Independent HMM-based Speech Synthesis System (2007) (19)
- NU Voice Conversion System for the Voice Conversion Challenge 2018 (2018) (19)
- The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion (2009) (19)
- Pretraining Techniques for Sequence-to-Sequence Voice Conversion (2020) (19)
- Voice Timbre Control Based on Perceived Age in Singing Voice Conversion (2014) (18)
- Modulation spectrum-based post-filter for GMM-based Voice Conversion (2014) (18)
- On the Use of Phonetic Information for Mapping from Articulatory Movements to Vocal Tract Spectrum (2006) (18)
- A trainable excitation model for HMM-based speech synthesis (2007) (18)
- Many-to-Many Voice Transformer Network (2020) (18)
- A method for translation of paralinguistic information (2012) (18)
- Straight-based voice conversion algorithm based on Gaussian mixture model (2000) (17)
- Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR (2015) (17)
- Augmented speech production based on real-time statistical voice conversion (2014) (17)
- Adaptive Training for Voice Conversion Based on Eigenvoices (2010) (17)
- Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation (2018) (17)
- Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation (2019) (17)
- Regression approaches to voice quality controll based on one-to-many eigenvoice conversion (2007) (17)
- F0 transformation techniques for statistical voice conversion with direct waveform modification with spectral differential (2016) (17)
- ATRECSS — ATR ENGLISH SPEECH CORPUS FOR SPEECH SYNTHESIS (2007) (17)
- Evaluation of cross-language voice conversion using bilingual and non-bilingual databases (2002) (17)
- Aliasing-free implementation of discrete-time glottal source models and their applications to speech synthesis and F0 extractor evaluation (2015) (16)
- BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection (2017) (16)
- Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations (2020) (16)
- Ckylark: A More Robust PCFG-LA Parser (2015) (16)
- BANDWIDTH EXTENSION OF CELLULAR PHONE SPEECH BASED ON MAXIMUM LIKELIHOOD ESTIMATION WITH GMM (2008) (16)
- Adaptive voice-quality control based on one-to-many eigenvoice conversion (2010) (16)
- Development of preschool children subsystem for ASR and q&a in a real-environment speech-oriented guidance task (2007) (16)
- Generalizing continuous-space translation of paralinguistic information (2013) (16)
- Tacotron-Based Acoustic Model Using Phoneme Alignment for Practical Neural Text-to-Speech Systems (2019) (16)
- Pseudogen: A Tool to Automatically Generate Pseudo-Code from Source Code (2015) (15)
- Speed or accuracy? a study in evaluation of simultaneous speech translation (2015) (15)
- Acoustic model training for non-audible murmur recognition using transformed normal speech data (2011) (15)
- Spectral conversion based on statistical models including time-sequence matching (2007) (15)
- Multi-Head Decoder for End-to-End Speech Recognition (2018) (14)
- A Speech Communication Aid System for Total Laryngectomees Using Voice Conversion of Body Transmitted Artificial Speech (2006) (14)
- Improving FFTNet Vocoder with Noise Shaping and Subband Approaches (2018) (14)
- Constructing a speech translation system using simultaneous interpretation data (2013) (14)
- Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder (2021) (14)
- Emotion and Its Triggers in Human Spoken Dialogue: Recognition and Analysis (2014) (14)
- Deep acoustic-to-articulatory inversion mapping with latent trajectory modeling (2017) (14)
- Parameter generation algorithm considering Modulation Spectrum for HMM-based speech synthesis (2015) (14)
- S3PRL-VC: Open-Source Voice Conversion Framework with Self-Supervised Speech Representations (2021) (13)
- Transformer-Based Text-to-Speech with Weighted Forced Attention (2020) (13)
- Linguistic Individuality Transformation for Spoken Language (2015) (13)
- Multimodal HMM-based NAM-to-speech conversion (2009) (13)
- Speaker-Adaptive Speech Synthesis Based on Eigenvoice Conversion and Language-Dependent Prosodic Conversion in Speech-to-Speech Translation (2011) (13)
- Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder (2019) (12)
- Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments (2008) (12)
- Improving Rapid Unsupervised Speaker Adaptation Based On Hmm Sufficient Statistics (2006) (12)
- Subband wavenet with overlapped single-sideband filterbanks (2017) (12)
- Modeling of Speech Parameter Sequence Considering Global Variance for HMM-Based Speech Synthesis (2011) (12)
- Building a free, general-domain paraphrase database for Japanese (2014) (12)
- Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network (2020) (12)
- Speech-to-Singing Voice Conversion: The Challenges and Strategies for Improving Vocal Conversion Processes (2019) (12)
- Selective EM training of acoustic models based on sufficient statistics of single utterances (2005) (11)
- Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV (2007) (11)
- Non-verbal cognitive skills and autistic conditions: An analysis and training tool (2012) (11)
- Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN (2020) (11)
- An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion (2018) (11)
- Post-Filters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis (2016) (11)
- Daily Activity Recognition with Large-Scaled Real-Life Recording Datasets Based on Deep Neural Network Using Multi-Modal Signals (2018) (11)
- Speech Recognition by Simply Fine-Tuning Bert (2021) (11)
- Predicting F0 and voicing from NAM-captured whispered speech (2008) (11)
- Optimizing integrated cost function for segment selection in concatenative speech synthesis based on perceptual evaluations (2003) (10)
- Non-Autoregressive Sequence-To-Sequence Voice Conversion (2021) (10)
- Acoustic compensation methods for body transmitted speech conversion (2009) (10)
- Designing speech database with prosodic variety for expressive TTS system (2002) (10)
- Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion (2019) (10)
- Regression approaches to perceptual age control in singing voice conversion (2014) (10)
- ANOMALOUS SOUND DETECTION WITH ENSEMBLE OF AUTOENCODER AND BINARY CLASSIFICATION APPROACHES Technical Report (2021) (10)
- Dialogue management for leading the conversation in persuasive dialogue systems (2013) (10)
- Modified post-filter to recover modulation spectrum for HMM-based speech synthesis (2014) (10)
- Customer Satisfaction Estimation in Contact Center Calls Based on a Hierarchical Multi-Task Model (2020) (10)
- Active Learning for Example-Based Dialog Systems (2016) (10)
- Electrolaryngeal Speech Enhancement with Statistical Voice Conversion based on CLDNN (2018) (10)
- NOCOA+: Multimodal Computer-Based Training for Social and Communication Skills (2015) (10)
- An investigation of recurrent neural network for daily activity recognition using multi-modal signals (2017) (10)
- Grapheme-to-phoneme conversion based on adaptive regularization of weight vectors (2013) (9)
- Combination of Example-based and SMT-based Approaches in a Chat-oriented Dialog System (2013) (9)
- The ASVspoof 2019 database (2019) (9)
- Collection and analysis of a Japanese-English emphasized speech corpora (2014) (9)
- Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices (2018) (9)
- Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems (2008) (9)
- Maximum a posteriori adaptation for many-to-one eigenvoice conversion (2008) (9)
- Construction and Analysis of a Persuasive Dialogue Corpus (2014) (9)
- Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics (2009) (9)
- Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models (2006) (9)
- An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement (2014) (9)
- Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching (2008) (9)
- Semantic Parsing of Ambiguous Input through Paraphrasing and Verification (2015) (8)
- Voice conversion based on mixtures of factor analyzers (2006) (8)
- Deep neural network-based power spectrum reconstruction to improve quality of vocoded speech with limited acoustic parameters (2018) (8)
- High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC (2021) (8)
- Evaluation of Extremely Small Sound Source Signals Used in Speaking-Aid System with Statistical Voice Conversion (2010) (8)
- Enhancement of Esophageal Speech Using Statistical Voice Conversion (2009) (8)
- High-quality and flexible speech synthesis with segment selection and voice conversion (2003) (8)
- Implementation of F0 transformation for statistical singing voice conversion based on direct waveform modification (2016) (8)
- Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments (2015) (8)
- An investigation of acoustic features for singing voice conversion based on perceptual age (2013) (8)
- The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation (2012) (8)
- Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network (2020) (8)
- Learning cooperative persuasive dialogue policies using framing (2016) (8)
- Conversation dialog corpora from television and movie scripts (2014) (7)
- Articulatory Controllable Speech Modification Based on Statistical Inversion and Production Mappings (2017) (7)
- An empirical comparison of joint optimization techniques for speech translation (2013) (7)
- The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders (2020) (7)
- Towards High-Reliability Speech Translation in the Medical Domain (2013) (7)
- A noise suppression method for body-conducted soft speech based on non-negative tensor factorization of air- and body-conducted signals (2017) (7)
- Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder (2019) (7)
- Statistical approaches to enhancement of body-conducted speech detected with non-audible murmur microphone (2012) (7)
- Improving translation of emphasis with pause prediction in speech-to-speech translation systems (2015) (7)
- Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition (2022) (7)
- Efficient Shallow Wavenet Vocoder Using Multiple Samples Output Based on Laplacian Distribution and Linear Prediction (2020) (7)
- Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification (2014) (7)
- Incremental sentence compression using LSTM recurrent networks (2015) (7)
- The Voice Conversion Challenge 2018: database and results (2018) (7)
- Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling (2020) (7)
- The use of semantic and acoustic features for open-domain TED talk summarization (2014) (7)
- A latent variable model for joint pause prediction and dependency parsing (2015) (7)
- Investigation of training data size for real-time neural vocoders on CPUs (2021) (7)
- Statistical approach to voice quality control in esophageal speech enhancement (2012) (6)
- An end-to-end model for cross-lingual transformation of paralinguistic information (2018) (6)
- A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion (2013) (6)
- Direct F0 control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation (2014) (6)
- The KIT-NAIST (contrastive) English ASR system for IWSLT 2012 (2012) (6)
- Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN (2021) (6)
- Accurate estimation of f0 and aperiodicity based on periodicity detector residuals and deviations of phase derivatives (2017) (6)
- A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows (2015) (6)
- Construction and analysis of social-affective interaction corpus in English and Indonesian (2015) (6)
- Discriminative Language Models as a Tool for Machine Translation Error Analysis (2014) (6)
- On the state definition for a trainable excitation model in HMM-based speech synthesis (2008) (6)
- A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training (2016) (6)
- Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing (2011) (6)
- Speech Emotion Recognition Based on Listener Adaptive Models (2021) (5)
- Noise Level Limited Sub-Modeling for Diffusion Probabilistic Vocoders (2021) (5)
- Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase response compensation (2014) (5)
- A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion (2021) (5)
- An Analysis Towards Dialogue-Based Deception Detection (2015) (5)
- Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework (2016) (5)
- A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and fo Estimation (2017) (5)
- Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU (2021) (5)
- Eigenvoice-Based Approach to Voice Conversion and Voice Quality Control (2009) (5)
- A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems (2020) (5)
- Improving Pivot Translation by Remembering the Pivot (2015) (5)
- Environmental sound processing and its applications (2019) (5)
- Improvements of the One-to-Many Eigenvoice Conversion System (2010) (5)
- Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models (2006) (5)
- Noisy-to-Noisy Voice Conversion Framework with Denoising Model (2021) (5)
- An Extended Mobile Manipulation Robot Learning Novel Objects (2012) (5)
- A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis (2009) (5)
- Investigations of Real-time Gaussian Fftnet and Parallel Wavenet Neural Vocoders with Simple Acoustic Features (2019) (5)
- Building an Effective Speech Corpus by Utilizing Statistical Multidimensional Scaling Method (2008) (5)
- Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics (2015) (5)
- Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models (2014) (5)
- Daily activity recognition based on recurrent neural network using multi-modal signals (2018) (5)
- Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion (2014) (5)
- An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering (2015) (5)
- Segment selection considering local degradation of naturalness in concatenative speech synthesis (2003) (5)
- NAIST at the CLEF 2013 QA4MRE Pilot Task (2013) (5)
- Implementation of low-latency electrolaryngeal speech enhancement based on multi-task CLDNN (2021) (5)
- Towards Identity Preserving Normal to Dysarthric Voice Conversion (2021) (5)
- HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network (2021) (4)
- Vowel Recognition Based on Surface Electromyography with Electrode Grid on Submental Region (2012) (4)
- The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 (2008) (4)
- Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds (2013) (4)
- Scene-dependent Anomalous Acoustic-event Detection Based on Conditional Wavenet and I-vector (2019) (4)
- Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics (2016) (4)
- Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds (2019) (4)
- Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion (2021) (4)
- Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation (2020) (4)
- Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees (2007) (4)
- Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation (2022) (4)
- The NAIST Simultaneous Translation Corpus (2018) (4)
- A Segment Selection Algorithm for Japanese Concatenative Speech Synthesis Based on Both Phoneme Unit and diphone Unit (2002) (4)
- Speech Parameter Generation Algorithm Considering Modulation Spectrum for Statistical Parametric Speech Synthesis (2015) (4)
- Acoustic modeling for spoken dialogue systems based on unsupervised utterance-based selective training (2006) (4)
- An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis (2012) (4)
- Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System (2014) (4)
- An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction (2015) (4)
- Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential (2015) (4)
- Evaluation of a singing voice conversion method based on many-to-many eigenvoice conversion (2013) (3)
- On Prosody Modeling for ASR+TTS Based Voice Conversion (2021) (3)
- Designing Japanese speech database covering wide range in prosody for hybrid speech synthesizer (2002) (3)
- Blind speech extraction for Non-Audible Murmur speech with speaker's movement noise (2012) (3)
- Speech emotion recognition based on listener-dependent emotion perception models (2021) (3)
- Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion (2019) (3)
- The NAIST machine translation system for IWSLT2012 (2012) (3)
- Anomalous Sound Detection Using a Binary Classification Model and Class Centroids (2021) (3)
- Narrow Adaptive Regularization of weights for grapheme-to-phoneme conversion (2014) (3)
- Low-Latency Real-Time Non-Parallel Voice Conversion based on Cyclic Variational Autoencoder and Multiband WaveRNN with Data-Driven Linear Prediction (2021) (3)
- Designing a Pneumatic Bionic Voice Prosthesis - A Statistical Approach for Source Excitation Generation (2018) (3)
- Modality and contextual differences in computer based non-verbal communication training (2013) (3)
- An improved one-to-many eigenvoice conversion system (2008) (3)
- An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets (2007) (3)
- Memorable spoken quote corpora of TED public speaking (2014) (3)
- "Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006 for the Blizzard Challenge 2006" (2006) (3)
- Cost Reduction of Acoustic Modeling for Real-Environment Applications Using Unsupervised and Selective Training (2008) (3)
- Adaptive selection from multiple response candidates in example-based dialogue (2015) (3)
- Audio-visual Voice Conversion Using Deep Canonical Correlation Analysis for Deep Bottleneck Features (2018) (3)
- Development of a Real-time Bionic Voice Generation System based on Statistical Excitation Prediction (2019) (3)
- Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model (2014) (3)
- Noise suppression method for body-conducted soft speech enhancement based on external noise monitoring (2016) (3)
- Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion (2014) (3)
- An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction (2014) (3)
- Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression (2020) (3)
- The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function (2015) (3)
- Evaluation of a Fully Automatic Cooperative Persuasive Dialogue System (2015) (3)
- Electrolaryngeal speech modification towards singing aid system for laryngectomees (2017) (3)
- Multi-Stream HiFi-GAN with Data-Driven Waveform Decomposition (2021) (3)
- Neural speech-rate conversion with multispeaker WaveNet vocoder (2022) (3)
- Voice Conversion Algorithm Based on Gaussian Mixture Model Applied to STRAIGHT (2000) (2)
- Evaluation of electrolarynx controlled by real-time statistical F0 prediction (2016) (2)
- Evaluation of eigenvoice conversion based on Gaussian mixture model (2006) (2)
- Model training using parallel data with mismatched pause positions in statistical esophageal speech enhancement (2012) (2)
- Rule-based Syntactic Preprocessing for Syntax-based Machine Translation (2014) (2)
- PAPER Special Section on Processing Natural Speech Variability for Improved Verbal Human-Computer Interaction Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models (2010) (2)
- Semi-Supervised Self-Produced Speech Enhancement and Suppression Based on Joint Source Modeling of Air- and Body-Conducted Signals Using Variational Autoencoder (2020) (2)
- Self-Produced Speech Enhancement and Suppression Method using Air- and Body-Conductive Microphones (2018) (2)
- Introduction to the Special Section on Voice Transformation (2010) (2)
- Spoofing and Anti-Spoofing (SAS) corpus v1.0 (2015) (2)
- Speaking-Aid Systems Based on One-to-Many Eigenvoice Conversion for Total Laryngectomees (2010) (2)
- An event-related brain potential study on the impact of speech recognition errors (2014) (2)
- PROSODY-CONTROLLABLE HMM-BASED SPEECH SYNTHESIS USING SPEECH INPUT (2015) (2)
- Speaker Adaptive Training for Voice Conversion based on Eigenvoice (2006) (2)
- Communicative speech synthesis with XIMERA: a first step (2007) (2)
- Statistical Voice Conversion with Quasi-Periodic WaveNet Vocoder (2019) (2)
- A Vibration Control Method of an Electrolarynx Based on Statistical F0 Pattern Prediction (2017) (2)
- Rapid unsupervised speaker adaptation using single utterance based on MLLR and speaker selection (2007) (2)
- Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE (2013) (2)
- The NICT Entry for the Blizzard Challenge 2009: an Enhanced HMM-based Speech Synthesis System with Trajectory Training Considering Global Variance and State-Dependent Mixed Excitation (2009) (2)
- Acoustic Compensation Method for Accepting Different Recording Devices in Body-Conducted Voice Conversion (2010) (2)
- Statistical F0 prediction for electrolaryngeal speech enhancement considering generative process of F0 contours within product of experts framework (2016) (2)
- High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling (2021) (2)
- An Investigation of Streaming Non-Autoregressive sequence-to-sequence Voice Conversion (2022) (2)
- X-Ray Structure Determination and NMR Characterization of Some Fused Heterocycles with a 1,3,5-Triazine-2,4(1H,3H)-dione Ring. Reaction of 2-Amino-4(3H)-pyrimidinone with Chloroformyl Isocyanate (1988) (2)
- Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language (2022) (2)
- Real-time vibration control of an electrolarynx based on statistical F0 contour prediction (2016) (2)
- Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus (2014) (2)
- Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric (2013) (2)
- Improved training of excitation for HMM-based parametric speech synthesis (2010) (2)
- An Investigation of Features for Fundamental Frequency Pattern Prediction in Electrolaryngeal Speech Enhancement (2019) (2)
- NICT Blizzard Challenge 2010 Entry (2010) (1)
- NOCOA: A Computer-Based Training Tool for Social and Communication Skills That Exploits Non-verbal Behaviors (2013) (1)
- Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder (2021) (1)
- Study on Word-Level Emphasis Across English and Japanese ∗ ☆ (2015) (1)
- A Study on the Speech Synthesis Method by Using Database with Variety of Speech-Rate (2002) (1)
- Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model (2016) (1)
- Comparison of real-time multi-speaker neural vocoders on CPUs (2022) (1)
- Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification (2004) (1)
- Semi-Supervised Enhancement and Suppression of Self-Produced Speech Using Correspondence between Air- and Body-Conducted Signals (2021) (1)
- Improvements of Voice Timbre Control Based on Perceived Age in Singing Voice Conversion (2016) (1)
- An evaluation of voice conversion with neural network spectral mapping models and WaveNet vocoder (2020) (1)
- A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion (2022) (1)
- Designing Target Cost Function Based on Prosody of Speech Database (2005) (1)
- Voice Conversion Challenge 2020 Listening Test Data (2020) (1)
- Computationally efficient body-conducted voice conversion with original excitation signals (2011) (1)
- Voice conversion for enhancing various types of body‐conducted speech detected with non‐audible murmur microphone. (2010) (1)
- Speech Enhancement Using Non-Negative Spectrogram Models with Mel-Generalized Cepstral Regularization (2017) (1)
- Improving Body Transmitted Unvoiced Spee (2006) (1)
- Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder (2020) (1)
- An Ensemble Approach to Anomalous Sound Detection Based on Conformer-Based Autoencoder and Binary Classifier Incorporated with Metric Learning (2021) (1)
- An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments (2014) (1)
- Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition (2022) (1)
- Note-level Automatic Guitar Transcription Using Attention Mechanism (2022) (1)
- An Evaluation through Simulation of Electrolarynx Control based on Statistical F 0 Prediction for Multiple Speakers (2014) (1)
- Ongaku Symposium 2014 : The 2nd Symposium on All Topics Related to Acoustics, Audition and Natural Language (2014) (1)
- Anaphora Resolution for Transforming Regular Expressions into Honorifics in Japanese (2014) (1)
- Investigation of Shallow Wavenet Vocoder with Laplacian Distribution Output (2019) (1)
- An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer (2016) (1)
- Improvements to HMM-based speech synthesis based on parameter generation with rich context models (2013) (1)
- An evaluation of EEG ocular artifact removal with a multi-channel wiener filter based on probabilistic generative model (2015) (1)
- An end-to-end model for cross-lingual transformation of paralinguistic information (2018) (1)
- Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior (2016) (1)
- A hearing impairment simulation method using audiogram-based approximation of auditory charatecteristics (2014) (1)
- Investigation of Text-to-Speech-based Synthetic Parallel Data for Sequence-to-Sequence Non-Parallel Voice Conversion (2021) (1)
- Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization (2017) (1)
- Two-Stage Training Method for Japanese Electrolaryngeal Speech Enhancement Based on Sequence-to-Sequence Voice Conversion (2022) (1)
- An evaluation of acoustic-to-articulatory inversion mapping with latent trajectory Gaussian mixture model (信号処理) (2016) (1)
- A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models (2016) (1)
- Non-verbal Communication Training with an Interactive Multimedia Application (2014) (1)
- Study on conversion-accuracy on speaker individuality of voice conversion algorithm with dynamic frequency warping (2001) (1)
- Improving quality of small body transmitted ordinary speech with statistical voice conversion (2006) (1)
- Convolutional bidirectional long short-term memory hidden Markov model hybrid system for polyphonic sound event detection (2016) (1)
- Stereo channel music signal separation based on non-negative tensor factorization with cepstrum regularization (2016) (1)
- Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices (2016) (1)
- Reducing Computation Time of the Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics (2007) (1)
- Physically Constrained Statistical F0 Prediction for Electrolaryngeal Speech Enhancement (2017) (1)
- Example Based Dialogue System Based on Satisfaction Prediction (2016) (1)
- Designing and evaluation of specch database with prosodic variety (2002) (1)
- Simple designing methods of corpus-based visual speech synthesis (2003) (1)
- The AS-NU System for the M2VoC Challenge (2021) (1)
- Low delay statistical singing voice conversion with direct waveform modification based on spectral differential considering global variance (2016) (1)
- Adaptive Approach to Varying Recording Conditions in Body Transmitted Voice Conversion Based on Acoustic Compensation (2009) (1)
- Excitation source design for high-quality speech manipulation systems based on a temporally static group delay representation of periodic signals (2014) (1)
- Connectionist Temporal Classification-based Sound Event Encoder for Converting Sound Events into Onomatopoeic Representations (2018) (1)
- Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling (2021) (1)
- Improving Singing Aid System for Laryngectomees With Statistical Voice Conversion and VAE-SPACE (2019) (1)
- Modulation spectrum-constrained trajectory training algorithm for HMM-based speech synthesis (2015) (1)
- Representation of Vocal Tract Length Transformation Based on Group Theory (2023) (0)
- A Joint Model for Pause Prediction and Dependency Parsing using Latent Variables The (2016) (0)
- Analysis of Noisy-target Training for DNN-based speech enhancement (2022) (0)
- Modified Sound Field Interpolation Method for Rotation-robust Beamforming with Unequally Spaced Circular Microphone Array (2022) (0)
- Analysis of Emphasis on Japanese-English Bilingual Corpora (2014) (0)
- Voice Conversion Challenge 2020 -- submitted waveforms v1.0.0 (2021) (0)
- Nonaudible murmur enhancement based on statistical voice conversion and noise suppression with external noise monitoring (2016) (0)
- Real-time Cepstrum Mean Normalization Using Codebook (2006) (0)
- Voice Conversion Based on Mixtu (2006) (0)
- Explorer Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis (2009) (0)
- Sequence-wise Optimization for Quasi-Harmonic Speech Waveform Modeling (2022) (0)
- Acoustic Modeling for Spoke Based on Unsupervised Utterance (2006) (0)
- Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000 (2020) (0)
- Reaction of Isocytosine with N-Chlorocarbonyl Isocyanate. (1986) (0)
- F 0 Contour Generation Using Rich Context Models in HMM-Based Speech Synthesis (2013) (0)
- An Evaluation of Articulatory Controllable Speech Modification based on Gaussian Mixture Models with Direct Waveform Modification (2015) (0)
- SUPERSEDED - The Voice Conversion Challenge 2016 (2016) (0)
- Paper Template for INTERSPEECH 2015 (2019) (0)
- Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder (2022) (0)
- Maximum Likelihood Voice Con with STRAIGHT Mix (2006) (0)
- Acoustic modeling of spontaneous speech of Japanese preschool children (2006) (0)
- The NAIST English speech recognition system for IWSLT 2013 (2013) (0)
- A Dialog System with Human-to-Human Conversation Example (2014) (0)
- Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices. (2016) (0)
- Voice Conversion Challenge 2020 database v1.0 (2020) (0)
- NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit (2022) (0)
- Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage (2022) (0)
- Mandarin Electro-Laryngeal Speech Enhancement based on Statistical Voice Conversion and Manual Tone Control (2021) (0)
- A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection (2015) (0)
- An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions (2022) (0)
- Phoneme Embeddings on Predicting Fundamental Frequency Pattern for Electrolaryngeal Speech (2020) (0)
- Explorer The Voice Conversion Challenge 2016 (2016) (0)
- Singing Fundamental Frequency Contour Generation Using Generalized Command-Response Model and Score-Conditional Variational Autoencoder (2021) (0)
- Bottleneck Features for Emotional Speech Recognition (2015) (0)
- The Network-based Multilingual ASR System Towards Multilingual Conversations in Medical Domain (2014) (0)
- An Evaluation of Discriminative Training for Hidden Markov Models in a Real-Environment Speech-Oriented Guidance System (2010) (0)
- E-Society Software Development Project for Speech Recognition and Synthesis(Technical Report) (2003) (0)
- Linear transformation approaches to many-to-one voice conversion (2010) (0)
- PREDICTION FOR ELECTROLARYNGEAL SPEECH ENHANCEMENT CONSIDERING GENERATIVE PROCESS OF F 0 CONTOURS WITHIN PRODUCT OF EXPERTS FRAMEWORK (2016) (0)
- Quality Improvement Approaches Based on the Modulation Spectrum to Statistical Parametric Speech Synthesis (2015) (0)
- Comparison of Effective Features and Analysis of Questions Towards Dialogue-based Deception Detection (2014) (0)
- Statistical approach to perceived age control of singing voice (2014) (0)
- Key files for Spoofing and Anti-Spoofing (SAS) corpus v1.0 (2017) (0)
- Statistical conversion of speech parameter trajectory for mapping between features of different modalities (2008) (0)
- A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System (2022) (0)
- Statistical voice conversion techniques for alaryngeal speech enhancement (2013) (0)
- Multiple‐prosody speech databases and their effectiveness in high‐quality speech synthesis at arbitrary rates (2005) (0)
- Speaking Aid System for Total Laryngect Body Transmitted Art (2006) (0)
- Error Selection Methods for Machine Translation Error Analysis (2016) (0)
- ON THE USE OF PHONETIC INFORお1ATION FOR恥1APPINGFROおf ARTICULATORYお10VEMENTS TO VOCAL TRACT SPECTRUM (2012) (0)
- Language Model Adaptation and Analysis for Individuality Transforming 水上 雅博 (2014) (0)
- Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment (2019) (0)
- An investigation of how to design control parameters for statistical voice timbre control (2017) (0)
- Interpretable Control for Emotional Text-to-Speech System toward Development of Sympathetic Educational-Support Robots (2022) (0)
- Probabilistic Enhancement of EEG Component Using Prior Distribution of Correlations Between Channels (2014) (0)
- VOICE CONVERSION FOR ,泊ruous T YPES OF BODY TRANS民自TTEDSPEECH (2012) (0)
- Learning Novel Objects for Extended Mobile Manipulation (2011) (0)
- Articulatory Controllable Speech Modification using Sequential Inversion and Production Mapping with Gaussian Mixture Models (音声) -- (第16回音声言語シンポジウム) (2014) (0)
- Word-level Emphasis Transfer in Speech-to-speech Translation | Article Information | J-GLOBAL (2016) (0)
- Unnecessary utterance detection for avoiding digressions in discussion (2014) (0)
- Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning (2022) (0)
- Unknown Word Detection Based on Event-Related Brain Desynchronization Responses (2015) (0)
- Low-Latency Electrolaryngeal Speech Enhancement Based on Fastspeech2-Based Voice Conversion and Self-Supervised Speech Representation (2023) (0)
- Proc. 2009 Asia-Pacific Signal and Information Processing Association (APSIPA) (2009) (0)
- Transcription cost reduction for Acoustic model construction by speech data selection based on acoustic likelihoods (2005) (0)
- Recursive neural network paraphrase identification for example-based dialog retrieval (2014) (0)
- A Dialog System to Detect Deception (2015) (0)
- UTTERANCE-BASED SELECTIVE TRAINING FOR COST-EFFECTIVE TASK-ADAPTATION OF ACOUSTIC MODELS (2006) (0)
- Recognition and Analysis of Emotion in Indonesian Conversational Speech (音声) -- (第16回音声言語シンポジウム) (2014) (0)
- Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization (2018) (0)
- Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure (2022) (0)
- Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion (2021) (0)
- Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder (2022) (0)
- Development of "KamiRepo" system with automatic student identification to handle handwritten assignments on LMS (2018) (0)
- Missing component restoration for masked speech signals based on time-domain spectrogram factorization (2017) (0)
- Direction-aware target speaker extraction with a dual-channel system based on conditional variational autoencoders under underdetermined conditions (2022) (0)
- Proposed Voice Conversion System with Quasi-periodic WaveNet Vocoder (2019) (0)
- Total LaryngectomeesLaryngeal speaker Tracheostoma Expired air Nasal cavity Vocal folds Oral cavity Trachea Expired air Esophagus (2011) (0)
- ICSLP 2006 Summary -Acoustic Modeling and Speech Synthesis- (2006) (0)
- English-Read-By-Japanese Speech Synthesis Preserving Speaker Individuality Based on Partial Correction of Prosody and Phonetic Sounds and Effects of English Proficiency Level on Its Performance (2015) (0)
- The Voice Conversion Challenge, 2016: multidimensional scaling (MDS) listening test results (2016) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Tomoki Toda?
Tomoki Toda is affiliated with the following schools: