Technical Program

Voice Conversion and TTS

Session Type: Poster
Time: Friday, December 21, 10:00 - 12:00
Location: Kallirhoe Hall
 
STARGAN-VC: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING STAR GENERATIVE ADVERSARIAL NETWORKS
         Hirokazu Kameoka; NTT Corporation
         Takuhiro Kaneko; NTT Corporation
         Kou Tanaka; NTT Corporation
         Nobukatsu Hojo; NTT Corporation
 
RHYTHM-FLEXIBLE VOICE CONVERSION WITHOUT PARALLEL DATA USING CYCLE-GAN OVER PHONEME POSTERIORGRAM SEQUENCES
         Cheng-chieh Yeh; National Taiwan University
         Po-chun Hsu; National Taiwan University
         Ju-chieh Chou; National Taiwan University
         Hung-yi Lee; National Taiwan University
         Lin-shan Lee; National Taiwan University
 
ADAPTIVE WAVENET VOCODER FOR RESIDUAL COMPENSATION IN GAN-BASED VOICE CONVERSION
         Berrak Sisman; National University of Singapore
         Mingyang Zhang; National University of Singapore
         Sakriani Sakti; Nara Institute of Science and Technology
         Haizhou Li; National University of Singapore
         Satoshi Nakamura; Nara Institute of Science and Technology
 
NEURAL TTS VOICE CONVERSION
         Zvi Kons; IBM Research
         Slava Shechtman; IBM Research
         Alex Sorin; IBM Research
         Ron Hoory; IBM Research
         Carmel Rabinovitz; IBM Research
         Edmilson Da Silva Morais; IBM Research
 
AN EVALUATION OF DEEP SPECTRAL MAPPINGS AND WAVENET VOCODER FOR VOICE CONVERSION
         Patrick Lumban Tobing; Nagoya University
         Tomoki Hayashi; Nagoya University
         Yi-Chiao Wu; Nagoya University
         Kazuhiro Kobayashi; Nagoya University
         Tomoki Toda; Nagoya University
 
IMPROVING FFTNET VOCODER WITH NOISE SHAPING AND SUBBAND APPROACHES
         Takuma Okamoto; National Institute of Information and Communications Technology
         Tomoki Toda; Nagoya University
         Yoshinori Shiga; National Institute of Information and Communications Technology
         Hisashi Kawai; National Institute of Information and Communications Technology
 
COMPARING PROSODIC FRAMEWORKS: INVESTIGATING THE ACOUSTIC-SYMBOLIC RELATIONSHIP IN TOBI AND RAP
         Raul Fernandez; IBM Research
         Andrew Rosenberg; IBM Research
 
DATA SELECTION FOR IMPROVING NATURALNESS OF TTS VOICES TRAINED ON SMALL FOUND CORPUSES
         Fang-Yu Kuo; ObEN, Inc.
         Sandesh Aryal; ObEN, Inc.
         Gilles Degottex; ObEN, Inc.
         Sam Kang; ObEN, Inc.
         Pierre Lanchantin; ObEN, Inc.
         Iris Ouyang; ObEN, Inc.
 
COMPREHENSIVE EVALUATION OF STATISTICAL SPEECH WAVEFORM SYNTHESIS
         Thomas Merritt; Amazon
         Bartosz Putrycz; Amazon
         Adam Nadolski; Amazon
         Tianjun Ye; Amazon
         Daniel Korzekwa; Amazon
         Wiktor Dolecki; Amazon
         Thomas Drugman; Amazon
         Viacheslav Klimkov; Amazon
         Alexis Moinet; Amazon
         Andrew Breen; Amazon
         Rafal Kuklinski; Amazon
         Nikko Strom; Amazon
         Roberto Barra-Chicote; Amazon
 
EXAMPLAR-BASED SPEECH WAVEFORM GENERATION FOR TEXT-TO-SPEECH
         Cassia Valentini-Botinhao; University of Edinburgh
         Oliver Watts; University of Edinburgh
         Felipe Espic; University of Edinburgh
         Simon King; University of Edinburgh
 
AN ICELANDIC PRONUNCIATION DICTIONARY FOR TTS
         Anna Björk Nikulásdóttir; Reykjavik University
         Jón Guðnason; Reykjavik University
         Eiríkur Rögnvaldsson; University of Iceland
 
MOS NATURALNESS AND THE QUEST FOR HUMAN-LIKE SPEECH
         Sajad Shirali-Shahreza; University of Toronto
         Gerald Penn; University of Toronto