Voice Conversion and TTS |
Session Type: Poster |
Time: Friday, December 21, 10:00 - 12:00 |
Location: Kallirhoe Hall |
STARGAN-VC: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING STAR GENERATIVE ADVERSARIAL NETWORKS |
Hirokazu Kameoka; NTT Corporation |
Takuhiro Kaneko; NTT Corporation |
Kou Tanaka; NTT Corporation |
Nobukatsu Hojo; NTT Corporation |
RHYTHM-FLEXIBLE VOICE CONVERSION WITHOUT PARALLEL DATA USING CYCLE-GAN OVER PHONEME POSTERIORGRAM SEQUENCES |
Cheng-chieh Yeh; National Taiwan University |
Po-chun Hsu; National Taiwan University |
Ju-chieh Chou; National Taiwan University |
Hung-yi Lee; National Taiwan University |
Lin-shan Lee; National Taiwan University |
ADAPTIVE WAVENET VOCODER FOR RESIDUAL COMPENSATION IN GAN-BASED VOICE CONVERSION |
Berrak Sisman; National University of Singapore |
Mingyang Zhang; National University of Singapore |
Sakriani Sakti; Nara Institute of Science and Technology |
Haizhou Li; National University of Singapore |
Satoshi Nakamura; Nara Institute of Science and Technology |
NEURAL TTS VOICE CONVERSION |
Zvi Kons; IBM Research |
Slava Shechtman; IBM Research |
Alex Sorin; IBM Research |
Ron Hoory; IBM Research |
Carmel Rabinovitz; IBM Research |
Edmilson Da Silva Morais; IBM Research |
AN EVALUATION OF DEEP SPECTRAL MAPPINGS AND WAVENET VOCODER FOR VOICE CONVERSION |
Patrick Lumban Tobing; Nagoya University |
Tomoki Hayashi; Nagoya University |
Yi-Chiao Wu; Nagoya University |
Kazuhiro Kobayashi; Nagoya University |
Tomoki Toda; Nagoya University |
IMPROVING FFTNET VOCODER WITH NOISE SHAPING AND SUBBAND APPROACHES |
Takuma Okamoto; National Institute of Information and Communications Technology |
Tomoki Toda; Nagoya University |
Yoshinori Shiga; National Institute of Information and Communications Technology |
Hisashi Kawai; National Institute of Information and Communications Technology |
COMPARING PROSODIC FRAMEWORKS: INVESTIGATING THE ACOUSTIC-SYMBOLIC RELATIONSHIP IN TOBI AND RAP |
Raul Fernandez; IBM Research |
Andrew Rosenberg; IBM Research |
DATA SELECTION FOR IMPROVING NATURALNESS OF TTS VOICES TRAINED ON SMALL FOUND CORPUSES |
Fang-Yu Kuo; ObEN, Inc. |
Sandesh Aryal; ObEN, Inc. |
Gilles Degottex; ObEN, Inc. |
Sam Kang; ObEN, Inc. |
Pierre Lanchantin; ObEN, Inc. |
Iris Ouyang; ObEN, Inc. |
COMPREHENSIVE EVALUATION OF STATISTICAL SPEECH WAVEFORM SYNTHESIS |
Thomas Merritt; Amazon |
Bartosz Putrycz; Amazon |
Adam Nadolski; Amazon |
Tianjun Ye; Amazon |
Daniel Korzekwa; Amazon |
Wiktor Dolecki; Amazon |
Thomas Drugman; Amazon |
Viacheslav Klimkov; Amazon |
Alexis Moinet; Amazon |
Andrew Breen; Amazon |
Rafal Kuklinski; Amazon |
Nikko Strom; Amazon |
Roberto Barra-Chicote; Amazon |
EXAMPLAR-BASED SPEECH WAVEFORM GENERATION FOR TEXT-TO-SPEECH |
Cassia Valentini-Botinhao; University of Edinburgh |
Oliver Watts; University of Edinburgh |
Felipe Espic; University of Edinburgh |
Simon King; University of Edinburgh |
AN ICELANDIC PRONUNCIATION DICTIONARY FOR TTS |
Anna Björk Nikulásdóttir; Reykjavik University |
Jón Guðnason; Reykjavik University |
Eiríkur Rögnvaldsson; University of Iceland |
MOS NATURALNESS AND THE QUEST FOR HUMAN-LIKE SPEECH |
Sajad Shirali-Shahreza; University of Toronto |
Gerald Penn; University of Toronto |