Voice Conversion and TTS |
| Session Type: Poster |
| Time: Friday, December 21, 10:00 - 12:00 |
| Location: Kallirhoe Hall |
| STARGAN-VC: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING STAR GENERATIVE ADVERSARIAL NETWORKS |
| Hirokazu Kameoka; NTT Corporation |
| Takuhiro Kaneko; NTT Corporation |
| Kou Tanaka; NTT Corporation |
| Nobukatsu Hojo; NTT Corporation |
| RHYTHM-FLEXIBLE VOICE CONVERSION WITHOUT PARALLEL DATA USING CYCLE-GAN OVER PHONEME POSTERIORGRAM SEQUENCES |
| Cheng-chieh Yeh; National Taiwan University |
| Po-chun Hsu; National Taiwan University |
| Ju-chieh Chou; National Taiwan University |
| Hung-yi Lee; National Taiwan University |
| Lin-shan Lee; National Taiwan University |
| ADAPTIVE WAVENET VOCODER FOR RESIDUAL COMPENSATION IN GAN-BASED VOICE CONVERSION |
| Berrak Sisman; National University of Singapore |
| Mingyang Zhang; National University of Singapore |
| Sakriani Sakti; Nara Institute of Science and Technology |
| Haizhou Li; National University of Singapore |
| Satoshi Nakamura; Nara Institute of Science and Technology |
| NEURAL TTS VOICE CONVERSION |
| Zvi Kons; IBM Research |
| Slava Shechtman; IBM Research |
| Alex Sorin; IBM Research |
| Ron Hoory; IBM Research |
| Carmel Rabinovitz; IBM Research |
| Edmilson Da Silva Morais; IBM Research |
| AN EVALUATION OF DEEP SPECTRAL MAPPINGS AND WAVENET VOCODER FOR VOICE CONVERSION |
| Patrick Lumban Tobing; Nagoya University |
| Tomoki Hayashi; Nagoya University |
| Yi-Chiao Wu; Nagoya University |
| Kazuhiro Kobayashi; Nagoya University |
| Tomoki Toda; Nagoya University |
| IMPROVING FFTNET VOCODER WITH NOISE SHAPING AND SUBBAND APPROACHES |
| Takuma Okamoto; National Institute of Information and Communications Technology |
| Tomoki Toda; Nagoya University |
| Yoshinori Shiga; National Institute of Information and Communications Technology |
| Hisashi Kawai; National Institute of Information and Communications Technology |
| COMPARING PROSODIC FRAMEWORKS: INVESTIGATING THE ACOUSTIC-SYMBOLIC RELATIONSHIP IN TOBI AND RAP |
| Raul Fernandez; IBM Research |
| Andrew Rosenberg; IBM Research |
| DATA SELECTION FOR IMPROVING NATURALNESS OF TTS VOICES TRAINED ON SMALL FOUND CORPUSES |
| Fang-Yu Kuo; ObEN, Inc. |
| Sandesh Aryal; ObEN, Inc. |
| Gilles Degottex; ObEN, Inc. |
| Sam Kang; ObEN, Inc. |
| Pierre Lanchantin; ObEN, Inc. |
| Iris Ouyang; ObEN, Inc. |
| COMPREHENSIVE EVALUATION OF STATISTICAL SPEECH WAVEFORM SYNTHESIS |
| Thomas Merritt; Amazon |
| Bartosz Putrycz; Amazon |
| Adam Nadolski; Amazon |
| Tianjun Ye; Amazon |
| Daniel Korzekwa; Amazon |
| Wiktor Dolecki; Amazon |
| Thomas Drugman; Amazon |
| Viacheslav Klimkov; Amazon |
| Alexis Moinet; Amazon |
| Andrew Breen; Amazon |
| Rafal Kuklinski; Amazon |
| Nikko Strom; Amazon |
| Roberto Barra-Chicote; Amazon |
| EXAMPLAR-BASED SPEECH WAVEFORM GENERATION FOR TEXT-TO-SPEECH |
| Cassia Valentini-Botinhao; University of Edinburgh |
| Oliver Watts; University of Edinburgh |
| Felipe Espic; University of Edinburgh |
| Simon King; University of Edinburgh |
| AN ICELANDIC PRONUNCIATION DICTIONARY FOR TTS |
| Anna Björk Nikulásdóttir; Reykjavik University |
| Jón Guðnason; Reykjavik University |
| Eiríkur Rögnvaldsson; University of Iceland |
| MOS NATURALNESS AND THE QUEST FOR HUMAN-LIKE SPEECH |
| Sajad Shirali-Shahreza; University of Toronto |
| Gerald Penn; University of Toronto |