Technical Program
SP-P10: Speech Synthesis and Voice Conversion
| Session Type: Poster
| Time: Thursday, March 9, 08:30 - 10:30
| Location: Churchill: Poster Area A
| Session Chair: Raul Fernandez, IBM T.J. Watson Research Center
| SP-P10.1: DURATION PREDICTION USING MULTIPLE GAUSSIAN PROCESS EXPERTS FOR GPR-BASED SPEECH SYNTHESIS |
| Decha Moungsri; Tokyo Institute of Technology |
| Tomoki Koriyama; Tokyo Institute of Technology |
| Takao Kobayashi; Tokyo Institute of Technology |
| SP-P10.2: COMBINING UNIDIRECTIONAL LONG SHORT-TERM MEMORY WITH CONVOLUTIONAL OUTPUT LAYER FOR HIGH-PERFORMANCE SPEECH SYNTHESIS |
| Wenfu Wang; Institute of Automation, Chinese Academy of Sciences |
| Bo Xu; Institute of Automation, Chinese Academy of Sciences |
| SP-P10.3: LOMBARD SPEECH SYNTHESIS USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS |
| Bajibabu Bollepalli; Aalto University |
| Manu Airaksinen; Aalto University |
| Paavo Alku; Aalto University |
| SP-P10.4: MULTI-TASK LEARNING OF STRUCTURED OUTPUT LAYER BIDIRECTIONAL LSTMS FOR SPEECH SYNTHESIS |
| Runnan Li; Tsinghua University |
| Zhiyong Wu; Tsinghua University |
| Xunying Liu; The Chinese University of Hong Kong |
| Helen Meng; The Chinese University of Hong Kong |
| Lianhong Cai; Tsinghua University |
| SP-P10.5: QUALITY ASSESSMENT OF VOICE CONVERTED SPEECH USING ARTICULATORY FEATURES |
| Avni Rajpal; Dhirubhai Ambani Institute of Information and Communication Technology |
| Nirmesh Shah; Dhirubhai Ambani Institute of Information and Communication Technology |
| Mohammadi Zaki; Indian Institute of Science |
| Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology |
| SP-P10.6: NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION |
| Nirmesh Shah; Dhirubhai Ambani Institute of Information and Communication Technology |
| Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology |
| SP-P10.7: EXEMPLAR SELECTION METHODS IN VOICE CONVERSION |
| Guanlong Zhao; Texas A&M University |
| Ricardo Gutierrez-Osuna; Texas A&M University |
| SP-P10.8: VOICE-TRANSFORMATION-BASED DATA AUGMENTATION FOR PROSODIC CLASSIFICATION |
| Raul Fernandez; IBM |
| Andrew Rosenberg; IBM |
| Alexander Sorin; IBM Haifa Research Lab |
| Bhuvana Ramabhadran; IBM |
| Ron Hoory; IBM Haifa Research Lab |
| SP-P10.9: NON-PARALLEL VOICE CONVERSION USING I-VECTOR PLDA: TOWARDS UNIFYING SPEAKER VERIFICATION AND TRANSFORMATION |
| Tomi Kinnunen; University of Eastern Finland |
| Lauri Juvela; Aalto University |
| Paavo Alku; Aalto University |
| Junichi Yamagishi; National Institute of Informatics |
| SP-P10.10: A STUDY OF SPEAKER VERIFICATION PERFORMANCE WITH EXPRESSIVE SPEECH |
| Srinivas Parthasarathy; The University of Texas at Dallas |
| Chunlei Zhang; The University of Texas at Dallas |
| John H.L. Hansen; The University of Texas at Dallas |
| Carlos Busso; The University of Texas at Dallas |