Technical Program
SP-P10: Speech Synthesis and Voice Conversion |
Session Type: Poster |
Time: Thursday, March 9, 08:30 - 10:30 |
Location: Churchill: Poster Area A |
Session Chair: Raul Fernandez, IBM T.J. Watson Research Center |
SP-P10.1: DURATION PREDICTION USING MULTIPLE GAUSSIAN PROCESS EXPERTS FOR GPR-BASED SPEECH SYNTHESIS |
Decha Moungsri; Tokyo Institute of Technology |
Tomoki Koriyama; Tokyo Institute of Technology |
Takao Kobayashi; Tokyo Institute of Technology |
SP-P10.2: COMBINING UNIDIRECTIONAL LONG SHORT-TERM MEMORY WITH CONVOLUTIONAL OUTPUT LAYER FOR HIGH-PERFORMANCE SPEECH SYNTHESIS |
Wenfu Wang; Institute of Automation, Chinese Academy of Sciences |
Bo Xu; Institute of Automation, Chinese Academy of Sciences |
SP-P10.3: LOMBARD SPEECH SYNTHESIS USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS |
Bajibabu Bollepalli; Aalto University |
Manu Airaksinen; Aalto University |
Paavo Alku; Aalto University |
SP-P10.4: MULTI-TASK LEARNING OF STRUCTURED OUTPUT LAYER BIDIRECTIONAL LSTMS FOR SPEECH SYNTHESIS |
Runnan Li; Tsinghua University |
Zhiyong Wu; Tsinghua University |
Xunying Liu; The Chinese University of Hong Kong |
Helen Meng; The Chinese University of Hong Kong |
Lianhong Cai; Tsinghua University |
SP-P10.5: QUALITY ASSESSMENT OF VOICE CONVERTED SPEECH USING ARTICULATORY FEATURES |
Avni Rajpal; Dhirubhai Ambani Institute of Information and Communication Technology |
Nirmesh Shah; Dhirubhai Ambani Institute of Information and Communication Technology |
Mohammadi Zaki; Indian Institute of Science |
Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology |
SP-P10.6: NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION |
Nirmesh Shah; Dhirubhai Ambani Institute of Information and Communication Technology |
Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology |
SP-P10.7: EXEMPLAR SELECTION METHODS IN VOICE CONVERSION |
Guanlong Zhao; Texas A&M University |
Ricardo Gutierrez-Osuna; Texas A&M University |
SP-P10.8: VOICE-TRANSFORMATION-BASED DATA AUGMENTATION FOR PROSODIC CLASSIFICATION |
Raul Fernandez; IBM |
Andrew Rosenberg; IBM |
Alexander Sorin; IBM Haifa Research Lab |
Bhuvana Ramabhadran; IBM |
Ron Hoory; IBM Haifa Research Lab |
SP-P10.9: NON-PARALLEL VOICE CONVERSION USING I-VECTOR PLDA: TOWARDS UNIFYING SPEAKER VERIFICATION AND TRANSFORMATION |
Tomi Kinnunen; University of Eastern Finland |
Lauri Juvela; Aalto University |
Paavo Alku; Aalto University |
Junichi Yamagishi; National Institute of Informatics |
SP-P10.10: A STUDY OF SPEAKER VERIFICATION PERFORMANCE WITH EXPRESSIVE SPEECH |
Srinivas Parthasarathy; The University of Texas at Dallas |
Chunlei Zhang; The University of Texas at Dallas |
John H.L. Hansen; The University of Texas at Dallas |
Carlos Busso; The University of Texas at Dallas |