Technical Program

SP-P10: Speech Synthesis and Voice Conversion

Session Type: Poster
Time: Thursday, March 9, 08:30 - 10:30
Location: Churchill: Poster Area A
Session Chair: Raul Fernandez, IBM T.J. Watson Research Center
 
SP-P10.1: DURATION PREDICTION USING MULTIPLE GAUSSIAN PROCESS EXPERTS FOR GPR-BASED SPEECH SYNTHESIS
         Decha Moungsri; Tokyo Institute of Technology
         Tomoki Koriyama; Tokyo Institute of Technology
         Takao Kobayashi; Tokyo Institute of Technology
 
SP-P10.2: COMBINING UNIDIRECTIONAL LONG SHORT-TERM MEMORY WITH CONVOLUTIONAL OUTPUT LAYER FOR HIGH-PERFORMANCE SPEECH SYNTHESIS
         Wenfu Wang; Institute of Automation, Chinese Academy of Sciences
         Bo Xu; Institute of Automation, Chinese Academy of Sciences
 
SP-P10.3: LOMBARD SPEECH SYNTHESIS USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS
         Bajibabu Bollepalli; Aalto University
         Manu Airaksinen; Aalto University
         Paavo Alku; Aalto University
 
SP-P10.4: MULTI-TASK LEARNING OF STRUCTURED OUTPUT LAYER BIDIRECTIONAL LSTMS FOR SPEECH SYNTHESIS
         Runnan Li; Tsinghua University
         Zhiyong Wu; Tsinghua University
         Xunying Liu; The Chinese University of Hong Kong
         Helen Meng; The Chinese University of Hong Kong
         Lianhong Cai; Tsinghua University
 
SP-P10.5: QUALITY ASSESSMENT OF VOICE CONVERTED SPEECH USING ARTICULATORY FEATURES
         Avni Rajpal; Dhirubhai Ambani Institute of Information and Communication Technology
         Nirmesh Shah; Dhirubhai Ambani Institute of Information and Communication Technology
         Mohammadi Zaki; Indian Institute of Science
         Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology
 
SP-P10.6: NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION
         Nirmesh Shah; Dhirubhai Ambani Institute of Information and Communication Technology
         Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology
 
SP-P10.7: EXEMPLAR SELECTION METHODS IN VOICE CONVERSION
         Guanlong Zhao; Texas A&M University
         Ricardo Gutierrez-Osuna; Texas A&M University
 
SP-P10.8: VOICE-TRANSFORMATION-BASED DATA AUGMENTATION FOR PROSODIC CLASSIFICATION
         Raul Fernandez; IBM
         Andrew Rosenberg; IBM
         Alexander Sorin; IBM Haifa Research Lab
         Bhuvana Ramabhadran; IBM
         Ron Hoory; IBM Haifa Research Lab
 
SP-P10.9: NON-PARALLEL VOICE CONVERSION USING I-VECTOR PLDA: TOWARDS UNIFYING SPEAKER VERIFICATION AND TRANSFORMATION
         Tomi Kinnunen; University of Eastern Finland
         Lauri Juvela; Aalto University
         Paavo Alku; Aalto University
         Junichi Yamagishi; National Institute of Informatics
 
SP-P10.10: A STUDY OF SPEAKER VERIFICATION PERFORMANCE WITH EXPRESSIVE SPEECH
         Srinivas Parthasarathy; The University of Texas at Dallas
         Chunlei Zhang; The University of Texas at Dallas
         John H.L. Hansen; The University of Texas at Dallas
         Carlos Busso; The University of Texas at Dallas