2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

SPE-4: Speech Synthesis 2: Controllability

Session Type: Poster
Time: Tuesday, 8 June, 13:00 - 13:45
Location: Gather.Town
Session Chair: Yu Zhang, Google
 
   SPE-4.1: PARALLEL TACOTRON: NON-AUTOREGRESSIVE AND CONTROLLABLE TTS
         Isaac Elias; Google
         Heiga Zen; Google
         Jonathan Shen; Google
         Yu Zhang; Google
         Ye Jia; Google
         Ron Weiss; Google
         Yonghui Wu; Google
 
   SPE-4.2: FCL-TACO2: TOWARDS FAST, CONTROLLABLE AND LIGHTWEIGHT TEXT-TO-SPEECH SYNTHESIS
         Disong Wang; The Chinese University of Hong Kong
         Liqun Deng; Huawei Noah's Ark Lab
         Yang Zhang; Huawei Noah's Ark Lab
         Nianzu Zheng; Huawei Noah's Ark Lab
         Yu Ting Yeung; Huawei Noah's Ark Lab
         Xiao Chen; Huawei Noah's Ark Lab
         Xunying Liu; The Chinese University of Hong Kong
         Helen Meng; The Chinese University of Hong Kong
 
   SPE-4.3: PROSODIC CLUSTERING FOR PHONEME-LEVEL PROSODY CONTROL IN END-TO-END SPEECH SYNTHESIS
         Alexandra Vioni; Innoetics, Samsung Electronics
         Myrsini Christidou; Innoetics, Samsung Electronics
         Nikolaos Ellinas; Innoetics, Samsung Electronics
         Georgios Vamvoukakis; Innoetics, Samsung Electronics
         Panos Kakoulidis; Innoetics, Samsung Electronics
         Taehoon Kim; Mobile Communications Business, Samsung Electronics
         June Sig Sung; Mobile Communications Business, Samsung Electronics
         Hyoungmin Park; Mobile Communications Business, Samsung Electronics
         Aimilios Chalamandaris; Innoetics, Samsung Electronics
         Pirros Tsiakoulis; Innoetics, Samsung Electronics
 
   SPE-4.4: IMPROVING NATURALNESS AND CONTROLLABILITY OF SEQUENCE-TO-SEQUENCE SPEECH SYNTHESIS BY LEARNING LOCAL PROSODY REPRESENTATIONS
         Cheng Gong; Tianjin University
         Longbiao Wang; Tianjin University
         Zhenhua Ling; University of Science and Technology of China
         Shaotong Guo; Tianjin University
         Ju Zhang; Huiyan Technology (Tianjin) Co., Ltd
         Jianwu Dang; Japan Advanced Institute of Science and Technology
 
   SPE-4.5: MULTI-SPEAKER EMOTIONAL SPEECH SYNTHESIS WITH FINE-GRAINED PROSODY MODELING
         Chunhui Lu; Samsung Research China-Beijing
         Xue Wen; Samsung Research China-Beijing
         Ruolan Liu; Samsung Research China-Beijing
         Xiao Chen; Samsung Research China-Beijing
 
   SPE-4.6: EMOTION CONTROLLABLE SPEECH SYNTHESIS USING EMOTION-UNLABELED DATASET WITH THE ASSISTANCE OF CROSS-DOMAIN SPEECH EMOTION RECOGNITION
         Xiong Cai; Tsinghua University
         Dongyang Dai; Tsinghua University
         Zhiyong Wu; Tsinghua University
         Xiang Li; Tsinghua University
         Jingbei Li; Tsinghua University
         Helen Meng; The Chinese University of Hong Kong