SPE-27.3
DIRECT NOISY SPEECH MODELING FOR NOISY-TO-NOISY VOICE CONVERSION
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda, Nagoya University, Japan
Session:
Voice Conversion I
Track:
Speech and Language Processing
Location:
Gather Area D
Presentation Time:
Mon, 9 May, 22:00 - 22:45 China Time (UTC +8)
Mon, 9 May, 14:00 - 14:45 UTC
Session Chair:
Jan Skoglund, Google
Session SPE-27
SPE-27.1: TOWARD DEGRADATION-ROBUST VOICE CONVERSION
Chien-yu Huang, Kai-Wei Chang, Hung-yi Lee, National Taiwan University, Taiwan
SPE-27.2: TEXT-FREE NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING NORMALISING FLOWS
Thomas Merritt, Abdelhamid Ezzerg, Piotr Bilinski, Kamil Pokora, Roberto Barra-Chicote, Daniel Korzekwa, Amazon, United Kingdom of Great Britain and Northern Ireland; Magdalena Proszewska, Jagiellonian University, Poland
SPE-27.3: DIRECT NOISY SPEECH MODELING FOR NOISY-TO-NOISY VOICE CONVERSION
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda, Nagoya University, Japan
SPE-27.4: ONE-SHOT VOICE CONVERSION FOR STYLE TRANSFER BASED ON SPEAKER ADAPTATION
Zhichao Wang, Qicong Xie, Tao Li, Hongqiang Du, Lei Xie, Northwestern Polytechnical University, China; Pengcheng Zhu, Mengxiao Bi, Fuxi AI Lab, NetEase Inc., China
SPE-27.5: CROSS-SPEAKER STYLE TRANSFER FOR TEXT-TO-SPEECH USING DATA AUGMENTATION
Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Goeric Huybrechts, Adam Gabryś, Jaime Lorenzo-Trueba, Amazon, Poland
SPE-27.6: AN INVESTIGATION OF STREAMING NON-AUTOREGRESSIVE SEQUENCE-TO-SEQUENCE VOICE CONVERSION
Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Nagoya University, Japan