SPE-7.1
WAVEBENDER GAN: AN ARCHITECTURE FOR PHONETICALLY MEANINGFUL SPEECH MANIPULATION
Gustavo Teodoro Döhler Beck, Ulme Wennberg, Zofia Malisz, Gustav Eje Henter, KTH Royal Institute of Technology, Sweden
Session:
Speech Synthesis: General Topics II
Track:
Speech and Language Processing
Location:
Gather Area D
Presentation Time:
Sun, 8 May, 21:00 - 21:45 China Time (UTC +8)
Sun, 8 May, 13:00 - 13:45 UTC
Sun, 8 May, 13:00 - 13:45 UTC
Session Chair:
Lei He, Microsoft
Session SPE-7
SPE-7.1: WAVEBENDER GAN: AN ARCHITECTURE FOR PHONETICALLY MEANINGFUL SPEECH MANIPULATION
Gustavo Teodoro Döhler Beck, Ulme Wennberg, Zofia Malisz, Gustav Eje Henter, KTH Royal Institute of Technology, Sweden
SPE-7.2: FRE-GAN 2: FAST AND EFFICIENT FREQUENCY-CONSISTENT AUDIO SYNTHESIS
Sang-Hoon Lee, Ji-Hoon Kim, Kang-Eun Lee, Seong-Whan Lee, Korea University, Korea, Republic of
SPE-7.3: R-G2P: EVALUATING AND ENHANCING ROBUSTNESS OF GRAPHEME TO PHONEME CONVERSION BY CONTROLLED NOISE INTRODUCING AND CONTEXTUAL INFORMATION INCORPORATION
Chendong Zhao, Haoqian Wang, The Shenzhen International Graduate School, Tsinghua University, China, China; Jianzong Wang, Xiaoyang Qu, Jing Xiao, Ping An Technology (Shenzhen) Co., Ltd., China
SPE-7.4: NEURAL GRAPHEME-TO-PHONEME CONVERSION WITH PRE-TRAINED GRAPHEME MODELS
Lu Dong, Zhi-Qiang Guo, Chao-Hong Tan, Zhen-Hua Ling, National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, P. R. China, China; Ya-Jun Hu, Yuan Jiang, iFLYTEK Research, iFLYTEK Co., Ltd., Hefei, P. R. China, China
SPE-7.5: ISTFTNET: FAST AND LIGHTWEIGHT MEL-SPECTROGRAM VOCODER INCORPORATING INVERSE SHORT-TIME FOURIER TRANSFORM
Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki, NTT Corporation, Japan
SPE-7.6: ACOUSTIC APPLICATION OF PHASE RECONSTRUCTION ALGORITHMS IN OPTICS
Tomoki Kobayashi, Tomoro Tanaka, Kohei Yatabe, Yasuhiro Oikawa, Waseda University, Japan