Paper ID | SPE-30.4 |
Paper Title |
HOW TO MAKE TEXT-TO-SPEECH SYSTEM PRONOUNCE “VOLDEMORT”: AN EXPERIMENTAL APPROACH OF FOREIGN WORD PHONEMIZATION IN VIETNAMESE |
Authors |
Dang-Khoa Mac, Van-Huy Nguyen, Dinh-Nghi Nguyen, Kim-Anh Nguyen, Vingroup Big Data Institute, Vietnam |
Session | SPE-30: Speech Processing 2: General Topics |
Location | Gather.Town |
Session Time: | Wednesday, 09 June, 16:30 - 17:15 |
Presentation Time: | Wednesday, 09 June, 16:30 - 17:15 |
Presentation |
Poster
|
Topic |
Speech Processing: [SPE-SYNT] Speech Synthesis and Generation |
IEEE Xplore Open Preview |
Click here to view in IEEE Xplore |
Virtual Presentation |
Click here to watch in the Virtual Conference |
Abstract |
Generating foreign words is one of the hardest tasks for any speech synthesis systems. This work deal with this problem in the case of Vietnamese, a low-resourced language, following an experimental approach. Base on a deep analysis of the usage of foreign words in Vietnamese, various types of pronunciation dictionaries for foreign words was proposed including rule-based phonemization, word-to-syllables mapping, and cross-lingual phone-to-phone mapping. These dictionaries were then used to train different types of grapheme-to-phoneme (G2P) converters. The perceptual evaluation of the Vietnamese synthesized speech confirms that the output of the proposed method can compare favorably with the pronunciation by the human on the unseen foreign words. |