IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
ST-11: CITISEN: A Mobile Application for Deep Learning-Based Speech Enhancement
Wed, 11 May, 23:00 - 23:45 China Time (UTC +8)
Wed, 11 May, 15:00 - 15:45 UTC
Location: Gather Area P
Virtual
Gather.Town
Show & Tell
Presented by: (1) Yu Tsao, Research Center for Information Technology Innovation at Academia Sinica, Taiwan (2) Yu-Wen Chen, Research Center for Information Technology Innovation at Academia Sinica, Taiwan (3) Kuo-Hsuan Hung, Research Center for Information Technology Innovation at Academia Sinica, Taiwan (4) Kai-Chun Liu, Research Center for Information Technology Innovation at Academia Sinica, Taiwan

We present a deep learning-based speech enhancment mobile application, named CITISEN. The CITISEN can perform three functions: speech enhancement (SE), model adaptation (MA), and background noise conversion (BNC). For SE, pretrained SE models can be installed into CITISEN in order to reduce noise components from instant or saved recordings. The MA function finetunes the pretrained SE models to attain imporved SE performance. The BNC first removes the original background noise from the input utterances and then mixes the processed utterances with new background noise. In the demo dession, we will show how to install pretrained SE models to CITISEN and who to use CITISEN as a platform for utilizing and evaluating SE models and flexibly extend the models to address various noise environments and users.