IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
SPE-L9: Speech Processing Techniques II
Tue, 24 May, 13:00 - 15:30 China Time (UTC +8)
Tue, 24 May, 05:00 - 07:30 UTC
Location: Peony Junior Ballroom 4511
Session Co-Chairs: Rohan Kumar Das, Fortemedia Singapore Pte. Ltd. and Rong Tong, Singapore Institute of Technology
Track: Speech and Language Processing

SPE-L9.1: NOISE-ROBUST SPEECH RECOGNITION WITH 10 MINUTES UNPARALLELED IN-DOMAIN DATA

Chen Chen, Nana Hou, Yuchen Hu, Eng Siong Chng, Nanyang Technological University, Singapore; Shashank Shirol, Manipal Institute of Technology, India

SPE-L9.2: FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan, University of California Los Angeles, United States of America

SPE-L9.3: DISTRIBUTION AUGMENTATION FOR LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH

Mateusz Lajszczak, Animesh Prasad, Arent van Korlaar, Bajibabu Bollepalli, Antonio Bonafonte, Arnaud Joly, Marco Nicolis, Alexis Moinet, Thomas Drugman, Trevor Wood, Elena Sokolova, Amazon, United Kingdom of Great Britain and Northern Ireland

SPE-L9.4: DEEPFILTERNET: A LOW COMPLEXITY SPEECH ENHANCEMENT FRAMEWORK FOR FULL-BAND AUDIO BASED ON DEEP FILTERING

Hendrik Schröter, Andreas Maier, Friedrich-Alexander University Erlangen-Nuremberg (FAU), Germany; Alberto N. Escalante-B., Tobias Rosenkranz, WS Audiology, Germany

SPE-L9.5: Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index

Andreas Jonas Fuglsig, Lars Søndergaard Bertelsen, Peter Mariager, RTX A/S, Denmark; Jan Østergaard, Jesper Jensen, Zheng-Hua Tan, Aalborg University, Denmark

SPE-L9.6: INVESTIGATION OF ROBUSTNESS OF HUBERT FEATURES FROM DIFFERENT LAYERS TO DOMAIN, ACCENT AND LANGUAGE VARIATIONS

Pratik Kumar, Vrunda N. Sukhadia, Srinivasan Umesh, Indian Institute of Technology Madras, India

SPE-L9.7: ADAPTIVE DISCOUNTING OF IMPLICIT LANGUAGE MODELS IN RNN-TRANSDUCERS

Vinit Unni, Preethi Jyothi, Sunita Sarawagi, Indian Institute of Technology Bombay, India; Shreya Khare, Ashish Mittal, Samarth Bharadwaj, IBM Research, India

SPE-L9.8: CURRICULUM OPTIMIZATION FOR LOW-RESOURCE SPEECH RECOGNITION

Anastasia Kuznetsova, Francis Tyers, Indiana University Bloomington, United States of America; Anurag Kumar, Jennifer Drexler Fox, Rev.com, United States of America

SPE-L9.9: GENERALIZATION ABILITY OF MOS PREDICTION NETWORKS

Erica Cooper, Junichi Yamagishi, National Institute of Informatics, Japan; Wen-Chin Huang, Tomoki Toda, Nagoya University, Japan