IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

SPE-L5: Speech Recognition I
Thu, 26 May, 13:00 - 15:00 China Time (UTC +8)
Thu, 26 May, 05:00 - 07:00 UTC
Location: Simpor Junior Ballroom 4811-3
Session Co-Chairs: Sriram Ganapathy, Indian Institute of Science; Yangyang Shi, Facebook
Track: Speech and Language Processing

SPE-L5.1: End-to-End Speech Recognition from Federated Acoustic Models

Yan Gao, Pedro P. B. de Gusmao, Nicholas D. Lane, University of Cambridge, United Kingdom of Great Britain and Northern Ireland; Titouan Parcollet, Avignon University, France; Salah Zaiem, Telecom Paris, France; Javier Fernandez-Marques, University of Oxford, United Kingdom of Great Britain and Northern Ireland; Daniel J. Beutel, Adap GmbH, Germany, and University of Cambridge, United Kingdom of Great Britain and Northern Ireland

SPE-L5.2: Supervised Attention in Sequence-to-Sequence Models for Speech Recognition

Gene-Ping Yang, Hao Tang, University of Edinburgh, United Kingdom of Great Britain and Northern Ireland

SPE-L5.3: Improving Factored Hybrid HMM Acoustic Modeling without State Tying

Tina Raissi, Ralf Schlüter, RWTH Aachen University, Germany; Eugen Beck, Apptek GmbH, Germany; Hermann Ney, RWTH Aachen University, Germany

SPE-L5.4: End-to-End Speech Recognition with Joint Dereverberation of Sub-Band Autoregressive Envelopes

Rohit Kumar, Anurenjan Purushothaman, Sriram Ganapathy, Indian Institute of Science, Bangalore, India; Anirudh Sreeram, University of Southern California, United States of America

SPE-L5.5: A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communication

Iuliia Nigmatulina, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo, Petr Motlicek, Idiap Research Institute, Switzerland

SPE-L5.6: Unsupervised Data Selection for Speech Recognition with Contrastive Loss Ratios

Chanho Park, Rehan Ahmad, Thomas Hain, The University of Sheffield, United Kingdom of Great Britain and Northern Ireland

SPE-L5.7: Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer, Facebook AI, United States of America

SPE-L5.8: Run-and-Back Stitch Search: Novel Block Synchronous Decoding for Streaming Encoder-Decoder ASR

Emiru Tsunoo, Michael Hentschel, Yosuke Kashiwagi, Sony Group Corporation, Japan; Chaitanya Narisetty, Shinji Watanabe, Carnegie Mellon University, United States of America