IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

AUD-33: Extended Evaluation and Captioning
Fri, 13 May, 21:00 - 21:45 China Time (UTC +8)
Fri, 13 May, 13:00 - 13:45 UTC
Location: Gather Area K
Session Chair: Emanuël Habets, University of Erlangen-Nuremberg
Track: Audio and Acoustic Signal Processing

AUD-33.1: DIVERSITY-CONTROLLABLE AND ACCURATE AUDIO CAPTIONING BASED ON NEURAL CONDITION

Xuenan Xu, Mengyue Wu, Kai Yu, Shanghai Jiao Tong University, China

AUD-33.2: AUDIOCLIP: EXTENDING CLIP TO IMAGE, TEXT AND AUDIO

Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel, Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, Germany

AUD-33.3: CAN AUDIO CAPTIONS BE EVALUATED WITH IMAGE CAPTION METRICS?

Zelin Zhou, Zhiling Zhang, Xuenan Xu, Zeyu Xie, Mengyue Wu, Kenny Zhu, Shanghai Jiao Tong University, China

AUD-33.4: A DATA-DRIVEN COGNITIVE SALIENCE MODEL FOR OBJECTIVE PERCEPTUAL AUDIO QUALITY ASSESSMENT

Pablo M. Delgado, Jürgen Herre, International Audio Laboratories Erlangen, Germany

AUD-33.6: EFFECT OF NOISE SUPPRESSION LOSSES ON SPEECH DISTORTION AND ASR PERFORMANCE

Sebastian Braun, Hannes Gamper, Microsoft, United States of America