IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
AUD-9: Detection and Classification of Acoustic Scenes and Events III: Losses and Training
Mon, 9 May, 21:00 - 21:45 China Time (UTC +8)
Mon, 9 May, 13:00 - 13:45 UTC
Location: Gather Area K
Session Chair: Romain Serizel, LORIA
Track: Audio and Acoustic Signal Processing

AUD-9.1: TIME-BALANCED FOCAL LOSS FOR AUDIO EVENT DETECTION

Sangwook Park, Mounya Elhilali, Johns Hopkins University, United States of America

AUD-9.2: MULTI-ACCDOA: LOCALIZING AND DETECTING OVERLAPPING SOUNDS FROM THE SAME CLASS WITH AUXILIARY DUPLICATING PERMUTATION INVARIANT TRAINING

Kazuki Shimada, Yuichiro Koyama, Shusuke Takahashi, Naoya Takahashi, Emiru Tsunoo, Yuki Mitsufuji, Sony Group Corporation, Japan

AUD-9.3: IMPROVED REPRESENTATION LEARNING FOR ACOUSTIC EVENT CLASSIFICATION USING TREE-STRUCTURED ONTOLOGY

Arman Zharmagambetov, University of California, Merced, United States of America; Qingming Tang, Chieh-Chi Kao, Qin Zhang, Ming Sun, Viktor Rozgic, Jasha Droppo, Chao Wang, Amazon, Alexa, United States of America

AUD-9.4: TEMPORAL CONTRASTIVE-LOSS FOR AUDIO EVENT DETECTION

Sandeep Kothinti, Mounya Elhilali, Johns Hopkins University, United States of America

AUD-9.5: A FRAME LOSS OF MULTIPLE INSTANCE LEARNING FOR WEAKLY SUPERVISED SOUND EVENT DETECTION

Xu Wang, Xiangjinzi Zhang, Shengwu Xiong, School of Computer and Artificial Intelligence, Wuhan University of Technology, Wuhan, China, China; Yunfei Zi, ,

AUD-9.6: PSEUDO STRONG LABELS FOR LARGE SCALE WEAKLY SUPERVISED AUDIO TAGGING

Heinrich Dinkel, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Yujun Wang, Xiaomi Corporation, China