IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
CHAL-3.4

CROSS-CHANNEL ATTENTION-BASED TARGET SPEAKER VOICE ACTIVITY DETECTION: EXPERIMENTAL RESULTS FOR M2MET CHALLENGE

Weiqing Wang, Duke University, United States of America; Xiaoyi Qin, Ming Li, Duke Kunshan University, China

Session:
Multi-Channel Multi-Party Meeting Transcription

Track:
Grand Challenge

Location:
Gather Area C

Presentation Time:
Sat, 7 May, 21:00 - 21:45 China Time (UTC +8)
Sat, 7 May, 13:00 - 13:45 UTC

Session Chair:
Lei Xie, Northwestern Polytechnical University
Presentation
Discussion
Resources
Session CHAL-3
CHAL-3.1: SUMMARY ON THE ICASSP 2022 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION GRAND CHALLENGE
Fan Yu, Shiliang Zhang, Zhihao Du, Siqi Zheng, Weilong Huang, Zhijie Yan, Bin Ma, Alibaba, China; Pengcheng Guo, Yihui Fu, Lei Xie, AISHELL Foundation, China; Zheng-Hua Tan, Aalborg University, Denmark; DeLiang Wang, The Ohio State University, United States of America; Yanmin Qian, Shanghai Jiao Tong University,, China; Kong Aik Lee, Institute for Infocomm Research, A*STAR, Singapore; Xin Xu, Hui Bu, Beijing Shell Shell Technology Co., Ltd., China
CHAL-3.2: THE CUHK-TENCENT SPEAKER DIARIZATION SYSTEM FOR THE ICASSP 2022 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE
Naijun Zheng, Xixin Wu, Lingwei Meng, Jiawen Kang, Helen Meng, The Chinese University of Hong Kong, China; Na Li, Chao Weng, Dan Su, Tencent AI Lab, China; Haibin Wu, National Taiwan University, China
CHAL-3.3: THE USTC-XIMALAYA SYSTEM FOR THE ICASSP 2022 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION (M2MET) CHALLENGE
Maokui He, Xiaoqi Zhang, Yuxuan Wang, Shutong Niu, Jun Du, University of Science and Technology of China, China; Xiang Lv, Weilin Zhou, Jingjing Yin, Yuhang Cao, Heng Lu, Ximalaya Inc., China; Chin-Hui Lee, Georgia Institute of Technology, United States of America
CHAL-3.4: CROSS-CHANNEL ATTENTION-BASED TARGET SPEAKER VOICE ACTIVITY DETECTION: EXPERIMENTAL RESULTS FOR M2MET CHALLENGE
Weiqing Wang, Duke University, United States of America; Xiaoyi Qin, Ming Li, Duke Kunshan University, China
CHAL-3.5: THE VOLCSPEECH SYSTEM FOR THE ICASSP 2022 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE
Chen Shen, Yi Liu, Wenzhi Fan, Bin Wang, Shixue Wen, Yao Tian, Jun Zhang, Jingsheng Yang, Zejun Ma, bytedance, China
CHAL-3.6: THE ROYALFLUSH SYSTEM OF SPEECH RECOGNITION FOR M2MET CHALLENGE
Shuaishuai Ye, Peiyao Wang, Shunfei Chen, Xinhui Hu, Xinkang Xu, Hithink RoyalFlush Information Network Co.,Ltd., China