CHAL-6.5
THE FIRST MULTIMODAL INFORMATION BASED SPEECH PROCESSING (MISP) CHALLENGE: DATA, TASKS, BASELINES AND RESULTS
Hang Chen, Hengshun Zhou, Jun Du, University of Science and Technology of China, China; Chin-Hui Lee, Georgia Institute of Technology, United States of America; Jingdong Chen, Northwestern Polytechnical University, China; Shinji Watanabe, Carnegie Mellon University, United States of America; Sabato Marco Siniscalchi, Kore University of Enna, Italy; Odette Scharenborg, Delft University of Technology, Netherlands; Di-Yuan Liu, Bao-Cai Yin, Jia Pan, Jian-Qing Gao, Cong Liu, iFlytek, China
Session:
Multimodal Information Based Speech Processing
Track:
Grand Challenge
Location:
Gather Area C
Presentation Time:
Sat, 7 May, 22:00 - 22:45 China Time (UTC +8)
Sat, 7 May, 14:00 - 14:45 UTC
Session Co-Chairs:
Jun Du, University of Science and Technology of China; Sabato Marco Siniscalchi, Kore University of Enna
Session CHAL-6
CHAL-6.1: AUDIO-VISUAL WAKE WORD SPOTTING SYSTEM FOR MISP CHALLENGE 2021
Yanguang Xu, Jianwei Sun, Yang Han, Shuaijiang Zhao, Chaoyang Mei, Tingwei Guo, Shuran Zhou, Chuandong Xie, Wei Zou, Xiangang Li, Beike, China
CHAL-6.2: CHANNEL-WISE AV-FUSION ATTENTION FOR MULTI-CHANNEL AUDIO-VISUAL SPEECH RECOGNITION
Gaopeng Xu, Song Yang, Wei Li, Sang Wang, Wei Guo, Junfeng Yuan, Jie Gao, NIO Co., Ltd., China
CHAL-6.3: THE DKU AUDIO-VISUAL WAKE WORD SPOTTING SYSTEM FOR THE 2021 MISP CHALLENGE
Ming Cheng, Haoxu Wang, Ming Li, Wuhan University, China; Yechen Wang, Duke Kunshan University, China
CHAL-6.4: THE SJTU SYSTEM FOR MULTIMODAL INFORMATION BASED SPEECH PROCESSING CHALLENGE 2021
Wei Wang, Xun Gong, Yifei Wu, Zhikai Zhou, Chenda Li, Wangyou Zhang, Bing Han, Yanmin Qian, Shanghai Jiao Tong University, China
CHAL-6.5: THE FIRST MULTIMODAL INFORMATION BASED SPEECH PROCESSING (MISP) CHALLENGE: DATA, TASKS, BASELINES AND RESULTS
Hang Chen, Hengshun Zhou, Jun Du, University of Science and Technology of China, China; Chin-Hui Lee, Georgia Institute of Technology, United States of America; Jingdong Chen, Northwestern Polytechnical University, China; Shinji Watanabe, Carnegie Mellon University, United States of America; Sabato Marco Siniscalchi, Kore University of Enna, Italy; Odette Scharenborg, Delft University of Technology, Netherlands; Di-Yuan Liu, Bao-Cai Yin, Jia Pan, Jian-Qing Gao, Cong Liu, iFlytek, China