SPE-31.3
MULTI-LINGUAL MULTI-TASK SPEECH EMOTION RECOGNITION USING WAV2VEC 2.0
Mayank Sharma, Amazon, India
Session:
Emotion Recognition: Neural Architecture
Track:
Speech and Language Processing
Location:
Gather Area D
Presentation Time:
Mon, 9 May, 23:00 - 23:45 China Time (UTC +8)
Mon, 9 May, 15:00 - 15:45 UTC
Session Chair:
Sriram Ganapathy, Indian Institute of Science (IISc), Bangalore
Session SPE-31
SPE-31.1: KEY-SPARSE TRANSFORMER FOR MULTIMODAL SPEECH EMOTION RECOGNITION
Weidong Chen, Xiaofeng Xing, Xiangmin Xu, Jichen Yang, South China University of Technology, China; Jianxin Pang, UBTECH Robotics Corp, China
SPE-31.2: Neural Architecture Search for Speech Emotion Recognition
Xixin Wu, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng, The Chinese University of Hong Kong, Hong Kong
SPE-31.3: MULTI-LINGUAL MULTI-TASK SPEECH EMOTION RECOGNITION USING WAV2VEC 2.0
Mayank Sharma, Amazon, India
SPE-31.4: Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Arya Aftab, Shahrokh Ghaemmaghami, Sharif University of Technology, Iran (Islamic Republic of); Alireza Morsali, Benoit Champagne, McGill University, Canada
SPE-31.5: Multimodal Transformer With Learnable Frontend and Self Attention for Emotion Recognition
Soumya Dutta, Sriram Ganapathy, LEAP lab, Indian Institute of Science, Bangalore, India
SPE-31.6: SPEECH EMOTION RECOGNITION USING SELF-SUPERVISED FEATURES
Edmilson Morais, Ron Hoory, Weizhong Zhu, Itai Gat, Matheus Damasceno, Hagai Aronowitz, IBM Research, Brazil