SPE-47.6
SPEECH EMOTION RECOGNITION WITH CO-ATTENTION BASED MULTI-LEVEL ACOUSTIC INFORMATION
Heqing Zou, Chen Chen, Deepu Rajan, Eng Siong Chng, Nanyang Technological University, Singapore; Yuke Si, Tianjin University, China
Session:
Emotion Recognition: General Topics I
Track:
Speech and Language Processing
Location:
Gather Area D
Presentation Time:
Tue, 10 May, 23:00 - 23:45 China Time (UTC +8)
Tue, 10 May, 15:00 - 15:45 UTC
Tue, 10 May, 15:00 - 15:45 UTC
Session Chair:
Viktor Rozgic, Amazon
Session SPE-47
SPE-47.1: SPEAKER NORMALIZATION FOR SELF-SUPERVISED SPEECH EMOTION RECOGNITION
Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory, IBM Research, Israel
SPE-47.2: Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition
Ayoub Ghriss, University of Colorado, Boulder, United States of America; Bo Yang, Viktor Rozgic, Wang Chao, Amazon LLC, United States of America; Elizabeth Shriberg, Ellipsis Health, United States of America
SPE-47.3: CONFIDENCE ESTIMATION FOR SPEECH EMOTION RECOGNITION BASED ON THE RELATIONSHIP BETWEEN EMOTION CATEGORIES AND PRIMITIVES
Yang Li, University of Rochester, United States of America; Constantinos Papayiannis, Viktor Rozgic, Elizabeth Shriberg, Chao Wang, Amazon, United States of America
SPE-47.4: AuxFormer: Robust Approach to Audiovisual Emotion Recognition
Lucas Goncalves, Carlos Busso, The University of Texas at Dallas, United States of America
SPE-47.5: FUSING ASR OUTPUTS IN JOINT TRAINING FOR SPEECH EMOTION RECOGNITION
Yuanchao Li, Peter Bell, Catherine Lai, University of Edinburgh, United Kingdom of Great Britain and Northern Ireland
SPE-47.6: SPEECH EMOTION RECOGNITION WITH CO-ATTENTION BASED MULTI-LEVEL ACOUSTIC INFORMATION
Heqing Zou, Chen Chen, Deepu Rajan, Eng Siong Chng, Nanyang Technological University, Singapore; Yuke Si, Tianjin University, China