Technical Program

Paper Detail

Presentation #	11
Session:	Natural Language Processing
Location:	Kallirhoe Hall
Session Time:	Thursday, December 20, 13:30 - 15:30
Presentation Time:	Thursday, December 20, 13:30 - 15:30
Presentation:	Poster
Topic:	Multimodal processing
Paper Title:	SENTIMENT CLASSIFICATION ON ERRONEOUS ASR TRANSCRIPTS: A MULTI VIEW LEARNING APPROACH
Authors:	Sri Harsha Dumpala, Imran Sheikh, Rupayan Chakraborty, Sunil Kumar Kopparapu; TCS Research and Innovation-Mumbai, India
Abstract:	Sentiment classification on spoken language transcriptions has received relatively little attention. A practical system that uses the spoken language modality must rely on a transcription from an Automatic Speech Recognition (ASR) engine, which is inherently prone to errors. This paper focuses on improving sentiment classification on erroneous ASR transcriptions. Our aim is to improve the representations of the ASR transcripts using manual transcripts and other modalities, such as audio and visual, that are available during training. We adopt an approach based on Deep Canonical Correlation Analysis (DCCA) and propose two new extensions of DCCA to enhance the ASR view using multiple modalities. We present a detailed evaluation of our approach on datasets of opinion videos (CMU-MOSI and CMU-MOSEI) collected from YouTube.