Technical Program

Detection, Paralinguistics and Coding

Session Type: Poster

Time: Wednesday, December 19, 13:30 - 15:30

Location: Kallirhoe Hall

EXPLORING END-TO-END ATTENTION-BASED NEURAL NETWORKS FOR NATIVE LANGUAGE IDENTIFICATION

Rutuja Ubale; Educational Testing Service Research

Yao Qian; Educational Testing Service Research

Keelan Evanini; Educational Testing Service Research

ANALYSING THE PREDICTIONS OF A CNN-BASED REPLAY SPOOFING DETECTION SYSTEM

Bhusan Chettri; Queen Mary University of London

Saumitra Mishra; Queen Mary University of London

Bob L. Sturm; KTH Royal Institute of Engineering

Emmanouil Benetos; Queen Mary University of London

IMPROVED CONDITIONAL GENERATIVE ADVERSARIAL NET CLASSIFICATION FOR SPOKEN LANGUAGE RECOGNITION

Xiaoxiao Miao; The University of Kent / Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences

Ian McLoughlin; The University of Kent

Shengyu Yao; Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences

Yonghong Yan; Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences / Xinjiang Key Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences

UNSUPERVISED REPRESENTATION LEARNING OF SPEECH FOR DIALECT IDENTIFICATION

Suwon Shon; Massachusetts Institute of Technology

Wei-Ning Hsu; Massachusetts Institute of Technology

James Glass; Massachusetts Institute of Technology

MULTIMODAL SPEECH EMOTION RECOGNITION USING AUDIO AND TEXT

Seunghyun Yoon; Seoul National University

Seokhyun Byun; Seoul National University

Kyomin Jung; Seoul National University

POSTERIOR CALIBRATION FOR MULTI-CLASS PARALINGUISTIC CLASSIFICATION

Gábor Gosztolya; MTA-SZTE Research Group on Artificial Intelligence

Róbert Busa-Fekete; Yahoo Research Inc.

CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION