Detection, Paralinguistics and Coding |
Session Type: Poster |
Time: Wednesday, December 19, 13:30 - 15:30 |
Location: Kallirhoe Hall |
EXPLORING END-TO-END ATTENTION-BASED NEURAL NETWORKS FOR NATIVE LANGUAGE IDENTIFICATION |
Rutuja Ubale; Educational Testing Service Research |
Yao Qian; Educational Testing Service Research |
Keelan Evanini; Educational Testing Service Research |
ANALYSING THE PREDICTIONS OF A CNN-BASED REPLAY SPOOFING DETECTION SYSTEM |
Bhusan Chettri; Queen Mary University of London |
Saumitra Mishra; Queen Mary University of London |
Bob L. Sturm; KTH Royal Institute of Engineering |
Emmanouil Benetos; Queen Mary University of London |
IMPROVED CONDITIONAL GENERATIVE ADVERSARIAL NET CLASSIFICATION FOR SPOKEN LANGUAGE RECOGNITION |
Xiaoxiao Miao; The University of Kent / Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences |
Ian McLoughlin; The University of Kent |
Shengyu Yao; Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences |
Yonghong Yan; Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences / Xinjiang Key Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences |
UNSUPERVISED REPRESENTATION LEARNING OF SPEECH FOR DIALECT IDENTIFICATION |
Suwon Shon; Massachusetts Institute of Technology |
Wei-Ning Hsu; Massachusetts Institute of Technology |
James Glass; Massachusetts Institute of Technology |
MULTIMODAL SPEECH EMOTION RECOGNITION USING AUDIO AND TEXT |
Seunghyun Yoon; Seoul National University |
Seokhyun Byun; Seoul National University |
Kyomin Jung; Seoul National University |
POSTERIOR CALIBRATION FOR MULTI-CLASS PARALINGUISTIC CLASSIFICATION |
Gábor Gosztolya; MTA-SZTE Research Group on Artificial Intelligence |
Róbert Busa-Fekete; Yahoo Research Inc. |
CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION |
Gaetan Ramet; Ecole Polytechnique Federale de Lausanne |
Philip N. Garner; Idiap Research Institute |
Michael Baeriswyl; Swisscom |
Alexandros Lazaridis; Swisscom |
AN EXPERIMENTAL STUDY ON AUDIO REPLAY ATTACK DETECTION USING DEEP NEURAL NETWORKS |
Bekir Bakar; Bursa Technical University |
Cemal Hanilci; Bursa Technical University |
LSTM-BASED WHISPER DETECTION |
Zeynab Raeesy; Amazon |
Kellen Gillespie; Amazon |
Chengyuan Ma; Amazon |
Thomas Drugman; Amazon |
Jiacheng Gu; Amazon |
Roland Maas; Amazon |
Ariya Rastrow; Amazon |
Björn Hoffmeister; Amazon |
AMERICAN SIGN LANGUAGE FINGERSPELLING RECOGNITION IN THE WILD |
Bowen Shi; Toyota Technological Institute at Chicago |
Aurora Martinez Del Rio; University of Chicago |
Jonathan Keane; University of Chicago |
Jonathan Michaux; Toyota Technological Institute at Chicago |
Diane Brentari; University of Chicago |
Greg Shakhnarovich; Toyota Technological Institute at Chicago |
Karen Livescu; Toyota Technological Institute at Chicago |
WAVENET-BASED ZERO-DELAY LOSSLESS SPEECH CODING |
Takenori Yoshimura; Nagoya Institute of Technology |
Kei Hashimoto; Nagoya Institute of Technology |
Keiichiro Oura; Nagoya Institute of Technology |
Yoshihiko Nankaku; Nagoya Institute of Technology |
Keiichi Tokuda; Nagoya Institute of Technology |
IMPROVING GENERALIZATION OF VOCAL TRACT FEATURE RECONSTRUCTION: FROM AUGMENTED ACOUSTIC INVERSION TO ARTICULATORY FEATURE RECONSTRUCTION WITHOUT ARTICULATORY DATA |
Rosanna Turrisi; Istituto Italiano di Tecnologia |
Raffaele Tavarone; Istituto Italiano di Tecnologia |
Leonardo Badino; Istituto Italiano di Tecnologia |
A DEEP LEARNING APPROACH FOR DATA DRIVEN VOCAL TRACT AREA FUNCTION ESTIMATION |
Sasan Asadiabadi; Koc university |
Engin Erzin; Koc university |