| Detection, Paralinguistics and Coding |
| Session Type: Poster |
| Time: Wednesday, December 19, 13:30 - 15:30 |
| Location: Kallirhoe Hall |
| EXPLORING END-TO-END ATTENTION-BASED NEURAL NETWORKS FOR NATIVE LANGUAGE IDENTIFICATION |
| Rutuja Ubale; Educational Testing Service Research |
| Yao Qian; Educational Testing Service Research |
| Keelan Evanini; Educational Testing Service Research |
| ANALYSING THE PREDICTIONS OF A CNN-BASED REPLAY SPOOFING DETECTION SYSTEM |
| Bhusan Chettri; Queen Mary University of London |
| Saumitra Mishra; Queen Mary University of London |
| Bob L. Sturm; KTH Royal Institute of Technology |
| Emmanouil Benetos; Queen Mary University of London |
| IMPROVED CONDITIONAL GENERATIVE ADVERSARIAL NET CLASSIFICATION FOR SPOKEN LANGUAGE RECOGNITION |
| Xiaoxiao Miao; The University of Kent / Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences |
| Ian McLoughlin; The University of Kent |
| Shengyu Yao; Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences |
| Yonghong Yan; Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics / University of Chinese Academy of Sciences / Xinjiang Key Laboratory of Minority Speech and Language Information Processing, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences |
| UNSUPERVISED REPRESENTATION LEARNING OF SPEECH FOR DIALECT IDENTIFICATION |
| Suwon Shon; Massachusetts Institute of Technology |
| Wei-Ning Hsu; Massachusetts Institute of Technology |
| James Glass; Massachusetts Institute of Technology |
| MULTIMODAL SPEECH EMOTION RECOGNITION USING AUDIO AND TEXT |
| Seunghyun Yoon; Seoul National University |
| Seokhyun Byun; Seoul National University |
| Kyomin Jung; Seoul National University |
| POSTERIOR CALIBRATION FOR MULTI-CLASS PARALINGUISTIC CLASSIFICATION |
| Gábor Gosztolya; MTA-SZTE Research Group on Artificial Intelligence |
| Róbert Busa-Fekete; Yahoo Research Inc. |
| CONTEXT-AWARE ATTENTION MECHANISM FOR SPEECH EMOTION RECOGNITION |
| Gaetan Ramet; École Polytechnique Fédérale de Lausanne |
| Philip N. Garner; Idiap Research Institute |
| Michael Baeriswyl; Swisscom |
| Alexandros Lazaridis; Swisscom |
| AN EXPERIMENTAL STUDY ON AUDIO REPLAY ATTACK DETECTION USING DEEP NEURAL NETWORKS |
| Bekir Bakar; Bursa Technical University |
| Cemal Hanilci; Bursa Technical University |
| LSTM-BASED WHISPER DETECTION |
| Zeynab Raeesy; Amazon |
| Kellen Gillespie; Amazon |
| Chengyuan Ma; Amazon |
| Thomas Drugman; Amazon |
| Jiacheng Gu; Amazon |
| Roland Maas; Amazon |
| Ariya Rastrow; Amazon |
| Björn Hoffmeister; Amazon |
| AMERICAN SIGN LANGUAGE FINGERSPELLING RECOGNITION IN THE WILD |
| Bowen Shi; Toyota Technological Institute at Chicago |
| Aurora Martinez Del Rio; University of Chicago |
| Jonathan Keane; University of Chicago |
| Jonathan Michaux; Toyota Technological Institute at Chicago |
| Diane Brentari; University of Chicago |
| Greg Shakhnarovich; Toyota Technological Institute at Chicago |
| Karen Livescu; Toyota Technological Institute at Chicago |
| WAVENET-BASED ZERO-DELAY LOSSLESS SPEECH CODING |
| Takenori Yoshimura; Nagoya Institute of Technology |
| Kei Hashimoto; Nagoya Institute of Technology |
| Keiichiro Oura; Nagoya Institute of Technology |
| Yoshihiko Nankaku; Nagoya Institute of Technology |
| Keiichi Tokuda; Nagoya Institute of Technology |
| IMPROVING GENERALIZATION OF VOCAL TRACT FEATURE RECONSTRUCTION: FROM AUGMENTED ACOUSTIC INVERSION TO ARTICULATORY FEATURE RECONSTRUCTION WITHOUT ARTICULATORY DATA |
| Rosanna Turrisi; Istituto Italiano di Tecnologia |
| Raffaele Tavarone; Istituto Italiano di Tecnologia |
| Leonardo Badino; Istituto Italiano di Tecnologia |
| A DEEP LEARNING APPROACH FOR DATA DRIVEN VOCAL TRACT AREA FUNCTION ESTIMATION |
| Sasan Asadiabadi; Koç University |
| Engin Erzin; Koç University |