SS_18_P: Advances in Audio-Visual Captioning |
Session Type: Poster |
Time: Thursday, September 1, 11:00 - 12:40 |
Location: Poster Area 3 |
Session Chair: Yoann Altmann, Heriot-Watt University
|
|
SS_18_P.1: AUXILIARY CLASSIFIER BASED RESIDUAL RNN FOR IMAGE CAPTIONING |
Özkan Çaylı, Volkan Kılıç, Aytuğ Onan, Izmir Katip Çelebi University, Turkey; Wenwu Wang, University of Surrey, United Kingdom |
|
SS_18_P.2: CLOTHO-AQA: A CROWDSOURCED DATASET FOR AUDIO QUESTION ANSWERING |
Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen, Tampere University, Finland |
|
SS_18_P.3: LEVERAGING PRE-TRAINED BERT FOR AUDIO CAPTIONING |
Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Wenwu Wang, University of Surrey, United Kingdom; Volkan Kılıç, Izmir Katip Celebi University, Turkey |
|
SS_18_P.4: AUTOMATED IMAGE CAPTIONING WITH MULTI-LAYER GATED RECURRENT UNIT |
Özge Taylan Moral, Volkan Kılıç, Aytuğ Onan, izmir katip celebi university, Turkey; Wenwu Wang, university of surrey, United Kingdom |
|