SS_18_P: Advances in Audio-Visual Captioning |
| Session Type: Poster |
| Time: Thursday, September 1, 11:00 - 12:40 |
| Location: Poster Area 3 |
| Session Chair: Yoann Altmann, Heriot-Watt University |
| SS_18_P.1: AUXILIARY CLASSIFIER BASED RESIDUAL RNN FOR IMAGE CAPTIONING |
| Özkan Çaylı, Volkan Kılıç, Aytuğ Onan, Izmir Katip Çelebi University, Turkey; Wenwu Wang, University of Surrey, United Kingdom |
| SS_18_P.2: CLOTHO-AQA: A CROWDSOURCED DATASET FOR AUDIO QUESTION ANSWERING |
| Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen, Tampere University, Finland |
| SS_18_P.3: LEVERAGING PRE-TRAINED BERT FOR AUDIO CAPTIONING |
| Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Wenwu Wang, University of Surrey, United Kingdom; Volkan Kılıç, Izmir Katip Celebi University, Turkey |
| SS_18_P.4: AUTOMATED IMAGE CAPTIONING WITH MULTI-LAYER GATED RECURRENT UNIT |
| Özge Taylan Moral, Volkan Kılıç, Aytuğ Onan, izmir katip celebi university, Turkey; Wenwu Wang, university of surrey, United Kingdom |