Technical Program

SS_18_P: Advances in Audio-Visual Captioning

Session Type: Poster
Time: Thursday, September 1, 11:00 - 12:40
Location: Poster Area 3
Session Chair: Yoann Altmann, Heriot-Watt University
 
SS_18_P.1: AUXILIARY CLASSIFIER BASED RESIDUAL RNN FOR IMAGE CAPTIONING
Özkan Çaylı, Volkan Kılıç, Aytuğ Onan, Izmir Katip Çelebi University, Turkey; Wenwu Wang, University of Surrey, United Kingdom
 
SS_18_P.2: CLOTHO-AQA: A CROWDSOURCED DATASET FOR AUDIO QUESTION ANSWERING
Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen, Tampere University, Finland
 
SS_18_P.3: LEVERAGING PRE-TRAINED BERT FOR AUDIO CAPTIONING
Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Wenwu Wang, University of Surrey, United Kingdom; Volkan Kılıç, Izmir Katip Celebi University, Turkey
 
SS_18_P.4: AUTOMATED IMAGE CAPTIONING WITH MULTI-LAYER GATED RECURRENT UNIT
Özge Taylan Moral, Volkan Kılıç, Aytuğ Onan, izmir katip celebi university, Turkey; Wenwu Wang, university of surrey, United Kingdom