MLSP-49.3
Exploring Dual Stream Global Information for Image Captioning
Tiantao Xian, Zhixin Li, Tianyu Chen, Guangxi Normal University, China; Huifang Ma, Northwest Normal University, China
Session:
Learning from Multimodal Data
Track:
Machine Learning for Signal Processing
Location:
Gather Area H
Presentation Time:
Fri, 13 May, 20:00 - 20:45 China Time (UTC +8)
Fri, 13 May, 12:00 - 12:45 UTC
Session Chair:
Jen-Tzung Chien, National Yang Ming Jiao Tong University
Session MLSP-49
MLSP-49.1: CROSS-MODAL KNOWLEDGE DISTILLATION FOR VISION-TO-SENSOR ACTION RECOGNITION
Jianyuan Ni, Anne H. H. Ngu, Texas State University, United States of America; Raunak Sarbajna, University of Houston, United States of America; Yang Liu, Sun Yat-sen University, China; Yan Yan, Illinois Institute of Technology, United States of America
MLSP-49.2: CLIPCAM: A Simple Baseline for Zero-shot Text-guided Object and Action Localization
Hsuan-An Hsia, Che-Hsien Lin, Bo-Han Kung, Jhao-Ting Chen, Daniel Stanley Tan, Jun-Cheng Chen, Academia Sinica, Taiwan; Kai-Lung Hua, National Taiwan University of Science and Technology, Taiwan
MLSP-49.3: Exploring Dual Stream Global Information for Image Captioning
Tiantao Xian, Zhixin Li, Tianyu Chen, Guangxi Normal University, China; Huifang Ma, Northwest Normal University, China
MLSP-49.4: UNSUPERVISED CONTRASTIVE HASHING FOR CROSS-MODAL RETRIEVAL IN REMOTE SENSING
Georgii Mikriukov, Mahdyar Ravanbakhsh, Begüm Demir, Technische Universität Berlin, Germany
MLSP-49.5: ROBUST THERMAL INFRARED PEDESTRIAN DETECTION BY ASSOCIATING VISIBLE PEDESTRIAN KNOWLEDGE
Sungjune Park, Dae Hwi Choi, Jung Uk Kim, Yong Man Ro, Korea Advanced Institute of Science and Technology (KAIST), Korea, Republic of
MLSP-49.6: A GENERALIZED HIERARCHICAL NONNEGATIVE TENSOR DECOMPOSITION
Joshua Vendrow, Deanna Needell, UCLA, United States of America; Jamie Haddock, Harvey Mudd College, United States of America