MMSP-5.1
ENHANCING CONTRASTIVE LEARNING WITH TEMPORAL COGNIZANCE FOR AUDIO-VISUAL REPRESENTATION GENERATION
Chandrashekhar Lavania, Shiva Sundaram, Sundararajan Srinivasan, Katrin Kirchhoff, Amazon, United States of America
Session:
Multimodal Signal Processing, Analysis, and Synthesis II
Track:
Multimedia Signal Processing
Location:
Gather Area O
Presentation Time:
Tue, 10 May, 22:00 - 22:45 China Time (UTC +8)
Tue, 10 May, 14:00 - 14:45 UTC
Session Chair:
Chaker Larabi, Université de Poitiers
Session MMSP-5
MMSP-5.1: ENHANCING CONTRASTIVE LEARNING WITH TEMPORAL COGNIZANCE FOR AUDIO-VISUAL REPRESENTATION GENERATION
Chandrashekhar Lavania, Shiva Sundaram, Sundararajan Srinivasan, Katrin Kirchhoff, Amazon, United States of America
MMSP-5.2: CROSS-MODAL KNOWLEDGE DISTILLATION IN MULTI-MODAL FAKE NEWS DETECTION
Zimian Wei, Hengyue Pan, Linbo Qiao, Xin Niu, Peijie Dong, Dongsheng Li, College of Computer, National University of Defense Technology, China
MMSP-5.3: TRAINING STRATEGIES FOR AUTOMATIC SONG WRITING: A UNIFIED FRAMEWORK PERSPECTIVE
Tao Qian, Shuai Guo, Qin Jin, Renmin University of China, China; Jiatong Shi, Peter Wu, Carnegie Mellon University, United States of America
MMSP-5.4: RESIDUAL-GUIDED PERSONALIZED SPEECH SYNTHESIS BASED ON FACE IMAGE
Jianrong Wang, Zixuan Wang, Xiaosheng Hu, Xuewei Li, Tianjin University, China; Qiang Fang, Chinese Academy of Social Sciences, China; Li Liu, The Chinese University of Hong Kong, China
MMSP-5.5: Sketch storytelling
Yucheng Zhou, Fudan University, China
MMSP-5.6: MAG+: AN EXTENDED MULTIMODAL ADAPTATION GATE FOR MULTIMODAL SENTIMENT ANALYSIS
Xianbing Zhao, Yixin Chen, Wanting Li, Buzhou Tang, Harbin Institute of Technology (Shenzhen), China; Lei Gao, University College London, United Kingdom of Great Britain and Northern Ireland