Technical Program
MMSP-L2: Audiovisual and Cross-media Processing |
Session Type: Lecture |
Time: Thursday, March 9, 16:00 - 18:00 |
Location: Grand Salon 4 |
Session Chair: Gaurav Sharma, University of Rochester |
MMSP-L2.1: 3D AUDIO-VISUAL SPEAKER TRACKING WITH AN ADAPTIVE PARTICLE FILTER |
Xinyuan Qian; Queen Mary University of London |
Alessio Brutti; Fondazione Bruno Kessler |
Maurizio Omologo; Fondazione Bruno Kessler |
Andrea Cavallaro; Queen Mary University of London |
MMSP-L2.2: AUDIO-VISUAL OBJECT LOCALIZATION AND SEPARATION USING LOW-RANK AND SPARSITY |
Jie Pu; Imperial College London |
Yannis Panagakis; Imperial College London |
Stavros Petridis; Imperial College London |
Maja Pantic; Imperial College London / University of Twente |
MMSP-L2.3: SEE AND LISTEN: SCORE-INFORMED ASSOCIATION OF SOUND TRACKS TO PLAYERS IN CHAMBER MUSIC PERFORMANCE VIDEOS |
Bochen Li; University of Rochester |
Karthik Dinesh; University of Rochester |
Zhiyao Duan; University of Rochester |
Gaurav Sharma; University of Rochester |
MMSP-L2.4: RATE-COVERAGE ANALYSIS AND OPTIMIZATION FOR JOINT AUDIO-VIDEO MULTIMEDIA RETRIEVAL |
Guanghan Ning; University of Missouri-Columbia |
Zhi Zhang; University of Missouri-Columbia |
Xiaobo Ren; TCL Research America |
Haohong Wang; TCL Research America |
Zhihai He; University of Missouri-Columbia |
MMSP-L2.5: CROSS-MODAL TRANSFER WITH NEURAL WORD VECTORS FOR IMAGE FEATURE LEARNING |
Go Irie; NTT Corporation |
Taichi Asami; NTT Corporation |
Shuhei Tarashima; NTT Corporation |
Takayuki Kurozumi; NTT Corporation |
Tetsuya Kinebuchi; NTT Corporation |
MMSP-L2.6: CROSS-MODALITY MATCHING BASED ON FISHER VECTOR WITH NEURAL WORD EMBEDDINGS AND DEEP IMAGE FEATURES |
Liang Han; Peking University |
Wenmin Wang; Peking University |
Mengdi Fan; Peking University |
Ronggang Wang; Peking University |