Technical Program
MMSP-L2: Audiovisual and Cross-media Processing |
| Session Type: Lecture |
| Time: Thursday, March 9, 16:00 - 18:00 |
| Location: Grand Salon 4 |
| Session Chair: Gaurav Sharma, University of Rochester |
| MMSP-L2.1: 3D AUDIO-VISUAL SPEAKER TRACKING WITH AN ADAPTIVE PARTICLE FILTER |
| Xinyuan Qian; Queen Mary University of London |
| Alessio Brutti; Fondazione Bruno Kessler |
| Maurizio Omologo; Fondazione Bruno Kessler |
| Andrea Cavallaro; Queen Mary University of London |
| MMSP-L2.2: AUDIO-VISUAL OBJECT LOCALIZATION AND SEPARATION USING LOW-RANK AND SPARSITY |
| Jie Pu; Imperial College London |
| Yannis Panagakis; Imperial College London |
| Stavros Petridis; Imperial College London |
| Maja Pantic; Imperial College London / University of Twente |
| MMSP-L2.3: SEE AND LISTEN: SCORE-INFORMED ASSOCIATION OF SOUND TRACKS TO PLAYERS IN CHAMBER MUSIC PERFORMANCE VIDEOS |
| Bochen Li; University of Rochester |
| Karthik Dinesh; University of Rochester |
| Zhiyao Duan; University of Rochester |
| Gaurav Sharma; University of Rochester |
| MMSP-L2.4: RATE-COVERAGE ANALYSIS AND OPTIMIZATION FOR JOINT AUDIO-VIDEO MULTIMEDIA RETRIEVAL |
| Guanghan Ning; University of Missouri-Columbia |
| Zhi Zhang; University of Missouri-Columbia |
| Xiaobo Ren; TCL Research America |
| Haohong Wang; TCL Research America |
| Zhihai He; University of Missouri-Columbia |
| MMSP-L2.5: CROSS-MODAL TRANSFER WITH NEURAL WORD VECTORS FOR IMAGE FEATURE LEARNING |
| Go Irie; NTT Corporation |
| Taichi Asami; NTT Corporation |
| Shuhei Tarashima; NTT Corporation |
| Takayuki Kurozumi; NTT Corporation |
| Tetsuya Kinebuchi; NTT Corporation |
| MMSP-L2.6: CROSS-MODALITY MATCHING BASED ON FISHER VECTOR WITH NEURAL WORD EMBEDDINGS AND DEEP IMAGE FEATURES |
| Liang Han; Peking University |
| Wenmin Wang; Peking University |
| Mengdi Fan; Peking University |
| Ronggang Wang; Peking University |