ASMSP_L1: Multimodal signal processing
Session Type: Lecture
Time: Tuesday, August 30, 11:20 - 13:00
Location: Baltic + Aegean
Session Chair: Shreya G. Upadhyay, National Tsing Hua University

ASMSP_L1.1: ZERO-SHOT AUDIO CLASSIFICATION USING IMAGE EMBEDDINGS
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen, Tampere University, Finland

ASMSP_L1.2: IMPROVING MULTIMODAL MOVIE SCENE SEGMENTATION USING MIXTURE OF ACOUSTIC EXPERTS
Meng-Han Lin, Jeng-Lin Li, Chi-Chun Lee, National Tsing Hua University, Taiwan

ASMSP_L1.3: ON THE INTEGRATION OF ACOUSTICS AND LIDAR: A MULTI-MODAL APPROACH TO ACOUSTIC REFLECTOR ESTIMATION
Ellen Riemens, Jorge Martinez, Richard C. Hendriks, TU Delft, Netherlands; Pablo Martinez-Nuevo, Martin Moller, Bang & Olufsen, Denmark

ASMSP_L1.4: IMPROVING INDUCED VALENCE RECOGNITION BY INTEGRATING ACOUSTIC SOUND SEMANTICS IN MOVIES
Shreya G. Upadhyay, Bo-Hao Su, Chi-Chun Lee, National Tsing Hua University, Taiwan

ASMSP_L1.5: BONE-CONDUCTED SPEECH ENHANCEMENT USING VECTOR-QUANTIZED VARIATIONAL AUTOENCODER AND GAMMACHIRP FILTERBANK CEPSTRAL COEFFICIENTS
Quoc-Huy Nguyen, Masashi Unoki, Japan Advanced Institute of Science and Technology, Japan