Technical Program

ASMSP_L1: Multimodal signal processing

Session Type: Lecture
Time: Tuesday, August 30, 11:20 - 13:00
Location: Baltic + Aegean
Session Chair: Shreya G. Upadhyay, National Tsing Hua University
 
ASMSP_L1.1: ZERO-SHOT AUDIO CLASSIFICATION USING IMAGE EMBEDDINGS
Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen, Tampere University, Finland
 
ASMSP_L1.2: IMPROVING MULTIMODAL MOVIE SCENE SEGMENTATION USING MIXTURE OF ACOUSTIC EXPERTS
Meng-Han Lin, Jeng-Lin Li, Chi-Chun Lee, National Tsing Hua University, Taiwan
 
ASMSP_L1.3: ON THE INTEGRATION OF ACOUSTICS AND LIDAR: A MULTI-MODAL APPROACH TO ACOUSTIC REFLECTOR ESTIMATION
Ellen Riemens, Jorge Martinez, Richard C. Hendriks, TU Delft, Netherlands; Pablo Martinez-Nuevo, Martin Moller, Bang & Olufsen, Denmark
 
ASMSP_L1.4: IMPROVING INDUCED VALENCE RECOGNITION BY INTEGRATING ACOUSTIC SOUND SEMANTICS IN MOVIES
Shreya G. Upadhyay, Bo-Hao Su, Chi-Chun Lee, National Tsing Hua University, Taiwan
 
ASMSP_L1.5: BONE-CONDUCTED SPEECH ENHANCEMENT USING VECTOR-QUANTIZED VARIATIONAL AUTOENCODER AND GAMMACHIRP FILTERBANK CEPSTRAL COEFFICIENTS
Quoc-Huy Nguyen, Masashi Unoki, Japan Advanced Institute of Science and Technology, Japan