IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
SPE-60.3

AUDIO-VISUAL SCENE-AWARE DIALOG AND REASONING USING AUDIO-VISUAL TRANSFORMERS WITH JOINT STUDENT-TEACHER LEARNING

Ankit Parag Shah, Carnegie Mellon University, United States of America; Shijie Geng, Rutgers University, United States of America; Gao Peng, Chinese University of Hong Kong, United States of America; Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori, Mitsubishi Electric Research Laboratories (MERL), United States of America

Session:
Multimodal Language Processing

Track:
Speech and Language Processing

Location:
Gather Area E

Presentation Time:
Wed, 11 May, 22:00 - 22:45 China Time (UTC +8)
Wed, 11 May, 14:00 - 14:45 UTC

Session Chair:
David Harwath, University of Texas, Austin
Presentation
Discussion
Resources