2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information
Login Paper Search My Schedule Paper Index Help

My ICASSP 2021 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.
  1. Create a login based on your email (takes less than one minute)
  2. Perform 'Paper Search'
  3. Select papers that you desire to save in your personalized schedule
  4. Click on 'My Schedule' to see the current list of selected papers
  5. Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)
Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2021 Open Preview.

Clicking on the Add button next to a paper title will add that paper to your custom schedule.
Clicking on the Remove button next to a paper will remove that paper from your custom schedule.

AUD-33: Topics in Deep Learning for Speech and Audio

Session Type: Poster
Time: Friday, 11 June, 14:00 - 14:45
Location: Gather.Town
Session Chair: Hirokazu Kameoka, Nippon Telegraph and Telephone Corporation
 
   AUD-33.1: UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION
         Jian Luo; Ping An Technology (Shenzhen) Co., Ltd.
         Jianzong Wang; Ping An Technology (Shenzhen) Co., Ltd.
         Ning Cheng; Ping An Technology (Shenzhen) Co., Ltd.
         Jing Xiao; Ping An Technology (Shenzhen) Co., Ltd.
 
   AUD-33.2: ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION
         Kazuki Shimada; Sony Corporation
         Yuichiro Koyama; Sony Corporation
         Naoya Takahashi; Sony Corporation
         Shusuke Takahashi; Sony Corporation
         Yuki Mitsufuji; Sony Corporation
 
   AUD-33.3: SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET
         Kun Zhou; National University of Singapore
         Berrak Sisman; Singapore University of Technology and Design
         Rui Liu; Singapore University of Technology and Design
         Haizhou Li; National University of Singapore
 
   AUD-33.4: U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS
         Eesung Kim; Kakao Enterprise
         Jae-Jin Jeon; Kakao Enterprise
         Hyeji Seo; Kakao Enterprise
 
   AUD-33.5: A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION
         You Wang; Georgia Institute of Technology
         Chuyao Feng; Georgia Institute of Technology
         David Anderson; Georgia Institute of Technology
 
   AUD-33.6: A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK
         Thi Ngoc Tho Nguyen; Nanyang Technological University
         Ngoc Khanh Nguyen; Motional
         Huy Phan; Queen Mary University of London
         Lam Pham; Austrian Institute of Technology
         Kenneth Ooi; Nanyang Technological University
         Douglas L. Jones; University of Illinois at Urbana-Champaign
         Woon-Seng Gan; Nanyang Technological University