AUD-33: Topics in Deep Learning for Speech and Audio |
| Session Type: Poster |
| Time: Friday, 11 June, 14:00 - 14:45 |
| Location: Gather.Town |
| Virtual Session: View on Virtual Platform |
| Session Chair: Hirokazu Kameoka, Nippon Telegraph and Telephone Corporation |
| AUD-33.1: UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION |
| Jian Luo; Ping An Technology (Shenzhen) Co., Ltd. |
| Jianzong Wang; Ping An Technology (Shenzhen) Co., Ltd. |
| Ning Cheng; Ping An Technology (Shenzhen) Co., Ltd. |
| Jing Xiao; Ping An Technology (Shenzhen) Co., Ltd. |
| AUD-33.2: ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION |
| Kazuki Shimada; Sony Corporation |
| Yuichiro Koyama; Sony Corporation |
| Naoya Takahashi; Sony Corporation |
| Shusuke Takahashi; Sony Corporation |
| Yuki Mitsufuji; Sony Corporation |
| AUD-33.3: SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET |
| Kun Zhou; National University of Singapore |
| Berrak Sisman; Singapore University of Technology and Design |
| Rui Liu; Singapore University of Technology and Design |
| Haizhou Li; National University of Singapore |
| AUD-33.4: U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS |
| Eesung Kim; Kakao Enterprise |
| Jae-Jin Jeon; Kakao Enterprise |
| Hyeji Seo; Kakao Enterprise |
| AUD-33.5: A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION |
| You Wang; Georgia Institute of Technology |
| Chuyao Feng; Georgia Institute of Technology |
| David Anderson; Georgia Institute of Technology |
| AUD-33.6: A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK |
| Thi Ngoc Tho Nguyen; Nanyang Technological University |
| Ngoc Khanh Nguyen; Motional |
| Huy Phan; Queen Mary University of London |
| Lam Pham; Austrian Institute of Technology |
| Kenneth Ooi; Nanyang Technological University |
| Douglas L. Jones; University of Illinois at Urbana-Champaign |
| Woon-Seng Gan; Nanyang Technological University |