IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
AUD-L4: Music and Audio Processing II
Thu, 26 May, 13:00 - 15:00 China Time (UTC +8)
Thu, 26 May, 05:00 - 07:00 UTC
Location: Roselle Junior Ballroom 4711-3
Session Co-Chairs: Paul Chan, Institute for Infocomm Research, A*STAR and Ye Wang, National University of Singapore
Track: Audio and Acoustic Signal Processing

AUD-L4.1: EXPLORING TRANSFORMER’S POTENTIAL ON AUTOMATIC PIANO TRANSCRIPTION

Longshen Ou, Ziyi Guo, Ye Wang, National University of Singapore, Singapore; Emmanouil Benetos, Queen Mary University of London, United Kingdom of Great Britain and Northern Ireland; Jiqing Han, Harbin Institute of Technology, China

AUD-L4.2: MODELING BEATS AND DOWNBEATS WITH A TIME-FREQUENCY TRANSFORMER

Yun-Ning Hung, Georgia Institute of Technology, United States of America; Ju-Chiang Wang, Xuchen Song, Wei-Tsung Lu, Minz Won, ByteDance, United States of America

AUD-L4.3: DIFFERENTIABLE DIGITAL SIGNAL PROCESSING MIXTURE MODEL FOR SYNTHESIS PARAMETER EXTRACTION FROM MIXTURE OF HARMONIC SOUNDS

Masaya Kawamura, Tomohiko Nakamura, Hiroshi Saruwatari, The University of Tokyo, Japan; Daichi Kitamura, National Institute of Technology, Kagawa College, Japan; Yu Takahashi, Kazunobu Kondo, Yamaha Corporation, Japan

AUD-L4.4: MULTICHANNEL NOISE REDUCTION USING DILATED MULTICHANNEL U-NET AND PRE-TRAINED SINGLE-CHANNEL NETWORK

Zhi-Wei Tan, Yuan Liu, Andy W. H. Khong, Nanyang Technological University, Singapore; Anh H. T. Nguyen, FPT Software, Viet Nam

AUD-L4.5: TONET: TONE-OCTAVE NETWORK FOR SINGING MELODY EXTRACTION FROM POLYPHONIC MUSIC

Ke Chen, Taylor Berg-Kirkpatrick, Shlomo Dubnov, University of California San Diego, United States of America; Shuai Yu, Wei Li, Fudan University, China; Cheng-i Wang, Smule Inc., United States of America

AUD-L4.6: GENRE-CONDITIONED ACOUSTIC MODELS FOR AUTOMATIC LYRICS TRANSCRIPTION OF POLYPHONIC MUSIC

Xiaoxue Gao, Chitralekha Gupta, Haizhou Li, National University of Singapore, Singapore

AUD-L4.7: ADVERSARIAL AUDIO SYNTHESIS USING A HARMONIC-PERCUSSIVE DISCRIMINATOR

Jihyun Lee, Hyungseob Lim, Chanwoo Lee, Hong-Goo Kang, Yonsei University, Korea, Republic of; Inseon Jang, Electronics and Telecommunications Research Institution, Korea, Republic of

AUD-L4.8: TIME-DOMAIN AUDIO SOURCE SEPARATION WITH NEURAL NETWORKS BASED ON MULTIRESOLUTION ANALYSIS

Tomohiko Nakamura, Shihori Kozuka, Hiroshi Saruwatari, University of Tokyo, Japan