IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022

Virtual (all paper presentations)

22-27 May 2022

Main Venue: Marina Bay Sands Expo & Convention Center, Singapore

27-28 October 2022

Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

MLSP-54.3

NEURAL AUDIO-TO-SCORE MUSIC TRANSCRIPTION FOR UNCONSTRAINED POLYPHONY USING COMPACT OUTPUT REPRESENTATIONS

Víctor Arroyo, Jose J. Valero-Mas, Jorge Calvo-Zaragoza, Antonio Pertusa, University of Alicante, Spain

Session: Neural Audio Representation and Synthesis

Location: Gather Area H

Presentation Time:
Fri, 13 May, 23:00 - 23:45 China Time (UTC +8)
Fri, 13 May, 15:00 - 15:45 UTC

Session Chair: Torbjørn Svendsen, Norwegian University of Science and Technology

Session MLSP-54

MLSP-54.1: TOWARDS LEARNING UNIVERSAL AUDIO REPRESENTATIONS

Luyu Wang, Pauline Luc, Yan Wu, Adria Recasens, Lucas Smaira, Andrew Brock, Andrew Jaegle, Jean-Baptiste Alayrac, Sander Dieleman, Joao Carreira, Aaron van den Oord, DeepMind, United Kingdom of Great Britain and Northern Ireland

MLSP-54.2: DIFFERENTIABLE WAVETABLE SYNTHESIS

Siyuan Shan, University of North Carolina at Chapel Hill, United States of America; Lamtharn Hantrakul, Jitong Chen, Matt Avent, David Trevelyan, ByteDance, Thailand

MLSP-54.3: NEURAL AUDIO-TO-SCORE MUSIC TRANSCRIPTION FOR UNCONSTRAINED POLYPHONY USING COMPACT OUTPUT REPRESENTATIONS

Víctor Arroyo, Jose J. Valero-Mas, Jorge Calvo-Zaragoza, Antonio Pertusa, University of Alicante, Spain

MLSP-54.4: END-TO-END MUSIC REMASTERING SYSTEM USING SELF-SUPERVISED AND ADVERSARIAL TRAINING

Junghyun Koo, Seungryeol Paik, Kyogu Lee, Seoul National University, Korea, Republic of

MLSP-54.5: AVQVC: One-shot Voice Conversion by Vector Quantization with Applying Contrastive Learning

Huaizhen Tang, University of Science and Technology of China, China; Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao, Ping An Technology (Shenzhen) Co., Ltd., China

MLSP-54.6: TOWARDS SPEAKER AGE ESTIMATION WITH LABEL DISTRIBUTION LEARNING

Shijing Si, Jianzong Wang, Junqing Peng, Jing Xiao, Ping An Technology (Shenzhen) Co., Ltd., China

Last updated 21 May 2022.
