MLSP-54.3
NEURAL AUDIO-TO-SCORE MUSIC TRANSCRIPTION FOR UNCONSTRAINED POLYPHONY USING COMPACT OUTPUT REPRESENTATIONS
Víctor Arroyo, Jose J. Valero-Mas, Jorge Calvo-Zaragoza, Antonio Pertusa, University of Alicante, Spain
Session:
Neural Audio Representation and Synthesis
Track:
Machine Learning for Signal Processing
Location:
Gather Area H
Presentation Time:
Fri, 13 May, 23:00 - 23:45 China Time (UTC +8)
Fri, 13 May, 15:00 - 15:45 UTC
Fri, 13 May, 15:00 - 15:45 UTC
Session Chair:
Torbjørn Svendsen, Norwegian University of Science and Technology
Session MLSP-54
MLSP-54.1: TOWARDS LEARNING UNIVERSAL AUDIO REPRESENTATIONS
Luyu Wang, Pauline Luc, Yan Wu, Adria Recasens, Lucas Smaira, Andrew Brock, Andrew Jaegle, Jean-Baptiste Alayrac, Sander Dieleman, Joao Carreira, Aaron van den Oord, DeepMind, United Kingdom of Great Britain and Northern Ireland
MLSP-54.2: DIFFERENTIABLE WAVETABLE SYNTHESIS
Siyuan Shan, University of North Carolina at Chapel Hill, United States of America; Lamtharn Hantrakul, Jitong Chen, Matt Avent, David Trevelyan, ByteDance, Thailand
MLSP-54.3: NEURAL AUDIO-TO-SCORE MUSIC TRANSCRIPTION FOR UNCONSTRAINED POLYPHONY USING COMPACT OUTPUT REPRESENTATIONS
Víctor Arroyo, Jose J. Valero-Mas, Jorge Calvo-Zaragoza, Antonio Pertusa, University of Alicante, Spain
MLSP-54.4: END-TO-END MUSIC REMASTERING SYSTEM USING SELF-SUPERVISED AND ADVERSARIAL TRAINING
Junghyun Koo, Seungryeol Paik, Kyogu Lee, Seoul National University, Korea, Republic of
MLSP-54.5: AVQVC: One-shot Voice Conversion by Vector Quantization with Applying Contrastive Learning
Huaizhen Tang, University of Science and Technology of China, China; Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao, Ping An Technology (Shenzhen) Co., Ltd., China
MLSP-54.6: TOWARDS SPEAKER AGE ESTIMATION WITH LABEL DISTRIBUTION LEARNING
Shijing Si, Jianzong Wang, Junqing Peng, Jing Xiao, Ping An Technology (Shenzhen) Co., Ltd., China