MLSP-43.6
FilterAugment: An Acoustic Environmental Data Augmentation Method
Hyeonuk Nam, Seong-Hu Kim, Yong-Hwa Park, KAIST, Korea, Republic of
Session:
Deep Learning for Speech and Language Processing
Track:
Machine Learning for Signal Processing
Location:
Gather Area F
Presentation Time:
Thu, 12 May, 21:00 - 21:45 China Time (UTC +8)
Thu, 12 May, 13:00 - 13:45 UTC
Thu, 12 May, 13:00 - 13:45 UTC
Session Chair:
Isabel Trancoso, Instituto Superior Técnico
Session MLSP-43
MLSP-43.1: INTEGER-ONLY ZERO-SHOT QUANTIZATION FOR EFFICIENT SPEECH RECOGNITION
Sehoon Kim, Amir Gholami, Zhewei Yao, Nicholas Lee, Patrick Wang, Aniruddha Nrusimha, Bohan Zhai, Tianren Gao, Michael Mahoney, Kurt Keutzer, University of California, Berkeley, United States of America
MLSP-43.2: ONE-CLASS LEARNING TOWARDS SYNTHETIC VOICE SPOOFING DETECTION
You Zhang, Fei Jiang, Zhiyao Duan, University of Rochester, United States of America
MLSP-43.3: nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech
Botao Zhao, Fudan University, China; Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao, Ping An Technology (Shenzhen) Co., Ltd., China
MLSP-43.4: NOISE-ROBUST SPEECH RECOGNITION WITH 10 MINUTES UNPARALLELED IN-DOMAIN DATA
Chen Chen, Nana Hou, Yuchen Hu, Eng Siong Chng, Nanyang Technological University, Singapore; Shashank Shirol, Manipal Institute of Technology, India
MLSP-43.5: ENHANCING CLASS UNDERSTANDING VIA PROMPT-TUNING FOR ZERO-SHOT TEXT CLASSIFICATION
Yuhao Dan, Qin Chen, Liang He, East China Normal University, China; Jie Zhou, Fudan Univerisity, China; Qingchun Bai, Shanghai Open University, China
MLSP-43.6: FilterAugment: An Acoustic Environmental Data Augmentation Method
Hyeonuk Nam, Seong-Hu Kim, Yong-Hwa Park, KAIST, Korea, Republic of