IEEE ICASSP 2022

2022 IEEE International Conference on Acoustics, Speech and Signal Processing

7-13 May 2022
  • Virtual (all paper presentations)
22-27 May 2022
  • Main Venue: Marina Bay Sands Expo & Convention Center, Singapore
27-28 October 2022
  • Satellite Venue: Crowne Plaza Shenzhen Longgang City Centre, Shenzhen, China

ICASSP 2022
SPE-L2: Voice Conversion, Speech Enhancement and Analysis
Tue, 24 May, 16:00 - 18:00 China Time (UTC +8)
Tue, 24 May, 08:00 - 10:00 UTC
Location: Simpor Junior Ballroom 4811-3
Session Co-Chairs: Berrak SISMAN, Singapore University of Technology and Design and Marc-André Carbonneau, Ubisoft
Track: Speech and Language Processing

SPE-L2.1: NVC-NET: END-TO-END ADVERSARIAL VOICE CONVERSION

Bac Nguyen, Fabien Cardinaux, Sony Europe B.V., Germany

SPE-L2.2: TOWARDS IDENTITY PRESERVING NORMAL TO DYSARTHRIC VOICE CONVERSION

Wen-Chin Huang, Lester Phillip Violeta, Tomoki Toda, Nagoya University, Japan; Bence Mark Halpern, Odette Scharenborg, Delft University of Technology, Netherlands

SPE-L2.3: A COMPARISON OF DISCRETE AND SOFT SPEECH UNITS FOR IMPROVED VOICE CONVERSION

Benjamin van Niekerk, Matthew Baas, Herman Kamper, Stellenbosch University, South Africa; Marc-André Carbonneau, Julian Zaïdi, Hugo Seuté, Ubisoft, Canada

SPE-L2.4: THE IMPACT OF REMOVING HEAD MOVEMENTS ON AUDIO-VISUAL SPEECH ENHANCEMENT

Zhiqi Kang, Radu Horaud, Xavier Alameda-Pineda, Inria Grenoble Rhône-Alpes & Univ. Grenoble Alpes, France, France; Mostafa Sadeghi, Inria Nancy Grand-Est, France, France; Jacob Donley, Anurag Kumar, Facebook Reality Labs Research, Redmond WA, USA, United States of America

SPE-L2.5: PHASE CONTINUITY: LEARNING DERIVATIVES OF PHASE SPECTRUM FOR SPEECH ENHANCEMENT

Doyeon Kim, Hyewon Han, Hong-Goo Kang, Yonsei University, Korea, Republic of; Hyeon-Kyeong Shin, Soo-Whan Chung, Naver Corporation, Korea, Republic of

SPE-L2.6: L-SpEx: Localized Target Speaker Extraction

Meng Ge, Longbiao Wang, Jianwu Dang, Tianjin University, China; Chenglin Xu, Kuaishou Technology, China; Eng Siong Chng, Nanyang Technological University, Singapore; Haizhou Li, National University of Singapore, Singapore

SPE-L2.8: EXPLORING DEMENTIA DETECTION FROM SPEECH: CROSS CORPUS ANALYSIS

Ayimnisagul Ablimit, Tanja Schultz, University of Bremen, Germany; Catarina Botelho, Alberto Abad, Isabel Trancoso, INESC-ID/Instituto Superior Técnico, Portugal

SPE-L2.9: VOICE FILTER: FEW-SHOT TEXT-TO-SPEECH SPEAKER ADAPTATION USING VOICE CONVERSION AS A POST-PROCESSING MODULE

Adam Gabryś, Goeric Huybrechts, Manuel Sam Ribeiro, Julian Roth, Giulia Comini, Roberto Barra-Chicote, Bartek Perz, Jaime Lorenzo-Trueba, Amazon, Poland; Chung-Ming Chien, National Taiwan University (NTU), United Kingdom of Great Britain and Northern Ireland