Technical Program
AASP-P1: Deep Learning for Source Separation and Enhancement II |
Session Type: Poster |
Time: Monday, March 6, 13:30 - 15:30 |
Location: Churchill: Poster Area H |
Session Chair: Jonathan Le Roux, MERL |
AASP-P1.1: PERMUTATION INVARIANT TRAINING OF DEEP MODELS FOR SPEAKER-INDEPENDENT MULTI-TALKER SPEECH SEPARATION |
Dong Yu; Microsoft Research |
Morten Kolbæk; Aalborg University |
Zheng-Hua Tan; Aalborg University |
Jesper Jensen; Aalborg University |
AASP-P1.2: DEEP ATTRACTOR NETWORK FOR SINGLE-MICROPHONE SPEAKER SEPARATION |
Zhuo Chen; Columbia University |
Yi Luo; Columbia University |
Nima Mesgarani; Columbia University |
AASP-P1.3: DEEP MIXTURE DENSITY NETWORK FOR STATISTICAL MODEL-BASED FEATURE ENHANCEMENT |
Keisuke Kinoshita; NTT Corporation |
Marc Delcroix; NTT Corporation |
Atsunori Ogawa; NTT Corporation |
Takuya Higuchi; NTT Corporation |
Tomohiro Nakatani; NTT Corporation |
AASP-P1.4: IMPACT OF LOW-PRECISION DEEP REGRESSION NETWORKS ON SINGLE-CHANNEL SOURCE SEPARATION |
Enea Ceolini; University of Zürich, ETH Zürich |
Shih-Chii Liu; University of Zürich, ETH Zürich |
AASP-P1.5: IMPROVING MUSIC SOURCE SEPARATION BASED ON DEEP NEURAL NETWORKS THROUGH DATA AUGMENTATION AND NETWORK BLENDING |
Stefan Uhlich; Sony Europe Limited |
Marcello Porcu; Sony Europe Limited |
Franck Giron; Sony Europe Limited |
Michael Enenkl; Sony Europe Limited |
Thomas Kemp; Sony Europe Limited |
Naoya Takahashi; Sony Corporation |
Yuki Mitsufuji; Sony Corporation |
AASP-P1.6: SUPERVISED SOURCE ENHANCEMENT COMPOSED OF NONNEGATIVE AUTO-ENCODERS AND COMPLEMENTARITY SUBTRACTION |
Kenta Niwa; NTT Corporation |
Yuma Koizumi; NTT Corporation |
Tomoko Kawase; NTT Corporation |
Kazunori Kobayashi; NTT Corporation |
Yusuke Hioka; The University of Auckland |
AASP-P1.7: DEEP LONG SHORT-TERM MEMORY ADAPTIVE BEAMFORMING NETWORKS FOR MULTICHANNEL ROBUST SPEECH RECOGNITION |
Zhong Meng; Georgia Institute of Technology |
Shinji Watanabe; Mitsubishi Electric Research Laboratories |
John R. Hershey; Mitsubishi Electric Research Laboratories |
Hakan Erdogan; Microsoft Research |
AASP-P1.8: A SPEECH ENHANCEMENT ALGORITHM BY ITERATING SINGLE- AND MULTI-MICROPHONE PROCESSING AND ITS APPLICATION TO ROBUST ASR |
Xueliang Zhang; Inner Mongolia University |
Zhong-Qiu Wang; The Ohio State University |
DeLiang Wang; The Ohio State University |
AASP-P1.9: FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION |
Yuan-Shan Lee; National Central University |
Chien-Yao Wang; National Central University |
Shu-Fan Wang; National Central University |
Jia-Ching Wang; National Central University |
Chung-Hsien Wu; National Cheng Kung University |
AASP-P1.10: INTEGRATING DNN-BASED AND SPATIAL CLUSTERING-BASED MASK ESTIMATION FOR ROBUST MVDR BEAMFORMING |
Tomohiro Nakatani; NTT Corporation |
Nobutaka Ito; NTT Corporation |
Takuya Higuchi; NTT Corporation |
Shoko Araki; NTT Corporation |
Keisuke Kinoshita; NTT Corporation |