Technical Program
AASP-P1: Deep Learning for Source Separation and Enhancement II |
| Session Type: Poster |
| Time: Monday, March 6, 13:30 - 15:30 |
| Location: Churchill: Poster Area H |
| Session Chair: Jonathan Le Roux, MERL |
| AASP-P1.1: PERMUTATION INVARIANT TRAINING OF DEEP MODELS FOR SPEAKER-INDEPENDENT MULTI-TALKER SPEECH SEPARATION |
| Dong Yu; Microsoft Research |
| Morten Kolbæk; Aalborg University |
| Zheng-Hua Tan; Aalborg University |
| Jesper Jensen; Aalborg University |
| AASP-P1.2: DEEP ATTRACTOR NETWORK FOR SINGLE-MICROPHONE SPEAKER SEPARATION |
| Zhuo Chen; Columbia University |
| Yi Luo; Columbia University |
| Nima Mesgarani; Columbia University |
| AASP-P1.3: DEEP MIXTURE DENSITY NETWORK FOR STATISTICAL MODEL-BASED FEATURE ENHANCEMENT |
| Keisuke Kinoshita; NTT Corporation |
| Marc Delcroix; NTT Corporation |
| Atsunori Ogawa; NTT Corporation |
| Takuya Higuchi; NTT Corporation |
| Tomohiro Nakatani; NTT Corporation |
| AASP-P1.4: IMPACT OF LOW-PRECISION DEEP REGRESSION NETWORKS ON SINGLE-CHANNEL SOURCE SEPARATION |
| Enea Ceolini; University Zürich, ETH Zürich |
| Shih-Chii Liu; University Zürich, ETH Zürich |
| AASP-P1.5: IMPROVING MUSIC SOURCE SEPARATION BASED ON DEEP NEURAL NETWORKS THROUGH DATA AUGMENTATION AND NETWORK BLENDING |
| Stefan Uhlich; Sony Europe Limited |
| Marcello Porcu; Sony Europe Limited |
| Franck Giron; Sony Europe Limited |
| Michael Enenkl; Sony Europe Limited |
| Thomas Kemp; Sony Europe Limited |
| Naoya Takahashi; Sony Corporation |
| Yuki Mitsufuji; Sony Corporation |
| AASP-P1.6: SUPERVISED SOURCE ENHANCEMENT COMPOSED OF NONNEGATIVE AUTO-ENCODERS AND COMPLEMENTARITY SUBTRACTION |
| Kenta Niwa; NTT Corporation |
| Yuma Koizumi; NTT Corporation |
| Tomoko Kawase; NTT Corporation |
| Kazunori Kobayashi; NTT Corporation |
| Yusuke Hioka; The University of Auckland |
| AASP-P1.7: DEEP LONG SHORT-TERM MEMORY ADAPTIVE BEAMFORMING NETWORKS FOR MULTICHANNEL ROBUST SPEECH RECOGNITION |
| Zhong Meng; Georgia Institute of Technology |
| Shinji Watanabe; Mitsubishi Electric Research Laboratories |
| John R. Hershey; Mitsubishi Electric Research Laboratories |
| Hakan Erdogan; Microsoft Research |
| AASP-P1.8: A SPEECH ENHANCEMENT ALGORITHM BY ITERATING SINGLE- AND MULTI-MICROPHONE PROCESSING AND ITS APPLICATION TO ROBUST ASR |
| Xueliang Zhang; Inner Mongolia University |
| Zhong-Qiu Wang; The Ohio State University |
| DeLiang Wang; The Ohio State University |
| AASP-P1.9: FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION |
| Yuan-Shan Lee; National Central University |
| Chien-Yao Wang; National Central University |
| Shu-Fan Wang; National Central University |
| Jia-Ching Wang; National Central University |
| Chung-Hsien Wu; National Cheng Kung University |
| AASP-P1.10: INTEGRATING DNN-BASED AND SPATIAL CLUSTERING-BASED MASK ESTIMATION FOR ROBUST MVDR BEAMFORMING |
| Tomohiro Nakatani; NTT Corporation |
| Nobutaka Ito; NTT Corporation |
| Takuya Higuchi; NTT Corporation |
| Shoko Araki; NTT Corporation |
| Keisuke Kinoshita; NTT Corporation |