Technical Program

SP-L2: End to End Speech Processing

Session Type: Lecture
Time: Monday, March 6, 16:00 - 18:00
Location: Grand Ballroom
Session Chairs: Bhuvana Ramabhadran, IBM T.J. Watson Research and Jinyu Li, MSFT
 
SP-L2.1: JOINT CTC-ATTENTION BASED END-TO-END SPEECH RECOGNITION USING MULTI-TASK LEARNING
         Suyoun Kim; Carnegie Mellon University
         Takaaki Hori; Mitsubishi Electric Research Laboratories
         Shinji Watanabe; Mitsubishi Electric Research Laboratories
 
SP-L2.2: END-TO-END ASR-FREE KEYWORD SEARCH FROM SPEECH
         Kartik Audhkhasi; IBM
         Andrew Rosenberg; IBM
         Abhinav Sethy; IBM
         Bhuvana Ramabhadran; IBM
         Brian Kingsbury; IBM
 
SP-L2.3: VERY DEEP CONVOLUTIONAL NETWORKS FOR END-TO-END SPEECH RECOGNITION
         Yu Zhang; Massachusetts Institute of Technology
         William Chan; Carnegie Mellon University
         Navdeep Jaitly; Google Brain
 
SP-L2.4: CONFIDENCE MEASURES FOR CTC-BASED PHONE SYNCHRONOUS DECODING
         Zhehuai Chen; Shanghai Jiao Tong University
         Yimeng Zhuang; Shanghai Jiao Tong University
         Kai Yu; Shanghai Jiao Tong University
 
SP-L2.5: MINIMUM BAYES RISK TRAINING OF CTC ACOUSTIC MODELS IN MAXIMUM A POSTERIORI BASED DECODING FRAMEWORK
         Naoyuki Kanda; National Institute of Information and Communications Technology
         Xugang Lu; National Institute of Information and Communications Technology
         Hisashi Kawai; National Institute of Information and Communications Technology
 
SP-L2.6: END-TO-END SPOOFING DETECTION WITH RAW WAVEFORM CLDNNS
         Heinrich Dinkel; Shanghai Jiao Tong University
         Nanxin Chen; Shanghai Jiao Tong University
         Yanmin Qian; Shanghai Jiao Tong University
         Kai Yu; Shanghai Jiao Tong University