My SLT 2018 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.

Create a login based on your email (takes less than one minute)
Perform 'Paper Search'
Select papers that you desire to save in your personalized schedule
Click on 'My Schedule' to see the current list of selected papers
Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)

Paper Detail

Presentation #	13
Session:	ASR IV
Session Time:	Friday, December 21, 13:30 - 15:30
Presentation Time:	Friday, December 21, 13:30 - 15:30
Presentation:	Poster
Topic:	Speech recognition and synthesis:
Paper Title:	SPEAKER SELECTIVE BEAMFORMER WITH KEYWORD MASK ESTIMATION
Authors:	Yusuke Kida; Yahoo Japan Corporation
	Dung Tran; Yahoo Japan Corporation
	Motoi Omachi; Yahoo Japan Corporation
	Toru Taniguchi; Yahoo Japan Corporation
	Yuya Fujita; Yahoo Japan Corporation
Abstract:	This paper addresses the problem of automatic speech recognition (ASR) of a target speaker in background speech. The novelty of our approach is that we focus on a wakeup keyword, which is usually used for activating ASR systems like smart speakers. The proposed method firstly utilizes a DNN-based mask estimator to separate the mixture signal into the keyword signal uttered by the target speaker and the remaining background speech. Then the separated signals are used for calculating a beamforming filter to enhance the subsequent utterances from the target speaker. Experimental evaluations show that the trained DNN-based mask can selectively separate the keyword and background speech from the mixture signal. The effectiveness of the proposed method is also verified with Japanese ASR experiments, and we confirm that the character error rates are significantly improved by the proposed method for both simulated and real recorded test sets.