Technical Program

Paper Detail

Presentation #3
Session:Corpora and Evaluation Methodologies
Location:Kallirhoe Hall
Session Time:Wednesday, December 19, 13:30 - 15:30
Presentation Time:Wednesday, December 19, 13:30 - 15:30
Presentation: Poster
Topic: Spoken language corpora:
Paper Title: JSpeech: A Multi-lingual Conversational Speech Corpus
Authors: Ali Janalizadeh Choobbasti, Mohammad Erfan Gholamian, Amirkabir University of Technology, Iran; Amir Vaheb, Miras Technologies International, Iran; Saeid Safavi, University of Surrey, Iran
Abstract: Speech processing, automatic speech and speaker recognition are the major area of interests in the field of computational linguistics. Research and development of computer and human interaction, forensic technologies and dialogue systems have been the motivating factor behind this interest. In this paper, JSpeech is introduced, a multi-lingual corpus. This corpus contains 1332 hours of conversational speech from 47 different languages. This corpus can be used in a variety of studies, created from 106 public chat group the effect of language variability on the performance of speaker recognition systems and automatic language detection. To this end, we include speaker verification results obtained for this corpus using a state of the art method based on 3D convolutional neural network.