Presentation # | 3 |
Session: | Corpora and Evaluation Methodologies |
Session Time: | Wednesday, December 19, 13:30 - 15:30 |
Presentation Time: | Wednesday, December 19, 13:30 - 15:30 |
Presentation: |
Poster
|
Topic: |
Spoken language corpora: |
Paper Title: |
JSpeech: A Multi-lingual Conversational Speech Corpus |
Authors: |
Ali Janalizadeh Choobbasti; Amirkabir University of Technology | | |
| Mohammad Erfan Gholamian; Amirkabir University of Technology | | |
| Amir Vaheb; Miras Technologies International | | |
| Saeid Safavi; University of Surrey | | |
Abstract: |
Speech processing, automatic speech and speaker recognition are the major area of interests in the field of computational linguistics. Research and development of computer and human interaction, forensic technologies and dialogue systems have been the motivating factor behind this interest. In this paper, JSpeech is introduced, a multi-lingual corpus. This corpus contains 1332 hours of conversational speech from 47 different languages. This corpus can be used in a variety of studies, created from 106 public chat group the effect of language variability on the performance of speaker recognition systems and automatic language detection. To this end, we include speaker verification results obtained for this corpus using a state of the art method based on 3D convolutional neural network. |