| PT3: Speaker and Language Recognition |
| Session Type: Poster |
| Poster Time: Tuesday, December 9, 16:00 - 17:30 |
| Location: Emerald 4-6 |
| Session Chair: Sachin Kajarekar, Apple |
| |
| PT3.101: SPEAKER-INDEPENDENT DETECTION OF CHILD-DIRECTED SPEECH |
| Sebastian Schuster; Stanford University, United States |
| Stephanie Pancoast; Stanford University, United States |
| Milind Ganjoo; Stanford University, United States |
| Michael C. Frank; Stanford University, United States |
| Dan Jurafsky; Stanford University, United States |
| |
| PT3.102: SPOKEN LANGUAGE MISMATCH IN SPEAKER VERIFICATION: AN INVESTIGATION WITH NIST-SRE AND CRSS BI-LING CORPORA |
| Abhinav Misra; University of Texas at Dallas, United States |
| John H. L. Hansen; University of Texas at Dallas, United States |
| |
| PT3.103: IMPROVING SPEAKER RECOGNITION PERFORMANCE IN THE DOMAIN ADAPTATION CHALLENGE USING DEEP NEURAL NETWORKS |
| Daniel Garcia-Romero; Johns Hopkins University, United States |
| Xiaohui Zhang; Johns Hopkins University, United States |
| Alan McCree; Johns Hopkins University, United States |
| Daniel Povey; Johns Hopkins University, United States |
| |
| PT3.104: TRAINING CANDIDATE SELECTION FOR EFFECTIVE REJECTION IN OPEN-SET LANGUAGE IDENTIFICATION |
| Qian Zhang; University of Texas at Dallas, United States |
| John H. L. Hansen; University of Texas at Dallas, United States |
| |
| PT3.105: CONSTRAINED SPEAKER DIARIZATION OF TV SERIES BASED ON VISUAL PATTERNS |
| Xavier Bost; University of Avignon, France |
| Georges Linarès; University of Avignon, France |
| |
| PT3.106: EXPLOITING MAGNITUDE AND PHASE SPECTRAL INFORMATION FOR CONVERTED SPEECH DETECTION |
| Maria Joana Correia; INESC-ID/Spoken Language Systems Laboratory, Portugal |
| Alberto Abad; INESC-ID/Spoken Language Systems Laboratory, Portugal |
| Isabel Trancoso; INESC-ID/Spoken Language Systems Laboratory, Portugal |
| |
| PT3.107: ARTIFICIAL NEURAL NETWORK FEATURES FOR SPEAKER DIARIZATION |
| Sree Harsha Yella; Idiap Research Institute, Switzerland |
| Andreas Stolcke; Microsoft Research, United States |
| Malcolm Slaney; Microsoft Research, United States |
| |
| PT3.108: DISCRIMINATION BETWEEN SINGING AND SPEECH IN REAL-WORLD AUDIO |
| Brian Thompson; MIT Lincoln Laboratory, United States |
| |
| PT3.109: SPEAKER DIARIZATION WITH PLDA I-VECTOR SCORING AND UNSUPERVISED CALIBRATION |
| Gregory Sell; Johns Hopkins University, United States |
| Daniel Garcia-Romero; Johns Hopkins University, United States |
| |
| PT3.110: UTILIZATION OF UNLABELED DEVELOPMENT DATA FOR SPEAKER VERIFICATION |
| Gang Liu; University of Texas at Dallas, United States |
| Chengzhu Yu; University of Texas at Dallas, United States |
| Navid Shokouhi; University of Texas at Dallas, United States |
| Abhinav Misra; University of Texas at Dallas, United States |
| Hua Xing; University of Texas at Dallas, United States |
| John H. L. Hansen; University of Texas at Dallas, United States |
| |