Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106998
PIRA download icon_1.1View/Download Full Text
Title: The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016
Authors: Lee, KA 
Hautamäki, V
Kinnunen, T
Larcher, A
Zhang, C
Nautsch, A
Stafylakis, T
Liu, G
Rouvier, M
Rao, W
Alegre, F
Ma, J
Mak, MW 
Sarkar, AK
Delgado, H
Saeidi, R
Aronowitz, H
Sizov, A
Sun, H
Nguyen, TH
Wang, G
Ma, B
Vestman, V
Sahidullah, M
Halonen, M
Kanervisto, A
Le Lan, G
Bahmaninezhad, F
Isadskiy, S
Rathgeb, C
Busch, C
Tzimiropoulos, G
Qian, Q
Wang, Z
Zhao, Q
Wang, T
Li, H
Xue, J
Zhu, S
Jin, R
Zhao, T
Bousquet, PM
Ajili, M
Kheder, WB
Matrouf, D
Lim, ZH
Xu, C
Xu, H
Xiao, X
Chng, ES
Fauve, B
Sriskandaraja, K
Sethu, V
Lin, WW 
Thomsen, DAL
Tan, ZH
Todisco, M
Evans, N
Li, H
Hansen, JHL
Bonastre, JF
Ambikairajah, E
Issue Date: 2017
Source: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, 20-24 August 2017, p. 1328-1332
Abstract: The 2016 speaker recognition evaluation (SRE’16) is the latest edition in the series of benchmarking events conducted by the National Institute of Standards and Technology (NIST). I4U is a joint entry to SRE’16 as the result from the collaboration and active exchange of information among researchers from sixteen Institutes and Universities across 4 continents. The joint submission and several of its 32 sub-systems were among top-performing systems. A lot of efforts have been devoted to two major challenges, namely, unlabeled training data and dataset shift from Switchboard-Mixer to the new Call My Net dataset. This paper summarizes the lessons learned, presents our shared view from the sixteen research groups on recent advances, major paradigm shift, and common tool chain used in speaker recognition as we have witnessed in SRE’16. More importantly, we look into the intriguing question of fusing a large ensemble of sub-systems and the potential benefit of large-scale collaboration.
Publisher: International Speech Communication Association (ISCA)
ISBN: 978-1-5108-4876-4
DOI: 10.21437/Interspeech.2017-203
Description: 18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, Stockholm, Sweden, 20-24 August 2017
Rights: Copyright © 2017 ISCA
The following publication Lee, K.A., Group, S.I. (2017) The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016. Proc. Interspeech 2017, 1328-1332 is available at https://doi.org/10.21437/Interspeech.2017-203.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
lee17_interspeech.pdf257.51 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

52
Citations as of Apr 14, 2025

Downloads

25
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

13
Citations as of Sep 12, 2025

WEB OF SCIENCETM
Citations

7
Citations as of Oct 24, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.