Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106995
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorTan, Zen_US
dc.creatorMak, MWen_US
dc.date.accessioned2024-06-07T00:59:30Z-
dc.date.available2024-06-07T00:59:30Z-
dc.identifier.isbn978-1-5108-4876-4en_US
dc.identifier.urihttp://hdl.handle.net/10397/106995-
dc.description18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, Stockholm, Sweden, 20-24 August 2017en_US
dc.language.isoenen_US
dc.publisherInternational Speech Communication Association (ISCA)en_US
dc.rightsCopyright © 2017 ISCAen_US
dc.rightsThe following publication Tan, Z., Mak, M.-W. (2017) i-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification. Proc. Interspeech 2017, 1562-1566 is available at https://doi.org/10.21437/Interspeech.2017-656.en_US
dc.titleI-Vector DNN scoring and calibration for noise robust speaker verificationen_US
dc.typeConference Paperen_US
dc.identifier.spage1562en_US
dc.identifier.epage1566en_US
dc.identifier.doi10.21437/Interspeech.2017-656en_US
dcterms.abstractThis paper proposes applying multi-task learning to train deep neural networks (DNNs) for calibrating the PLDA scores of speaker verification systems under noisy environments. To facilitate the DNNs to learn the main task (calibration), several auxiliary tasks were introduced, including the prediction of SNR and duration from i-vectors and classifying whether an i-vector pair belongs to the same speaker or not. The possibility of replacing the PLDA model by a DNN during the scoring stage is also explored. Evaluations on noise contaminated speech suggest that the auxiliary tasks are important for the DNNs to learn the main calibration task and that the uncalibrated PLDA scores are an essential input to the DNNs. Without this input, the DNNs can only predict the score shifts accurately, suggesting that the PLDA model is indispensable.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, 20-24 August 2017, p. 1562-1566en_US
dcterms.issued2017-
dc.identifier.scopus2-s2.0-85039167490-
dc.relation.conferenceInternational Speech Communication Association [Interspeech]en_US
dc.description.validate202405 bcchen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberEIE-0773-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS6912605-
dc.description.oaCategoryVoR alloweden_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
tan17_interspeech.pdf1.25 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

83
Last Week
4
Last month
Citations as of Sep 21, 2025

Downloads

29
Citations as of Sep 21, 2025

SCOPUSTM   
Citations

3
Citations as of Sep 26, 2025

WEB OF SCIENCETM
Citations

2
Citations as of Sep 25, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.