Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/95585
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineeringen_US
dc.creatorLin, WWen_US
dc.creatorMak, MWen_US
dc.creatorChien, JTen_US
dc.date.accessioned2022-09-22T06:13:59Z-
dc.date.available2022-09-22T06:13:59Z-
dc.identifier.issn2329-9290en_US
dc.identifier.urihttp://hdl.handle.net/10397/95585-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication W. Lin, M. Mak and J. Chien, "Multisource I-Vectors Domain Adaptation Using Maximum Mean Discrepancy Based Autoencoders," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 12, pp. 2412-2422, Dec. 2018 is available at https://doi.org/10.1109/TASLP.2018.2866707.en_US
dc.subjectDomain adaptationen_US
dc.subjectI-vectorsen_US
dc.subjectMaximum mean discrepancyen_US
dc.subjectSpeaker verificationen_US
dc.titleMultisource i-vectors domain adaptation using maximum mean discrepancy based autoencodersen_US
dc.typeJournal/Magazine Articleen_US
dc.description.otherinformationTitle in author file: Multi-source I-vectors Domain Adaptation using Maximum Mean Discrepancy Based Autoencodersen_US
dc.identifier.spage2412en_US
dc.identifier.epage2422en_US
dc.identifier.volume26en_US
dc.identifier.issue12en_US
dc.identifier.doi10.1109/TASLP.2018.2866707en_US
dcterms.abstractLike many machine learning tasks, the performance of speaker verification (SV) systems degrades when training and test data come from very different distributions. What's more, both training and test data themselves could be composed of heterogeneous subsets. These multisource mismatches are detrimental to SV performance. This paper proposes incorporating maximum mean discrepancy (MMD) into the loss function of autoencoders to reduce these mismatches. MMD is a nonparametric method for measuring the distance between two probability distributions. With a properly chosen kernel, MMD can match up to infinite moments of data distributions. We generalize MMD to measure the discrepancies of multiple distributions.We call the generalized MMDdomainwiseMMD. Using domainwiseMMDas an objective function, we propose two autoencoders, namely nuisance-attribute autoencoder (NAE) and domain-invariant autoencoder (DAE), for multisource i-vector adaptation. NAE encodes the features that cause most of the multisource mismatch measured by domainwise MMD. DAE directly encodes the features that minimize the multisource mismatch. Using these MMD-based autoencoders as a preprocessing step for PLDA training, we achieve a relative improvement of 19.2% EER on the NIST 2016 SRE compared to PLDA without adaptation. We also found that MMD-based autoencoders are more robust to unseen domains. In the domain robustness experiments, MMD-based autoencoders show 6.8% and 5.2% improvements over IDVC on female and male Cantonese speakers, respectively.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE/ACM transactions on audio, speech, and language processing, Dec. 2018, v. 26, no. 12, 8445620, p. 2412-2422en_US
dcterms.isPartOfIEEE/ACM transactions on audio, speech, and language processingen_US
dcterms.issued2018-12-
dc.identifier.scopus2-s2.0-85052690121-
dc.identifier.eissn2329-9304en_US
dc.identifier.artn8445620en_US
dc.description.validate202209_bcwwen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0437-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS20150384-
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Lin_Multisource_I-Vectors_Domain.pdfPre-Published version1.28 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

55
Last Week
0
Last month
Citations as of Sep 22, 2024

Downloads

118
Citations as of Sep 22, 2024

SCOPUSTM   
Citations

49
Citations as of Sep 26, 2024

WEB OF SCIENCETM
Citations

42
Citations as of Sep 26, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.