Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107207
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorLi, Len_US
dc.creatorMak, MWen_US
dc.date.accessioned2024-06-13T01:04:35Z-
dc.date.available2024-06-13T01:04:35Z-
dc.identifier.isbn978-1-5386-4658-8 (Electronic)en_US
dc.identifier.isbn978-1-5386-4659-5 (Print on Demand(PoD))en_US
dc.identifier.urihttp://hdl.handle.net/10397/107207-
dc.description2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 15-20 April 2018, Calgary, AB, Canadaen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights©2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication L. Li and M. -W. Mak, "Unsupervised Domain Adaptation for Gender-Aware PLDA Mixture Models," 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 2018, pp. 5269-5273 is available at https://doi.org/10.1109/ICASSP.2018.8461943.en_US
dc.subjectDNN-driven mixture of PLDAen_US
dc.subjectDomain adaptationen_US
dc.subjectI-vectorsen_US
dc.subjectSpeaker verificationen_US
dc.subjectSpectral clusteringen_US
dc.titleUnsupervised domain adaptation for gender-aware PLDA mixture modelsen_US
dc.typeConference Paperen_US
dc.identifier.spage5269en_US
dc.identifier.epage5273en_US
dc.identifier.doi10.1109/ICASSP.2018.8461943en_US
dcterms.abstractProbabilistic linear discriminant analysis (PLDA) is a state-of-art back-end for i-vector based speaker verification. However, this backend is still problematic when (1) the model is deployed to new environment (in-domain) that is very different from the training one (out-of-domain) and (2) there are insufficient labeled data from the new environment. To address these problems, this paper proposes using out-of-domain training data to pre-train a PLDA mixture model and applying the mixture model on the in-domain training data to compute a pairwise score matrix for spectral clustering. The hypothesized speaker labels produced by spectral clustering are then used for re-training the mixture model to fit the new environment. To refine the mixture model, the spectral clustering and re-training processes are repeated a number of times. To make the mixture model amenable to both genders, a deep neural network (DNN) is trained to produce gender posteriors given an i-vector. The gender posteriors then replace the posterior probabilities of the indicator variables in the PLDA mixture model. Evaluations based on NIST 2016 SRE suggest that at the end of the iterative re-training, the PLDA mixture model becomes fully adapted to the new domain. Results also show that the PLDA scores can be readily incorporated into spectral clustering, resulting in high quality speaker clusters that could not be possibly achieved by agglomerative hierarchical clustering.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 15-20 April 2018, Calgary, AB, Canada, p. 5269-5273en_US
dcterms.issued2018-
dc.identifier.scopus2-s2.0-85054288121-
dc.relation.conferenceInternational Conference on Acoustics, Speech, and Signal Processing [ICASSP]en_US
dc.description.validate202404 bckwen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0550-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS20150534-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Li_Unsupervised_Domain_Adaptation.pdfPre-Published version284.64 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

1
Citations as of Jun 30, 2024

SCOPUSTM   
Citations

7
Citations as of Jun 21, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.