Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107247
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorLi, Nen_US
dc.creatorMak, MWen_US
dc.creatorChien, JTen_US
dc.date.accessioned2024-06-13T01:04:52Z-
dc.date.available2024-06-13T01:04:52Z-
dc.identifier.isbn978-1-5090-4903-5 (Electronic)en_US
dc.identifier.isbn978-1-5090-4904-2 (Print on Demand(PoD))en_US
dc.identifier.urihttp://hdl.handle.net/10397/107247-
dc.description2016 IEEE Spoken Language Technology Workshop (SLT), 13-16 December 2016, San Diego, CA, USAen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights©2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication N. Li, M. -W. Mak and J. -T. Chien, "Deep neural network driven mixture of PLDA for robust i-vector speaker verification," 2016 IEEE Spoken Language Technology Workshop (SLT), San Diego, CA, USA, 2016, pp. 186-191 is available at https://doi.org/10.1109/SLT.2016.7846263.en_US
dc.subjectDeep neural networksen_US
dc.subjectI-vectoren_US
dc.subjectMixture of PLDAen_US
dc.subjectSNR mismatchen_US
dc.subjectSpeaker verificationen_US
dc.titleDeep neural network driven mixture of PLDA for robust i-vector speaker verificationen_US
dc.typeConference Paperen_US
dc.identifier.spage186en_US
dc.identifier.epage191en_US
dc.identifier.doi10.1109/SLT.2016.7846263en_US
dcterms.abstractIn speaker recognition, the mismatch between the enrollment and test utterances due to noise with different signal-to-noise ratios (SNRs) is a great challenge. Based on the observation that noise-level variability causes the i-vectors to form heterogeneous clusters, this paper proposes using an SNR-aware deep neural network (DNN) to guide the training of PLDA mixture models. Specifically, given an i-vector, the SNR posterior probabilities produced by the DNN are used as the posteriors of indicator variables of the mixture model. As a result, the proposed model provides a more reasonable soft division of the i-vector space compared to the conventional mixture of PLDA. During verification, given a test trial, the marginal likelihoods from individual PLDA models are linearly combined by the posterior probabilities of SNR levels computed by the DNN. Experimental results for SNR mismatch tasks based on NIST 2012 SRE suggest that the proposed model is more effective than PLDA and conventional mixture of PLDA for handling heterogeneous corpora.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of 2016 IEEE Spoken Language Technology Workshop (SLT), 13-16 December 2016, San Diego, CA, USAen_US
dcterms.issued2016-
dc.identifier.scopus2-s2.0-85016063901-
dc.relation.conferenceIEEE Spoken Language Technology Workshop [SLT]-
dc.description.validate202403 bckw-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0741-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS9591755-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Mak_Deep_Neural_Network.pdfPre-Published version375.59 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

7
Citations as of Jun 30, 2024

SCOPUSTM   
Citations

10
Citations as of Jun 21, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.