Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107270
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorLi, Nen_US
dc.creatorMak, MWen_US
dc.date.accessioned2024-06-13T01:05:01Z-
dc.date.available2024-06-13T01:05:01Z-
dc.identifier.isbn978-1-4799-9988-0 (Electronic)en_US
dc.identifier.isbn978-1-4799-9987-3 (USB)en_US
dc.identifier.urihttp://hdl.handle.net/10397/107270-
dc.description2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 March 2016, Shanghai, Chinaen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights©2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication N. Li and M. -W. Mak, "SNR-invariant PLDA with multiple speaker subspaces," 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 5565-5569 is available at https://doi.org/10.1109/ICASSP.2016.7472742.en_US
dc.subjectI-vectorsen_US
dc.subjectSNR subspacesen_US
dc.subjectSNR-invariant PLDAen_US
dc.subjectSpeaker subspacesen_US
dc.subjectSpeaker verificationen_US
dc.titleSNR-invariant PLDA with multiple speaker subspacesen_US
dc.typeConference Paperen_US
dc.identifier.spage5565en_US
dc.identifier.epage5569en_US
dc.identifier.doi10.1109/ICASSP.2016.7472742en_US
dcterms.abstractTo deal with the mismatch between the enrollment and test utterances caused by noise with different signal-to-noise ratios (SNR), we have recently proposed an SNR-invariant PLDA model for robust speaker verification. In the model, SNR-specific information were separated from speaker-specific information through marginalizing out the SNR factors during the scoring process. However, this modeling approach assumes that speaker variabilities can be captured by a single speaker subspace regardless of the noise level of the utterances. We will show in this paper that i-vectors extracted from utterances with different noise levels will shift to different regions of the i-vector space and that i-vectors extracted from utterances having similar SNR tend to cluster together. In view of this observation, we propose introducing multiple speaker subspaces to the SNR-invariance PLDA model and use multiple covariance matrices to represent SNR-dependent channel variability. Through NIST 2012 SRE, this paper demonstrates that this finer and more precise modeling of speaker and SNR variabilities leads to better performance when compared with the conventional PLDA and SNR-invariant PLDA.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 March 2016, Shanghai, China, p. 5565-5569en_US
dcterms.issued2016-
dc.identifier.scopus2-s2.0-84973367336-
dc.relation.conferenceInternational Conference on Acoustics, Speech, and Signal Processing [ICASSP]en_US
dc.description.validate202404 bckwen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0863-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS9574627-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Mak_Snr-Invariant_Plda_Multiple.pdfPre-Published version938.8 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

7
Citations as of Jun 30, 2024

SCOPUSTM   
Citations

7
Citations as of Jun 21, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.