Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/95586
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineering-
dc.creatorLi, Nen_US
dc.creatorMak, MWen_US
dc.creatorLin, WWen_US
dc.creatorChien, JTen_US
dc.date.accessioned2022-09-22T06:13:59Z-
dc.date.available2022-09-22T06:13:59Z-
dc.identifier.issn0885-2308en_US
dc.identifier.urihttp://hdl.handle.net/10397/95586-
dc.language.isoenen_US
dc.publisherAcademic Pressen_US
dc.rights© 2017 Elsevier Ltd. All rights reserved.en_US
dc.rights© 2017. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/.en_US
dc.rightsThe following publication Li, N., Mak, M. W., Lin, W. W., & Chien, J. T. (2017). Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification. Computer Speech & Language, 45, 83-103 is available at https://doi.org/10.1016/j.csl.2017.04.001.en_US
dc.subjectDuration variationen_US
dc.subjectI-vectoren_US
dc.subjectPLDAen_US
dc.subjectSNR mismatchen_US
dc.subjectSpeaker verificationen_US
dc.subjectVariational Bayesen_US
dc.titleDiscriminative subspace modeling of SNR and duration variabilities for robust speaker verificationen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage83en_US
dc.identifier.epage103en_US
dc.identifier.volume45en_US
dc.identifier.doi10.1016/j.csl.2017.04.001en_US
dcterms.abstractAlthough i-vectors together with probabilistic LDA (PLDA) have achieved a great success in speaker verification, how to suppress the undesirable effects caused by the variability in utterance length and background noise level is still a challenge. This paper aims to improve the robustness of i-vector based speaker verification systems by compensating for the utterance-length variability and noise-level variability. Inspired by the recent findings that noise-level variability can be modeled by a signal-to-noise ratio (SNR) subspace and that duration variability can be modeled as additive noise in the i-vector space, we propose to add an SNR factor and a duration factor to the PLDA model. In this framework, we assume that i-vectors derived from utterances with comparable durations share similar duration-specific information and that i-vectors extracted from utterances within a narrow SNR range have similar SNR-specific information. Based on these assumptions, an i-vector can be represented as a linear combination of four components: speaker, SNR, duration, and channel. A variational Bayes algorithm is developed to infer this latent variable model via a discriminative subspace training procedure. In the testing stage, different variabilities are compensated for when computing the likelihood ratio. Experiments on Common Conditions 1 and 4 in NIST 2012 SRE show that the proposed model outperforms the conventional PLDA and SNR-invariant PLDA. Results also show that the proposed model performs better than the uncertainty-propagation PLDA (UP-PLDA) for long test utterances.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationComputer speech and language, Sept. 2017, v. 45, p. 83-103en_US
dcterms.isPartOfComputer speech and languageen_US
dcterms.issued2017-09-
dc.identifier.scopus2-s2.0-85018305015-
dc.description.validate202209_bcww-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0662-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS6741064-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Mak_Discriminative_Subspace_Modeling.pdfPre-Published version2.34 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

34
Last Week
0
Last month
Citations as of Oct 13, 2024

Downloads

56
Citations as of Oct 13, 2024

SCOPUSTM   
Citations

7
Citations as of Oct 17, 2024

WEB OF SCIENCETM
Citations

5
Citations as of Oct 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.