Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/77467
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineeringen_US
dc.creatorTan, Zen_US
dc.creatorMak, MWen_US
dc.creatorMak, BKWen_US
dc.creatorZhu, Yen_US
dc.date.accessioned2018-08-28T01:32:33Z-
dc.date.available2018-08-28T01:32:33Z-
dc.identifier.issn2329-9290en_US
dc.identifier.urihttp://hdl.handle.net/10397/77467-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Z. Tan, M. Mak, B. K. Mak and Y. Zhu, "Denoised Senone I-Vectors for Robust Speaker Verification," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 4, pp. 820-830, April 2018 is available at https://doi.org/10.1109/TASLP.2018.2796843.en_US
dc.subjectDeep learningen_US
dc.subjectDenoising autoencodersen_US
dc.subjectI-vectorsen_US
dc.subjectNoise robustnessen_US
dc.subjectPhonetically discriminative featuresen_US
dc.subjectSenone posteriorsen_US
dc.subjectSpeaker verificationen_US
dc.titleDenoised senone I-Vectors for robust speaker verificationen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage820en_US
dc.identifier.epage830en_US
dc.identifier.volume26en_US
dc.identifier.issue4en_US
dc.identifier.doi10.1109/TASLP.2018.2796843en_US
dcterms.abstractRecently, it has been shown that senone i-vectors, whose posteriors are produced by senone deep neural networks (DNNs), outperform the conventional Gaussian mixture model (GMM) i-vectors in both speaker and language recognition tasks. The success of senone i-vectors relies on the capability of the DNN to incorporate phonetic information into the i-vector extraction process. In this paper, we argue that to apply senone i-vectors in noisy environments, it is important to robustify the phonetically discriminative acoustic features and senone posteriors estimated by the DNN. To this end, we propose a deep architecture formed by stacking a deep belief network on top of a denoising autoencoder (DAE). After backpropagation fine-tuning, the network, referred to as denoising autoencoder-deep neural network (DAE-DNN), facilitates the extraction of robust phonetically-discriminitive bottleneck (BN) features and senone posteriors for i-vector extraction. We refer to the resulting i-vectors as denoised BN-based senone i-vectors. Results on NIST 2012 SRE show that senone i-vectors outperform the conventional GMM i-vectors. More interestingly, the BN features are not only phonetically discriminative, results suggest that they also contain sufficient speaker information to produce BN-based senone i-vectors that outperform the conventional senone i-vectors. This work also shows that DAE training is more beneficial to BN feature extraction than senone posterior estimation.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE/ACM transactions on audio, speech, and language processing, Apr. 2018, v. 26, no. 4, 8269399, p. 820-830en_US
dcterms.isPartOfIEEE/ACM transactions on audio, speech, and language processingen_US
dcterms.issued2018-04-
dc.identifier.isiWOS:000424636600010-
dc.identifier.scopus2-s2.0-85041007502-
dc.identifier.eissn2329-9304en_US
dc.identifier.artn8269399en_US
dc.identifier.rosgroupid2017004678-
dc.description.ros2017-2018 > Academic research: refereed > Publication in refereed journalen_US
dc.description.validate201808 bcrcen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0559-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS6814236-
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Tan_Denoised_Senone_I-Vectors.pdfPre-Published version1.36 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

85
Last Week
0
Last month
Citations as of Apr 14, 2024

Downloads

33
Citations as of Apr 14, 2024

SCOPUSTM   
Citations

9
Last Week
0
Last month
Citations as of Apr 12, 2024

WEB OF SCIENCETM
Citations

7
Last Week
0
Last month
Citations as of Apr 18, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.