Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107243
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorTan, Z-
dc.creatorZhu, Y-
dc.creatorMak, MW-
dc.creatorMak, BKW-
dc.date.accessioned2024-06-13T01:04:51Z-
dc.date.available2024-06-13T01:04:51Z-
dc.identifier.isbn978-1-5090-4294-4 (Electronic)-
dc.identifier.isbn978-1-5090-4295-1 (Print on Demand(PoD))-
dc.identifier.urihttp://hdl.handle.net/10397/107243-
dc.description2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), 17-20 October 2016, Tianjin, Chinaen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights©2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Z. Tan, Y. Zhu, M. -W. Mak and B. K. -W. Mak, "Senone I-vectors for robust speaker verification," 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), Tianjin, China, 2016 is available at https://doi.org/10.1109/ISCSLP.2016.7918462.en_US
dc.subjectDeep learningen_US
dc.subjectDenoising autoencodersen_US
dc.subjectI-vectorsen_US
dc.subjectSenone posteriorsen_US
dc.subjectSpeaker verificationen_US
dc.titleSenone i-vectors for robust speaker verificationen_US
dc.typeConference Paperen_US
dc.identifier.doi10.1109/ISCSLP.2016.7918462-
dcterms.abstractRecent research has shown that using senone posteriors for i-vector extraction can achieve outstanding performance. In this paper, we extend this idea to robust speaker verification by constructing a deep neural network (DNN) comprising a deep belief network (DBN) stacked on top of a denoising autoencoder (DAE). The proposed method addresses noise robustness in two perspectives: (1) denoising the MFCC vectors through the DAE and (2) extracting noise robust bottleneck (BN) features and senone posteriors from the DBN for total-variability matrix training and i-vector extraction. The DAE comprises several layers of restricted Boltzmann machines (RBM), which are trained to minimize the mean squared error between the denoised and clean MFCCs. After training the DAE, three layers of RBMs are put on top of it to form the DNN. The whole network is fine-tuned by backpropagation to minimize the cross-entropy between the senone labels and network outputs. This architecture allows us to extract BN features and estimates senone posteriors given noisy MFCCs as input, resulting in robust BN-based senone i-vectors. Results on NIST 2012 SRE show that these senone i-vectors outperform the conventional i-vectors and the BN-based i-vectors in which the posteriors are obtained from a GMM.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), 17-20 October 2016, Tianjin, China-
dcterms.issued2016-
dc.identifier.scopus2-s2.0-85020233637-
dc.relation.conferenceInternational Symposium on Chinese Spoken Language Processing [ISCSLP]-
dc.description.validate202404 bckw-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0715en_US
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS9597275en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Tan_Senone_I-Vectors_Robust.pdfPre-Published version822.64 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

7
Citations as of Jun 30, 2024

SCOPUSTM   
Citations

5
Citations as of Jun 21, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.