Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107135
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorTu, Y-
dc.creatorMak, MW-
dc.creatorChien, JT-
dc.date.accessioned2024-06-13T01:04:08Z-
dc.date.available2024-06-13T01:04:08Z-
dc.identifier.issn2329-9290-
dc.identifier.urihttp://hdl.handle.net/10397/107135-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Y. Tu, M. -W. Mak and J. -T. Chien, "Variational Domain Adversarial Learning With Mutual Information Maximization for Speaker Verification," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 2013-2024, 2020 is available at https://doi.org/10.1109/TASLP.2020.3004760.en_US
dc.subjectDomain adaptationen_US
dc.subjectDomain adversarial trainingen_US
dc.subjectMutual informationen_US
dc.subjectSpeaker verification (SV)en_US
dc.subjectVariational autoencoderen_US
dc.titleVariational domain adversarial learning with mutual information maximization for speaker verificationen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage2013-
dc.identifier.epage2024-
dc.identifier.volume28-
dc.identifier.doi10.1109/TASLP.2020.3004760-
dcterms.abstractDomain mismatch is a common problem in speaker verification (SV) and often causes performance degradation. For the system relying on the Gaussian PLDA backend to suppress the channel variability, the performance would be further limited if there is no Gaussianity constraint on the learned embeddings. This paper proposes an information-maximized variational domain adversarial neural network (InfoVDANN) that incorporates an InfoVAE into domain adversarial training (DAT) to reduce domain mismatch and simultaneously meet the Gaussianity requirement of the PLDA backend. Specifically, DAT is applied to produce speaker discriminative and domain-invariant features, while the InfoVAE performs variational regularization on the embedded features so that they follow a Gaussian distribution. Another benefit of the InfoVAE is that it avoids posterior collapse in VAEs by preserving the mutual information between the embedded features and the training set so that extra speaker information can be retained in the features. Experiments on both SRE16 and SRE18-CMN2 show that the InfoVDANN outperforms the recent VDANN, which suggests that increasing the mutual information between the embedded features and input features enables the InfoVDANN to extract extra speaker information that is otherwise not possible.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE/ACM transactions on audio, speech, and language processing, 2020, v. 28, p. 2013-2024-
dcterms.isPartOfIEEE/ACM transactions on audio, speech, and language processing-
dcterms.issued2020-
dc.identifier.scopus2-s2.0-85089194439-
dc.identifier.eissn2329-9304-
dc.description.validate202403 bckw-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0204en_US
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextTaiwan MOST Granten_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS26174609en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Lin_Variational_Domain_Adversarial.pdfPre-Published version1.73 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

5
Citations as of Jun 30, 2024

Downloads

5
Citations as of Jun 30, 2024

SCOPUSTM   
Citations

31
Citations as of Jun 21, 2024

WEB OF SCIENCETM
Citations

25
Citations as of Jun 27, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.