Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106921
PIRA download icon_1.1View/Download Full Text
Title: Variational domain adversarial learning for speaker verification
Authors: Tu, Y 
Mak, MW 
Chien, JT
Issue Date: 2019
Source: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Graz, Austria, 15-19 September 2019, p. 4315-4319
Abstract: Domain mismatch refers to the problem in which the distribution of training data differs from that of the test data. This paper proposes a variational domain adversarial neural network (VDANN), which consists of a variational autoencoder (VAE) and a domain adversarial neural network (DANN), to reduce domain mismatch. The DANN part aims to retain speaker identity information and learn a feature space that is robust against domain mismatch, while the VAE part is to impose variational regularization on the learned features so that they follow a Gaussian distribution. Thus, the representation produced by VDANN is not only speaker discriminative and domain-invariant but also Gaussian distributed, which is essential for the standard PLDA backend. Experiments on both SRE16 and SRE18-CMN2 show that VDANN outperforms the Kaldi baseline and the standard DANN. The results also suggest that VAE regularization is effective for domain adaptation.
Publisher: International Speech Communication Association (ISCA)
ISBN: 978-1-5108-9683-3
DOI: 10.21437/Interspeech.2019-2168
Description: 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, Graz, Austria, 15-19 September 2019
Rights: Copyright © 2019 ISCA
The following publication Tu, Y., Mak, M.-W., Chien, J.-T. (2019) Variational Domain Adversarial Learning for Speaker Verification. Proc. Interspeech 2019, 4315-4319 is available at https://doi.org/10.21437/Interspeech.2019-2168.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
tu19_interspeech.pdf769.94 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

35
Citations as of Feb 2, 2025

Downloads

8
Citations as of Feb 2, 2025

SCOPUSTM   
Citations

40
Citations as of Feb 20, 2025

WEB OF SCIENCETM
Citations

31
Citations as of Feb 20, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.