Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/106921
Title: | Variational domain adversarial learning for speaker verification | Authors: | Tu, Y Mak, MW Chien, JT |
Issue Date: | 2019 | Source: | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Graz, Austria, 15-19 September 2019, p. 4315-4319 | Abstract: | Domain mismatch refers to the problem in which the distribution of training data differs from that of the test data. This paper proposes a variational domain adversarial neural network (VDANN), which consists of a variational autoencoder (VAE) and a domain adversarial neural network (DANN), to reduce domain mismatch. The DANN part aims to retain speaker identity information and learn a feature space that is robust against domain mismatch, while the VAE part is to impose variational regularization on the learned features so that they follow a Gaussian distribution. Thus, the representation produced by VDANN is not only speaker discriminative and domain-invariant but also Gaussian distributed, which is essential for the standard PLDA backend. Experiments on both SRE16 and SRE18-CMN2 show that VDANN outperforms the Kaldi baseline and the standard DANN. The results also suggest that VAE regularization is effective for domain adaptation. | Publisher: | International Speech Communication Association (ISCA) | ISBN: | 978-1-5108-9683-3 | DOI: | 10.21437/Interspeech.2019-2168 | Description: | 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, INTERSPEECH 2019, Graz, Austria, 15-19 September 2019 | Rights: | Copyright © 2019 ISCA The following publication Tu, Y., Mak, M.-W., Chien, J.-T. (2019) Variational Domain Adversarial Learning for Speaker Verification. Proc. Interspeech 2019, 4315-4319 is available at https://doi.org/10.21437/Interspeech.2019-2168. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
tu19_interspeech.pdf | 769.94 kB | Adobe PDF | View/Open |
Page views
35
Citations as of Feb 2, 2025
Downloads
8
Citations as of Feb 2, 2025
SCOPUSTM
Citations
40
Citations as of Feb 20, 2025
WEB OF SCIENCETM
Citations
31
Citations as of Feb 20, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.