Unsupervised domain adaptation for gender-aware PLDA mixture models

Li, L; Mak, MW

doi:10.1109/ICASSP.2018.8461943

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107207

Title:	Unsupervised domain adaptation for gender-aware PLDA mixture models
Authors:	Li, L Mak, MW
Issue Date:	2018
Source:	In Proceedings of 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 15-20 April 2018, Calgary, AB, Canada, p. 5269-5273
Abstract:	Probabilistic linear discriminant analysis (PLDA) is a state-of-art back-end for i-vector based speaker verification. However, this backend is still problematic when (1) the model is deployed to new environment (in-domain) that is very different from the training one (out-of-domain) and (2) there are insufficient labeled data from the new environment. To address these problems, this paper proposes using out-of-domain training data to pre-train a PLDA mixture model and applying the mixture model on the in-domain training data to compute a pairwise score matrix for spectral clustering. The hypothesized speaker labels produced by spectral clustering are then used for re-training the mixture model to fit the new environment. To refine the mixture model, the spectral clustering and re-training processes are repeated a number of times. To make the mixture model amenable to both genders, a deep neural network (DNN) is trained to produce gender posteriors given an i-vector. The gender posteriors then replace the posterior probabilities of the indicator variables in the PLDA mixture model. Evaluations based on NIST 2016 SRE suggest that at the end of the iterative re-training, the PLDA mixture model becomes fully adapted to the new domain. Results also show that the PLDA scores can be readily incorporated into spectral clustering, resulting in high quality speaker clusters that could not be possibly achieved by agglomerative hierarchical clustering.
Keywords:	DNN-driven mixture of PLDA Domain adaptation I-vectors Speaker verification Spectral clustering
Publisher:	Institute of Electrical and Electronics Engineers
ISBN:	978-1-5386-4658-8 (Electronic) 978-1-5386-4659-5 (Print on Demand(PoD))
DOI:	10.1109/ICASSP.2018.8461943
Description:	2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 15-20 April 2018, Calgary, AB, Canada
Rights:	©2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication L. Li and M. -W. Mak, "Unsupervised Domain Adaptation for Gender-Aware PLDA Mixture Models," 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 2018, pp. 5269-5273 is available at https://doi.org/10.1109/ICASSP.2018.8461943.
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
Li_Unsupervised_Domain_Adaptation.pdf	Pre-Published version	284.64 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Final Accepted Manuscript

Access

View full-text via PolyU eLinks

Show full item record

Page views

59

Citations as of May 11, 2025

Downloads

9

Citations as of May 11, 2025

SCOPUS^TM
Citations

7

Citations as of Jun 5, 2025

Google Scholar^TM

Check