Quantifying prediction uncertainties in automatic speaker verification systems

Jing, M; Sethu, V; Ahmed, B; Lee, KA

doi:10.1016/j.csl.2025.101806

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115064

DC Field	Value	Language
dc.contributor	Department of Electrical and Electronic Engineering	-
dc.creator	Jing, M	-
dc.creator	Sethu, V	-
dc.creator	Ahmed, B	-
dc.creator	Lee, KA	-
dc.date.accessioned	2025-09-09T07:40:27Z	-
dc.date.available	2025-09-09T07:40:27Z	-
dc.identifier.issn	0885-2308	-
dc.identifier.uri	http://hdl.handle.net/10397/115064	-
dc.language.iso	en	en_US
dc.publisher	Academic Press	en_US
dc.rights	© 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).	en_US
dc.rights	The following publication Jing, M., Sethu, V., Ahmed, B., & Lee, K. A. (2025). Quantifying prediction uncertainties in automatic speaker verification systems. Computer Speech & Language, 94, 101806 is available at https://doi.org/10.1016/j.csl.2025.101806.	en_US
dc.subject	Bayes-by-backprop	en_US
dc.subject	Hamiltonian Monte-Carlo	en_US
dc.subject	PLDA	en_US
dc.subject	Speaker verification	en_US
dc.subject	Stochastic gradient Langevin dynamics	en_US
dc.subject	Uncertainty	en_US
dc.title	Quantifying prediction uncertainties in automatic speaker verification systems	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.volume	94	-
dc.identifier.doi	10.1016/j.csl.2025.101806	-
dcterms.abstract	For modern automatic speaker verification (ASV) systems, explicitly quantifying the confidence for each prediction strengthens the system’s reliability by indicating in which case the system is with trust. However, current paradigms do not take this into consideration. We thus propose to express confidence in the prediction by quantifying the uncertainty in ASV predictions. This is achieved by developing a novel Bayesian framework to obtain a score distribution for each input. The mean of the distribution is used to derive the decision while the spread of the distribution represents the uncertainty arising from the plausible choices of the model parameters. To capture the plausible choices, we sample the probabilistic linear discriminant analysis (PLDA) back-end model posterior through Hamiltonian Monte-Carlo (HMC) and approximate the embedding model posterior through stochastic Langevin dynamics (SGLD) and Bayes-by-backprop. Given the resulting score distribution, a further quantification and decomposition of the prediction uncertainty are achieved by calculating the score variance, entropy, and mutual information. The quantified uncertainties include the aleatoric uncertainty and epistemic uncertainty (model uncertainty). We evaluate them by observing how they change while varying the amount of training speech, the duration, and the noise level of testing speech. The experiments indicate that the behaviour of those quantified uncertainties reflects the changes we made to the training and testing data, demonstrating the validity of the proposed method as a measure of uncertainty.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	Computer speech and language, Nov. 2025, v. 94, 101806	-
dcterms.isPartOf	Computer speech and language	-
dcterms.issued	2025-11	-
dc.identifier.scopus	2-s2.0-105004203368	-
dc.identifier.eissn	1095-8363	-
dc.identifier.artn	101806	-
dc.description.validate	202509 bcch	-
dc.description.oa	Version of Record	en_US
dc.identifier.FolderNumber	OA_Scopus/WOS	en_US
dc.description.fundingSource	Self-funded	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	CC	en_US
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
1-s2.0-S0885230825000312-main.pdf		5.42 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show simple item record

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Google ScholarTM

Altmetric

Google Scholar^TM