Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/114603
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Electrical and Electronic Engineering | - |
dc.creator | Wang, X | - |
dc.creator | Kinnunen, T | - |
dc.creator | Lee, KA | - |
dc.creator | Noé, PG | - |
dc.creator | Yamagishi, J | - |
dc.date.accessioned | 2025-08-18T03:02:08Z | - |
dc.date.available | 2025-08-18T03:02:08Z | - |
dc.identifier.uri | http://hdl.handle.net/10397/114603 | - |
dc.description | Interspeech 2024, 1-5 September 2024, Kos, Greece | en_US |
dc.language.iso | en | en_US |
dc.publisher | International Speech Communication Association | en_US |
dc.rights | The following publication Wang, X., Kinnunen, T., Lee, K.A., Noé, P.-G., Yamagishi, J. (2024) Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis. Proc. Interspeech 2024, 1110-1114 is available at https://doi.org/10.21437/Interspeech.2024-422. | en_US |
dc.subject | Anti-spoofing | en_US |
dc.subject | Fusion | en_US |
dc.subject | Log-likelihood ratio | en_US |
dc.subject | Speaker verification | en_US |
dc.subject | Ternary classification | en_US |
dc.title | Revisiting and improving scoring fusion for spoofing-aware speaker verification using compositional data analysis | en_US |
dc.type | Conference Paper | en_US |
dc.identifier.spage | 1110 | - |
dc.identifier.epage | 1114 | - |
dc.identifier.doi | 10.21437/Interspeech.2024-422 | - |
dcterms.abstract | Fusing outputs from automatic speaker verification (ASV) and spoofing countermeasure (CM) is expected to make an integrated system robust to zero-effort imposters and synthesized spoofing attacks. Many score-level fusion methods have been proposed, but many remain heuristic. This paper revisits score-level fusion using tools from decision theory and presents three main findings. First, fusion by summing the ASV and CM scores can be interpreted on the basis of compositional data analysis, and score calibration before fusion is essential. Second, the interpretation leads to an improved fusion method that linearly combines the log-likelihood ratios of ASV and CM. However, as the third finding reveals, this linear combination is inferior to a non-linear one in making optimal decisions. The outcomes of these findings, namely, the score calibration before fusion, improved linear fusion, and better non-linear fusion, were found to be effective on the SASV challenge database. | - |
dcterms.accessRights | open access | en_US |
dcterms.bibliographicCitation | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024, p. 1110-1114 | - |
dcterms.issued | 2024 | - |
dc.identifier.scopus | 2-s2.0-85202625484 | - |
dc.description.validate | 202508 bcch | - |
dc.description.oa | Version of Record | en_US |
dc.identifier.FolderNumber | OA_Others | en_US |
dc.description.fundingSource | Others | en_US |
dc.description.fundingText | JST, PRESTO Grant Number JPMJPR23P9, Japan; the Academy of Finland under Grant 349605, project ”SPEECHFAKES” | en_US |
dc.description.pubStatus | Published | en_US |
dc.description.oaCategory | VoR allowed | en_US |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
wang24l_interspeech.pdf | 666.76 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.