Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/114602
Title: | W-GVKT : within-global-view knowledge transfer for speaker verification | Authors: | Jin, Z Tu, Y Mak, MW |
Issue Date: | 2024 | Source: | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2024, p. 3779-3786 | Abstract: | Contrastive self-supervised learning has played an important role in speaker verification (SV). However, such approaches suffer from false-negative issues. To address this problem, we enhance the non-contrastive DINO framework by enabling knowledge transfer from the teacher network to the student network through diversified versions of global views and call the method Within-Global-View Knowledge Transfer (W-GVKT) DINO. We discovered that given the global view of the entire utterance, creating discrepancies in the student’s output through applying spectral augmentation and feature diversification to the global view can facilitate the transfer of knowledge from the teacher to the student. With negligible computational resource increases, W-GVKT achieves an impressive EER of 4.11% without utilizing speaker labels on Voxceleb1. When combined with the RDNIO framework, W-GVKT achieved an EER of 2.89%. | Keywords: | DINO Knowledge transfer Self-supervised learning Speaker verification |
Publisher: | International Speech Communication Association | DOI: | 10.21437/Interspeech.2024-354 | Description: | Interspeech 2024, 1-5 September 2024, Kos, Greece | Rights: | The following publication Jin, Z., Tu, Y., Mak, M.-W. (2024) W-GVKT: Within-Global-View Knowledge Transfer for Speaker Verification. Proc. Interspeech 2024, 3779-3783 is available at https://doi.org/10.21437/Interspeech.2024-354. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
jin24b_interspeech.pdf | 612.85 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.