Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/114612
Title: | Optimization of discriminative kernels in SVM speaker verification | Authors: | Zhang, SX Mak, MW |
Issue Date: | 2009 | Source: | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2009, p. 1275-1278 | Abstract: | An important aspect of SVM-based speaker verification systems is the design of sequence kernels. These kernels should be able to map variable-length observation sequences to fixed-size supervectors that capture the dynamic characteristics of speech utterances and allow speakers to be easily distinguished. Most existing kernels in SVM speaker verification are obtained by assuming a specific form for the similarity function of supervectors. This paper relaxes this assumption to derive a new general kernel. The kernel function is general in that it is a linear combination of any kernels belonging to the reproducing kernel Hilbert space. The combination weights are obtained by optimizing the ability of a discriminant function to separate a target speaker from impostors using either regression analysis or SVM training. The idea was applied to both low- and high-level speaker verification. In both cases, results show that the proposed kernels outperform the state-of-the-art sequence kernels. Further performance enhancement was also observed when the high-level scores were combined with acoustic scores. | Keywords: | High-level features Optimal kernels Sequence kernels Speaker verification SVM |
Publisher: | International Speech Communication Association | DOI: | 10.21437/interspeech.2009-380 | Description: | Interspeech 2009, Brighton, United Kingdom, 6-10 September 2009 | Rights: | Copyright © 2009 ISCA The following publication Zhang, S.-X., Mak, M.-W. (2009) Optimization of discriminative kernels in SVM speaker verification. Proc. Interspeech 2009, 1275-1278 is available at https://doi.org/10.21437/Interspeech.2009-380. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
zhang09c_interspeech.pdf | 1.26 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.