Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/114612
PIRA download icon_1.1View/Download Full Text
Title: Optimization of discriminative kernels in SVM speaker verification
Authors: Zhang, SX 
Mak, MW 
Issue Date: 2009
Source: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2009, p. 1275-1278
Abstract: An important aspect of SVM-based speaker verification systems is the design of sequence kernels. These kernels should be able to map variable-length observation sequences to fixed-size supervectors that capture the dynamic characteristics of speech utterances and allow speakers to be easily distinguished. Most existing kernels in SVM speaker verification are obtained by assuming a specific form for the similarity function of supervectors. This paper relaxes this assumption to derive a new general kernel. The kernel function is general in that it is a linear combination of any kernels belonging to the reproducing kernel Hilbert space. The combination weights are obtained by optimizing the ability of a discriminant function to separate a target speaker from impostors using either regression analysis or SVM training. The idea was applied to both low- and high-level speaker verification. In both cases, results show that the proposed kernels outperform the state-of-the-art sequence kernels. Further performance enhancement was also observed when the high-level scores were combined with acoustic scores.
Keywords: High-level features
Optimal kernels
Sequence kernels
Speaker verification
SVM
Publisher: International Speech Communication Association
DOI: 10.21437/interspeech.2009-380
Description: Interspeech 2009, Brighton, United Kingdom, 6-10 September 2009
Rights: Copyright © 2009 ISCA
The following publication Zhang, S.-X., Mak, M.-W. (2009) Optimization of discriminative kernels in SVM speaker verification. Proc. Interspeech 2009, 1275-1278 is available at https://doi.org/10.21437/Interspeech.2009-380.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
zhang09c_interspeech.pdf1.26 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.