Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/114612
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorZhang, SX-
dc.creatorMak, MW-
dc.date.accessioned2025-08-18T03:02:14Z-
dc.date.available2025-08-18T03:02:14Z-
dc.identifier.urihttp://hdl.handle.net/10397/114612-
dc.descriptionInterspeech 2009, Brighton, United Kingdom, 6-10 September 2009en_US
dc.language.isoenen_US
dc.publisherInternational Speech Communication Associationen_US
dc.rightsCopyright © 2009 ISCAen_US
dc.rightsThe following publication Zhang, S.-X., Mak, M.-W. (2009) Optimization of discriminative kernels in SVM speaker verification. Proc. Interspeech 2009, 1275-1278 is available at https://doi.org/10.21437/Interspeech.2009-380.en_US
dc.subjectHigh-level featuresen_US
dc.subjectOptimal kernelsen_US
dc.subjectSequence kernelsen_US
dc.subjectSpeaker verificationen_US
dc.subjectSVMen_US
dc.titleOptimization of discriminative kernels in SVM speaker verificationen_US
dc.typeConference Paperen_US
dc.identifier.spage1275-
dc.identifier.epage1278-
dc.identifier.doi10.21437/interspeech.2009-380-
dcterms.abstractAn important aspect of SVM-based speaker verification systems is the design of sequence kernels. These kernels should be able to map variable-length observation sequences to fixed-size supervectors that capture the dynamic characteristics of speech utterances and allow speakers to be easily distinguished. Most existing kernels in SVM speaker verification are obtained by assuming a specific form for the similarity function of supervectors. This paper relaxes this assumption to derive a new general kernel. The kernel function is general in that it is a linear combination of any kernels belonging to the reproducing kernel Hilbert space. The combination weights are obtained by optimizing the ability of a discriminant function to separate a target speaker from impostors using either regression analysis or SVM training. The idea was applied to both low- and high-level speaker verification. In both cases, results show that the proposed kernels outperform the state-of-the-art sequence kernels. Further performance enhancement was also observed when the high-level scores were combined with acoustic scores.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2009, p. 1275-1278-
dcterms.issued2009-
dc.identifier.scopus2-s2.0-70450175241-
dc.description.validate202508 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Othersen_US
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextCenter for Multimedia Signal Processing, The Hong Polytechnic University (1-BB9W)en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryVoR alloweden_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
zhang09c_interspeech.pdf1.26 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.