Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/114613
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorZhang, SX-
dc.creatorMak, MW-
dc.date.accessioned2025-08-18T03:02:15Z-
dc.date.available2025-08-18T03:02:15Z-
dc.identifier.urihttp://hdl.handle.net/10397/114613-
dc.descriptionInterspeech 2008, Brisbane, Australia, 22-26 September 2008en_US
dc.language.isoenen_US
dc.publisherInternational Speech Communication Associationen_US
dc.rightsCopyright © 2008 ISCAen_US
dc.rightsThe following publication Zhang, S.-X., Mak, M.-W. (2008) High-level speaker verification via articulatory-feature based sequence kernels and SVM. Proc. Interspeech 2008, 1393-1396 is available at https://doi.org/10.21437/Interspeech.2008-404.en_US
dc.titleHigh-level speaker verification via articulatory-feature based sequence kernels and SVMen_US
dc.typeConference Paperen_US
dc.identifier.spage1393-
dc.identifier.epage1396-
dc.identifier.doi10.21437/interspeech.2008-404-
dcterms.abstractArticulatory-feature based pronunciation models (AFCPMs) are capable of capturing the pronunciation variations among different speakers and are good for high-level speaker recognition. However, the likelihood-ratio scoring method of AFPCMs is based on a decision boundary created by training the target speaker model and universal background model (UBM) separately. Therefore, the method does not fully utilize the discriminative information available in the training data. To fully harness the discriminative information, this paper proposes training a support vector machine (SVM) for computing the verification scores. More precisely, the models of target speakers, individual background speakers, and claimants are converted to AF-supervectors, which form the inputs to an AF-based kernel of the SVM for computing verification scores. Results show that the proposed AF-kernel scoring is complementary to likelihood-ratio scoring, leading to better performance when the two scoring methods are combined. Further performance enhancement was also observed when the AF scores were combined with acoustic scores derived from a GMM-UBM system.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2008, p. 1393-1396-
dcterms.issued2008-
dc.identifier.scopus2-s2.0-84867219243-
dc.description.validate202508 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Othersen_US
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextHKPolyU Project No. A-PA6Fen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryVoR alloweden_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
zhang08d_interspeech.pdf518.27 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.