Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/88461
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineeringen_US
dc.creatorJiang, Yen_US
dc.creatorLeung, HFen_US
dc.date.accessioned2020-11-26T03:20:55Z-
dc.date.available2020-11-26T03:20:55Z-
dc.identifier.urihttp://hdl.handle.net/10397/88461-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Y. Jiang and H. F. Frank Leung, "Gaussian Mixture Model and Gaussian Supervector for Image Classification," 2018 IEEE 23rd International Conference on Digital Signal Processing (DSP), Shanghai, China, 2018, pp. 1-5 is available at https://dx.doi.org/10.1109/ICDSP.2018.8631558en_US
dc.subjectGaussian mixture modelen_US
dc.subjectEqual-variance Gaussian mixture modelen_US
dc.subjectGaussian supervectoren_US
dc.subjectImage classificationen_US
dc.titleGaussian mixture model and Gaussian supervector for image classificationen_US
dc.typeConference Paperen_US
dc.identifier.spage1en_US
dc.identifier.epage5en_US
dc.identifier.doi10.1109/ICDSP.2018.8631558en_US
dcterms.abstractGaussian Mixture Model (GMM) has been widely used in speech signal and image signal classification tasks. It can be directly used as a classifier, or used as the representation of speech or image signals. Another important usage of GMM is to serve as the Universal Background Model (UBM) to generate speech representations such as Gaussian Supervector (GSV) and i-vector. In this paper, we borrow GSV from speech signal classification studies and apply it as an image representation for image classification. GSV is calculated based on a Universal Background Model (UBM). Apart from employing the conventional GMM as the UBM to calculate GSV, we also propose the Equal-Variance GMM (EV-GMM), where all the variables in all the Gaussian mixture components share the same variance. Moreover, we derive the kernel version of EV-GMM, which generalizes EV-GMM by introducing a kernel. We then compare GSV to the raw image feature and other popular image representations such as Sparse Representation (SR) and Collaborative Representation (CR). Experiments are carried out on a handwritten digit recognition task, and classification results indicate that GSV can work very well and can be even better than other popular image representations. In addition, as the UBM, the proposed EV-GMM can work better than the conventional GMM.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitation2018 IEEE 23rd International Conference on Digital Signal Processing (DSP), Shanghai, China, China, 19-21 Nov. 2018, p. 1-5en_US
dcterms.issued2018-11-
dc.relation.conferenceIEEE International Conference on Digital Signal Processing [DSP]en_US
dc.description.validate202011 bcrcen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera0512-n04en_US
dc.description.pubStatusPublisheden_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Jiang_Gaussian_Mixture_Model.pdfPre-Published version970.58 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

81
Last Week
0
Last month
Citations as of Apr 14, 2024

Downloads

16
Citations as of Apr 14, 2024

SCOPUSTM   
Citations

5
Citations as of Apr 19, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.