Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/53653
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineering-
dc.creatorGanganath, N-
dc.creatorCheng, CT-
dc.creatorTse, CK-
dc.date.accessioned2016-06-27T02:08:56Z-
dc.date.available2016-06-27T02:08:56Z-
dc.identifier.isbn978-1-4799-6235-8-
dc.identifier.urihttp://hdl.handle.net/10397/53653-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Ganganath, N., Cheng, C. -., & Tse, C. K. (2014). Data clustering with cluster size constraints using a modified k-means algorithm. Paper presented at the Proceedings - 2014 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2014, 158-161 is available at 10.1109/CyberC.2014.36en_US
dc.subjectConstrained clusteringen_US
dc.subjectData clusteringen_US
dc.subjectData miningen_US
dc.subjectK-meansen_US
dc.subjectSize constraintsen_US
dc.titleData clustering with cluster size constraints using a modified k-means algorithmen_US
dc.typeConference Paperen_US
dc.identifier.spage158-
dc.identifier.epage161-
dc.identifier.doi10.1109/CyberC.2014.36-
dcterms.abstractData clustering is a frequently used technique in finance, computer science, and engineering. In most of the applications, cluster sizes are either constrained to particular values or available as prior knowledge. Unfortunately, traditional clustering methods cannot impose constrains on cluster sizes. In this paper, we propose some vital modifications to the standard k-means algorithm such that it can incorporate size constraints for each cluster separately. The modified k-means algorithm can be used to obtain clusters in preferred sizes. A potential application would be obtaining clusters with equal cluster size. Moreover, the modified algorithm makes use of prior knowledge of the given data set for selectively initializing the cluster centroids which helps escaping from local minima. Simulation results on multidimensional data demonstrate that the k-means algorithm with the proposed modifications can fulfill cluster size constraints and lead to more accurate and robust results.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitation2014 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 13-15 Oct. 2014, Shanghai, China, p. 158-161-
dcterms.issued2014-
dc.identifier.scopus2-s2.0-84921044566-
dc.relation.conferenceInternational Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery [CyberC]-
dc.identifier.rosgroupid2014001366-
dc.description.ros2014-2015 > Academic research: refereed > Refereed conference paper-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera0020-n10en_US
dc.description.pubStatusPublisheden_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Ganganath_Data_Clustering_K-means.pdfPre-published version405.55 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

379
Last Week
0
Last month
Citations as of Apr 21, 2024

Downloads

1,253
Citations as of Apr 21, 2024

SCOPUSTM   
Citations

59
Last Week
0
Last month
Citations as of Apr 26, 2024

WEB OF SCIENCETM
Citations

41
Last Week
0
Last month
Citations as of Apr 25, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.