Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/103704
PIRA download icon_1.1View/Download Full Text
Title: Synchronization clustering based on central force optimization and its extension for large-scale datasets
Authors: Hang, W 
Choi, KS 
Wang, S 
Issue Date: 15-Feb-2017
Source: Knowledge-based systems, 15 Feb. 2017, v. 118, p. 31-44
Abstract: Although research on clustering methods has been active in recent years, not only must most current clustering methods pre-set the number of clusters or other user-specific parameters but they also perform on large-scale datasets inefficiently. In this paper, we study the clustering problem by exploring the metaphor of gravitational kinematics based on Central Force Optimization (CFO). However, different from the global synchronization of CFO, we propose a new algorithm G-Sync by simulating the partial synchronization phenomenon. Specifically, we view each data object as a probe and simulate the dynamic interaction behaviour of data objects in the gravitational field. As time evolves, similar data objects will naturally come into partial synchronization and form distinct clusters measured by the proposed degree of local synchronization, and the dynamic interaction behaviour of the data objects is continually simulated over time. By introducing the Davies–Bouldin (DB) index, G-Sync can determine clusters of arbitrary size, shape and density. Moreover, pre-setting the number of clusters to be found is not required. The algorithm is further extended for handling large-scale datasets with the scalable S-G-Sync algorithm, which is based on fast kernel density estimation (FastKDE). S-G-Sync initially condenses a large-scale dataset quickly into its reduced dataset, followed by adaptive clustering on the reduced dataset using G-Sync. Finally, the Clustering on Remaining Objects (CRO) algorithm is proposed to cluster the remaining objects in the large-scale dataset and to capture outlier and singleton clusters effectively. The effectiveness of the G-Sync and S-G-Sync algorithms is theoretically analysed and experimentally verified on synthetic and real-world datasets.
Keywords: Central force optimization
Fast kernel density estimation
Gravitational kinematics
Large-scale datasets
Partial synchronization
Synchronization clustering
Publisher: Elsevier
Journal: Knowledge-based systems 
ISSN: 0950-7051
DOI: 10.1016/j.knosys.2016.11.007
Rights: © 2016 Published by Elsevier B.V.
© 2016. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/.
The following publication Hang, W., Choi, K. S., & Wang, S. (2017). Synchronization clustering based on central force optimization and its extension for large-scale datasets. Knowledge-Based Systems, 118, 31-44 is available at https://doi.org/10.1016/j.knosys.2016.11.007.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Choi_Synchronization_Clustering_Central.pdfPre-Published version3.05 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

104
Last Week
2
Last month
Citations as of Dec 21, 2025

Downloads

87
Citations as of Dec 21, 2025

SCOPUSTM   
Citations

16
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

15
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.