Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105479
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.creatorYang, R-
dc.creatorShi, J-
dc.creatorYang, Y-
dc.creatorHuang, K-
dc.creatorZhang, S-
dc.creatorXiao, X-
dc.date.accessioned2024-04-15T07:34:37Z-
dc.date.available2024-04-15T07:34:37Z-
dc.identifier.isbn978-1-4503-8312-7-
dc.identifier.urihttp://hdl.handle.net/10397/105479-
dc.descriptionWWW '21: The Web Conference 2021, Ljubljana, Slovenia, April 19-23, 2021en_US
dc.language.isoenen_US
dc.publisherAssociation for Computing Machineryen_US
dc.rightsThis paper is published under the Creative Commons Attribution 4.0 International (CC-BY 4.0) license (https://creativecommons.org/licenses/by/4.0/). Authors reserve their rights to disseminate the work on their personal and corporate Web sites with the appropriate attribution.en_US
dc.rights© 2021 IW3C2 (International World Wide Web Conference Committee), publishedunder Creative Commons CC-BY 4.0 License.en_US
dc.rightsThe following publication Renchi Yang, Jieming Shi, Yin Yang, Keke Huang, Shiqi Zhang, and Xiaokui Xiao. 2021. Effective and Scalable Clustering on Massive Attributed Graphs. In Proceedings of the Web Conference 2021 (WWW ’21), April 19–23, 2021, Ljubljana, Slovenia. ACM, New York, NY, USA, 13 pages is available at https://doi.org/10.1145/3442381.3449875.en_US
dc.subjectAttributed graphen_US
dc.subjectGraph clusteringen_US
dc.subjectRandom walken_US
dc.titleEffective and scalable clustering on massive attributed graphsen_US
dc.typeConference Paperen_US
dc.identifier.spage3675-
dc.identifier.epage3687-
dc.identifier.doi10.1145/3442381.3449875-
dcterms.abstractGiven a graph G where each node is associated with a set of attributes, and a parameter k specifying the number of output clusters, k-attributed graph clustering (k-AGC) groups nodes in G into k disjoint clusters, such that nodes within the same cluster share similar topological and attribute characteristics, while those in different clusters are dissimilar. This problem is challenging on massive graphs, e.g., with millions of nodes and billions of attribute values. For such graphs, existing solutions either incur prohibitively high costs, or produce clustering results with compromised quality. In this paper, we propose , an efficient approach to k-AGC that yields high-quality clusters with costs linear to the size of the input graph G. The main contributions of are twofold: (i) a novel formulation of the k-AGC problem based on an attributed multi-hop conductance quality measure custom-made for this problem setting, which effectively captures cluster coherence in terms of both topological proximities and attribute similarities, and (ii) a linear-time optimization solver that obtains high quality clusters iteratively, based on efficient matrix operations such as orthogonal iterations, an alternative optimization approach, as well as an initialization technique that significantly speeds up the convergence of in practice. Extensive experiments, comparing 11 competitors on 6 real datasets, demonstrate that consistently outperforms all competitors in terms of result quality measured against ground truth labels, while being up to orders of magnitude faster. In particular, on the Microsoft Academic Knowledge Graph dataset with 265.2 million edges and 1.1 billion attribute values, outputs high-quality results for 5-AGC within 1.68 hours using a single CPU core, while none of the 11 competitors finish within 3 days.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn WWW '21 : Proceedings of the Web Conference 2021, p. 3675-3687. New York, NY : Association for Computing Machinery; 2021-
dcterms.issued2021-
dc.relation.ispartofbookWWW '21 : Proceedings of the Web Conference 2021-
dc.relation.conferenceWorld Wide Web Conference [WWW]-
dc.description.validate202402 bcch-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberCOMP-0127en_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextHong Kong Polytechnic Universityen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS50639862en_US
dc.description.oaCategoryCCen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
3442381.3449875.pdf1.2 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

54
Citations as of Apr 14, 2025

Downloads

36
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

3
Citations as of Jun 21, 2024

WEB OF SCIENCETM
Citations

6
Citations as of Nov 14, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.