Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/100743
PIRA download icon_1.1View/Download Full Text
Title: Mining significant crisp-fuzzy spatial association rules
Authors: Shi, W 
Zhang, A 
Webb, GI
Issue Date: 2018
Source: International journal of geographical information science, 2018, v. 32, no. 6, p. 1247-1270
Abstract: Spatial association rule mining (SARM) is an important data mining task for understanding implicit and sophisticated interactions in spatial data. The usefulness of SARM results, represented as sets of rules, depends on their reliability: the abundance of rules, control over the risk of spurious rules, and accuracy of rule interestingness measure (RIM) values. This study presents crisp-fuzzy SARM, a novel SARM method that can enhance the reliability of resultant rules. The method firstly prunes dubious rules using statistically sound tests and crisp supports for the patterns involved, and then evaluates RIMs of accepted rules using fuzzy supports. For the RIM evaluation stage, the study also proposes a Gaussian-curve-based fuzzy data discretization model for SARM with improved design for spatial semantics. The proposed techniques were evaluated by both synthetic and real-world data. The synthetic data was generated with predesigned rules and RIM values, thus the reliability of SARM results could be confidently and quantitatively evaluated. The proposed techniques showed high efficacy in enhancing the reliability of SARM results in all three aspects. The abundance of resultant rules was improved by 50% or more compared with using conventional fuzzy SARM. Minimal risk of spurious rules was guaranteed by statistically sound tests. The probability that the entire result contained any spurious rules was below 1%. The RIM values also avoided large positive errors committed by crisp SARM, which typically exceeded 50% for representative RIMs. The real-world case study on New York City points of interest reconfirms the improved reliability of crisp-fuzzy SARM results, and demonstrates that such improvement is critical for practical spatial data analytics and decision support.
Keywords: Fuzzy sets and logic
Quality issues
Spatial association rules
Spatial data mining
Statistical evaluation
Publisher: Taylor & Francis
Journal: International journal of geographical information science 
ISSN: 1365-8816
EISSN: 1362-3087
DOI: 10.1080/13658816.2018.1434525
Rights: © 2018 Informa UK Limited, trading as Taylor & Francis Group
This is an Accepted Manuscript of an article published by Taylor & Francis in International Journal of Geographical Information Science on 08 Feb 2018 (published online), available at: http://www.tandfonline.com/10.1080/13658816.2018.1434525.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Shi_Mining_Significant_Crisp-Fuzzy.pdfPre-Published version1.16 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

64
Citations as of Apr 14, 2025

Downloads

53
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

11
Citations as of Sep 12, 2025

WEB OF SCIENCETM
Citations

7
Citations as of Oct 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.