Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/100773
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Land Surveying and Geo-Informatics-
dc.creatorZhang, Aen_US
dc.creatorShi, Wen_US
dc.creatorWebb, GIen_US
dc.date.accessioned2023-08-11T03:13:21Z-
dc.date.available2023-08-11T03:13:21Z-
dc.identifier.issn1384-5810en_US
dc.identifier.urihttp://hdl.handle.net/10397/100773-
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.rights© The Author(s) 2016en_US
dc.rightsThis version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use(https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms), but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s10618-015-0446-6.en_US
dc.subjectAssociation rulesen_US
dc.subjectPattern discoveryen_US
dc.subjectStatistical evaluationen_US
dc.subjectUncertain dataen_US
dc.titleMining significant association rules from uncertain dataen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage928en_US
dc.identifier.epage963en_US
dc.identifier.volume30en_US
dc.identifier.issue4en_US
dc.identifier.doi10.1007/s10618-015-0446-6en_US
dcterms.abstractIn association rule mining, the trade-off between avoiding harmful spurious rules and preserving authentic ones is an ever critical barrier to obtaining reliable and useful results. The statistically sound technique for evaluating statistical significance of association rules is superior in preventing spurious rules, yet can also cause severe loss of true rules in presence of data error. This study presents a new and improved method for statistical test on association rules with uncertain erroneous data. An original mathematical model was established to describe data error propagation through computational procedures of the statistical test. Based on the error model, a scheme combining analytic and simulative processes was designed to correct the statistical test for distortions caused by data error. Experiments on both synthetic and real-world data show that the method significantly recovers the loss in true rules (reduces type-2 error) due to data error occurring in original statistically sound method. Meanwhile, the new method maintains effective control over the familywise error rate, which is the distinctive advantage of the original statistically sound technique. Furthermore, the method is robust against inaccurate data error probability information and situations not fulfilling the commonly accepted assumption on independent error probabilities of different data items. The method is particularly effective for rules which were most practically meaningful yet sensitive to data error. The method proves promising in enhancing values of association rule mining results and helping users make correct decisions.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationData mining and knowledge discovery, July 2016, v. 30, no. 4, p. 928-963en_US
dcterms.isPartOfData mining and knowledge discoveryen_US
dcterms.issued2016-07-
dc.identifier.scopus2-s2.0-84954314823-
dc.description.validate202305 bckw-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberLSGI-0439-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextNational Administration of Surveying, Mapping and Geoinformation, P.R. Chinaen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS6607758-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Zhang_Mining_Significant_Association.pdfPre-Published version1.3 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

56
Citations as of Apr 14, 2025

Downloads

47
Citations as of Apr 14, 2025

SCOPUSTM   
Citations

14
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

7
Citations as of Oct 10, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.