Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/11057
Title: Distributed mining of classification rules
Authors: Cho, V 
Wüthrich, B
Keywords: Confidence
Consensus
Distributed mining
Rules
Support
Issue Date: 2002
Publisher: Springer
Source: Knowledge and information systems, 2002, v. 4, no. 1, p. 1-30 How to cite?
Journal: Knowledge and information systems 
Abstract: Many successful data-mining techniques and systems have been developed. These techniques usually apply to centralized databases with less restricted requirements on learning and response time. Not so much effort has yet been put into mining distributed databases and real-time issues. In this paper, we investigate issues of fast-distributed data mining. We assume that merging the distributed databases into a single one would either be too costly (distributed case) or the individual fragments would be non-uniform so that mining only one fragment would bias the result (fragmented case). The goal is to classify the objects O of the database into one of several mutually exclusive classes C i . Our approach to make mining fast and feasible is as follows. From each data site or fragment db k , only a single rule r ik is generated for each class C i . A small subset {r i1 , …, r ih } of these individual rules is selected to form a rule set R i for each class C i . These rule subsets represent adequately the hidden knowledge of the entire database. Various selection criteria to form R i are discussed, both theoretically and experimentally.
URI: http://hdl.handle.net/10397/11057
ISSN: 0219-1377
EISSN: 0219-3116
DOI: 10.1007/s10115-002-8191-8
Appears in Collections:Journal/Magazine Article

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

47
Last Week
1
Last month
Checked on Oct 16, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.