Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/19536
Title: Parallel outlier detection on uncertain data for GPUs
Authors: Matsumoto, T
Hung, E
Yiu, ML 
Keywords: GPU
Outlier detection
Parallel processing
Uncertain data
Issue Date: 2015
Publisher: Kluwer Academic Publishers
Source: Distributed and Parallel Databases, 2015, v. 33, no. 3, p. 417-447
Journal: Distributed and Parallel Databases 
Abstract: Outlier detection, also known as anomaly detection, is a common data mining task that identifies data points falling outside the expected patterns in a given dataset. It has useful applications such as detecting network intrusions, system faults, and fraudulent activity. In addition, real-world data are uncertain in nature and may be represented as uncertain data. In this paper, we propose an improved parallel algorithm for outlier detection on uncertain data using density sampling, and we develop an implementation that runs on both GPUs and multi-core CPUs using the OpenCL framework. Our main focus is on GPUs, as they are cost-effective, massively parallel floating-point processors suitable for many data mining applications. Our implementation exploits some key features of GPUs and differs significantly from a traditional CPU implementation. We first present an improved uncertain outlier detection algorithm. Then, we demonstrate two parallel micro-clustering implementations. The performance and detection quality comparisons demonstrate the benefits of the improved algorithm and of the parallel implementation on GPUs.
URI: http://hdl.handle.net/10397/19536
ISSN: 0926-8782
DOI: 10.1007/s10619-014-7155-9
Appears in Collections: Journal/Magazine Article
