Please use this identifier to cite or link to this item:
Title: Accelerating outlier detection with uncertain data using graphics processors
Authors: Matsumoto, T
Hung, E
Issue Date: 2012
Source: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics), 2012, v. 7302, p. 160-180
Abstract: Outlier detection (also known as anomaly detection) is a common data mining task in which data points that lie outside expected patterns in a given dataset are identified. This is useful in areas such as fault detection, intrusion detection and in pre-processing before further analysis. There are many approaches already in use for outlier detection, typically adapting other existing data mining techniques such as cluster analysis, neural networks and classification methods such as Support Vector Machines. However, in many cases data from sources such as sensor networks can be better represented with an uncertain model. Detecting outliers with uncertain data involves far more computation as each data object is usually represented by a number of probability density functions (pdfs).
In this paper, we demonstrate an implementation of outlier detection with uncertain objects based on an existing density sampling method that we have parallelized using the cross-platform OpenCL framework. While the density sampling method is a well understood and relatively straightforward outlier detection technique, its application to uncertain data results in a much higher computational workload. Our optimized implementation uses an inexpensive GPU (Graphics Processing Unit) to greatly reduce the running time. This improvement in performance may be leveraged when attempting to detect outliers with uncertain data in time sensitive situations such as when responding to sensor failure or network intrusion.
Publisher: Springer
Journal: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics) 
ISBN: 978-3-642-30219-0
ISSN: 0302-9743
EISSN: 1611-3349
DOI: 10.1007/978-3-642-30220-6_15
Description: The 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD-2012), Kuala Lumpur, Malaysia, May 29 - June 1, 2012
Appears in Collections:Conference Paper

View full-text via PolyU eLinks SFX Query
Show full item record


Last Week
Last month
Citations as of Aug 19, 2020

Page view(s)

Last Week
Last month
Citations as of Sep 20, 2020

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.