Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/3788
Title: Cleaning and querying large uncertain databases
Authors: Chen, Jinchuan
Keywords: Hong Kong Polytechnic University -- Dissertations
Database management
Data mining
Uncertainty -- Mathematical models
Issue Date: 2009
Publisher: The Hong Kong Polytechnic University
Abstract: The management of uncertain databases has recently attracted tremendous interest from both industry and academy communities. In particular, there is a need to handle uncertain data in many emerging applications, such as the wireless sensor network, biometric and biological databases, location-based services, and data stream applications. To obtain meaningful results over these uncertain data, probabilistic queries are proposed, which augment query results with confidence. Although probabilistic queries are useful, evaluating them is costly, in terms of both I/O and computation. Moreover, the calculation of answer probabilities involves expensive numerical integrations. Therefore the efficient evaluation of probabilistic queries is a challenge for uncertain database management. In this thesis, we report our works for speeding up the evaluation performance of three kinds of important probabilistic queries - nearest-neighbor queries, k-nearest-neighbor queries, and imprecise location-dependent queries. New approaches are proposed to improve the efficiency in both I/O and computation, and they are evaluated by extensive simulations over real and synthetic data sets. Another important issue that we consider in this thesis is the cleaning of uncertain data with the goal of achieving higher quality. Since the applications handling imprecise data have resource limitation, the cleaning process must optimize the use of resources. We study theoretically and experimentally on how the result quality could be maximized with constrained resources, with the use of entropy-based metrics. We also outline the future directions of our work.
Description: xv, 184 p. : ill. ; 30 cm.
PolyU Library Call No.: [THS] LG51 .H577P COMP 2009 Chen
URI: http://hdl.handle.net/10397/3788
Rights: All rights reserved.
Appears in Collections:Thesis

Files in This Item:
File Description SizeFormat 
b2306173x_link.htmFor PolyU Users 162 BHTMLView/Open
b2306173x_ir.pdfFor All Users (Non-printable) 2.04 MBAdobe PDFView/Open
Show full item record

Page view(s)

445
Last Week
6
Last month
Checked on Jun 18, 2017

Download(s)

386
Checked on Jun 18, 2017

Google ScholarTM

Check



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.