Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/38090
Title: Mining static code metrics for a robust prediction of software defect-proneness
Authors: Li, L
Leung, H 
Keywords: Data mining
Defect-proneness
Robust prediction
Software quality
Uncertainty
Issue Date: 2011
Source: 2011 International Symposium on Empirical Software Engineering and Measurement (ESEM), 22-23 Sept. 2011, Banff, AB, p. 207-214 How to cite?
Abstract: Defect-proneness prediction is affected by multiple aspects including sampling bias, non-metric factors, uncertainty of models etc. These aspects often contribute to prediction uncertainty and result in variance of prediction. This paper proposes two methods of data mining static code metrics to enhance defect-proneness prediction. Given little non-metric or qualitative information extracted from software codes, we first suggest to use a robust unsupervised learning method, shared nearest neighbors (SNN) to extract the similarity patterns of the code metrics. These patterns indicate similar characteristics of the components of the same cluster that may result in introduction of similar defects. Using the similarity patterns with code metrics as predictors, defect-proneness prediction may be improved. The second method uses the Occam's windows and Bayesian model averaging to deal with model uncertainty: first, the datasets are used to train and cross-validate multiple learners and then highly qualified models are selected and integrated into a robust prediction. From a study based on 12 datasets from NASA, we conclude that our proposed solutions can contribute to a better defect-proneness prediction.
URI: http://hdl.handle.net/10397/38090
ISBN: 978-1-4577-2203-5
DOI: 10.1109/ESEM.2011.29
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

23
Last Week
1
Last month
Checked on Sep 24, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.