Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/76354
Title: Understanding the value of considering client usage context in package cohesion for fault-proneness prediction
Authors: Zhao, YY
Yang, YB
Lu, HM
Liu, JP
Leung, H 
Wu, YS
Zhou, YM
Xu, BW
Keywords: Context
Cohesion
Metrics
Package
Fault-proneness
Prediction
Issue Date: 2017
Publisher: Springer
Source: Automated software engineering, 2017, v. 24, no. 2, p. 393-453 How to cite?
Journal: Automated software engineering 
Abstract: By far, many package cohesion metrics have been proposed from internal structure view and external usage view. Based on whether client usage context (i.e., the way packages are used by their clients) is exploited, we group these metrics into two categories: non-context-based and context-based. Currently, there is no comprehensive empirical research devoted to understanding the actual value of client usage context for fault-proneness prediction. In this study, we conduct a thorough empirical study to investigate the value of considering client usage context in package cohesion for fault-proneness prediction. First, we use principal component analysis to examine the relationships between context-based and non-context-based cohesion metrics. Second, we employ univariate logistic regression analysis to investigate the correlation between context-based cohesion metrics and fault-proneness. Then, we build multivariate prediction models to analyze the ability of context-based cohesion metrics for fault-proneness prediction when used alone or used together with non-context-based cohesion metrics. To obtain comprehensive evaluations, we evaluate the effectiveness of these multivariate models in the ranking and classification scenarios from both cross-validation and across-version perspectives. The experimental results show that (1) context-based cohesion metrics are complementary to non-context-based cohesion metrics; (2) most of context-based cohesion metrics have a significantly negative association with fault-proneness; (3) when used alone or used together with non-context-based cohesion metrics, context-based cohesion metrics can substantially improve the effectiveness of fault-proneness prediction in most studied systems under both cross-validation and across-version evaluation. Client usage context has an important value in package cohesion for fault-proneness prediction.
URI: http://hdl.handle.net/10397/76354
ISSN: 0928-8910
EISSN: 1573-7535
DOI: 10.1007/s10515-016-0198-6
Appears in Collections:Journal/Magazine Article

Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

2
Citations as of May 12, 2018

WEB OF SCIENCETM
Citations

1
Last Week
0
Last month
Citations as of May 20, 2018

Page view(s)

3
Citations as of May 21, 2018

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.