A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction
| Venue: | in Proceedings of ICSE 2008 |
| Citations: | 16 - 0 self |
BibTeX
@INPROCEEDINGS{Moser_acomparative,
author = {Raimund Moser and Witold Pedrycz and Giancarlo Succi},
title = {A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction},
booktitle = {in Proceedings of ICSE 2008},
year = {},
pages = {181--190}
}
OpenURL
Abstract
In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective defect-free. Classification models are built using three common machine learners: logistic regression, Naïve Bayes, and decision trees. To allow different costs for prediction errors we perform cost-sensitive classification, which proves to be very successful:>75% percentage of correctly classified files, a recall of>80%, and a false positive rate <30%. Results indicate that for the Eclipse data, process metrics are more efficient defect predictors than code metrics. Categories and Subject Descriptors D.2.8 [Metrics]: Process metrics and product metrics. D.2.9 [Management]: Software quality assurance.







