Fine-grained just-in-time defect prediction

Luca Pascarella; Fabio Palomba; Alberto Bacchelli

doi:10.1016/j.jss.2018.12.001

Fine-grained just-in-time defect prediction

Luca Pascarella, Fabio Palomba^*, Alberto Bacchelli

^*Corresponding author for this work

Software Engineering

Research output: Contribution to journal › Article › Scientific › peer-review

92 Citations (Scopus)

113 Downloads (Pure)

Abstract

Defect prediction models focus on identifying defect-prone code elements, for example to allow practitioners to allocate testing resources on specific subsystems and to provide assistance during code reviews. While the research community has been highly active in proposing metrics and methods to predict defects on long-term periods (i.e.,at release time), a recent trend is represented by the so-called short-term defect prediction (i.e.,at commit-level). Indeed, this strategy represents an effective alternative in terms of effort required to inspect files likely affected by defects. Nevertheless, the granularity considered by such models might be still too coarse. Indeed, existing commit-level models highlight an entire commit as defective even in cases where only specific files actually contain defects. In this paper, we first investigate to what extent commits are partially defective; then, we propose a novel fine-grained just-in-time defect prediction model to predict the specific files, contained in a commit, that are defective. Finally, we evaluate our model in terms of (i) performance and (ii) the extent to which it decreases the effort required to diagnose a defect. Our study highlights that: (1) defective commits are frequently composed of a mixture of defective and non-defective files, (2) our fine-grained model can accurately predict defective files with an AUC-ROC up to 82% and (3) our model would allow practitioners to save inspection efforts with respect to standard just-in-time techniques.

Original language	English
Pages (from-to)	22-36
Number of pages	15
Journal	Journal of Systems and Software
Volume	150
DOIs	https://doi.org/10.1016/j.jss.2018.12.001
Publication status	Published - 2019

Keywords

Empirical Software Engineering
Just-in-time defect prediction
Mining software repositories

Access to Document

10.1016/j.jss.2018.12.001

50558653 - Fine-Grained Just-In-Time Defect PredictionAccepted author manuscript, 489 KB

Cite this

@article{40c6d9d95d3e48f9ba7bdea21d6af1ce,

title = "Fine-grained just-in-time defect prediction",

abstract = "Defect prediction models focus on identifying defect-prone code elements, for example to allow practitioners to allocate testing resources on specific subsystems and to provide assistance during code reviews. While the research community has been highly active in proposing metrics and methods to predict defects on long-term periods (i.e.,at release time), a recent trend is represented by the so-called short-term defect prediction (i.e.,at commit-level). Indeed, this strategy represents an effective alternative in terms of effort required to inspect files likely affected by defects. Nevertheless, the granularity considered by such models might be still too coarse. Indeed, existing commit-level models highlight an entire commit as defective even in cases where only specific files actually contain defects. In this paper, we first investigate to what extent commits are partially defective; then, we propose a novel fine-grained just-in-time defect prediction model to predict the specific files, contained in a commit, that are defective. Finally, we evaluate our model in terms of (i) performance and (ii) the extent to which it decreases the effort required to diagnose a defect. Our study highlights that: (1) defective commits are frequently composed of a mixture of defective and non-defective files, (2) our fine-grained model can accurately predict defective files with an AUC-ROC up to 82% and (3) our model would allow practitioners to save inspection efforts with respect to standard just-in-time techniques.",

keywords = "Empirical Software Engineering, Just-in-time defect prediction, Mining software repositories",

author = "Luca Pascarella and Fabio Palomba and Alberto Bacchelli",

year = "2019",

doi = "10.1016/j.jss.2018.12.001",

language = "English",

volume = "150",

pages = "22--36",

journal = "Journal of Systems and Software",

issn = "0164-1212",

publisher = "Elsevier",

}

TY - JOUR

T1 - Fine-grained just-in-time defect prediction

AU - Pascarella, Luca

AU - Palomba, Fabio

AU - Bacchelli, Alberto

PY - 2019

Y1 - 2019

N2 - Defect prediction models focus on identifying defect-prone code elements, for example to allow practitioners to allocate testing resources on specific subsystems and to provide assistance during code reviews. While the research community has been highly active in proposing metrics and methods to predict defects on long-term periods (i.e.,at release time), a recent trend is represented by the so-called short-term defect prediction (i.e.,at commit-level). Indeed, this strategy represents an effective alternative in terms of effort required to inspect files likely affected by defects. Nevertheless, the granularity considered by such models might be still too coarse. Indeed, existing commit-level models highlight an entire commit as defective even in cases where only specific files actually contain defects. In this paper, we first investigate to what extent commits are partially defective; then, we propose a novel fine-grained just-in-time defect prediction model to predict the specific files, contained in a commit, that are defective. Finally, we evaluate our model in terms of (i) performance and (ii) the extent to which it decreases the effort required to diagnose a defect. Our study highlights that: (1) defective commits are frequently composed of a mixture of defective and non-defective files, (2) our fine-grained model can accurately predict defective files with an AUC-ROC up to 82% and (3) our model would allow practitioners to save inspection efforts with respect to standard just-in-time techniques.

AB - Defect prediction models focus on identifying defect-prone code elements, for example to allow practitioners to allocate testing resources on specific subsystems and to provide assistance during code reviews. While the research community has been highly active in proposing metrics and methods to predict defects on long-term periods (i.e.,at release time), a recent trend is represented by the so-called short-term defect prediction (i.e.,at commit-level). Indeed, this strategy represents an effective alternative in terms of effort required to inspect files likely affected by defects. Nevertheless, the granularity considered by such models might be still too coarse. Indeed, existing commit-level models highlight an entire commit as defective even in cases where only specific files actually contain defects. In this paper, we first investigate to what extent commits are partially defective; then, we propose a novel fine-grained just-in-time defect prediction model to predict the specific files, contained in a commit, that are defective. Finally, we evaluate our model in terms of (i) performance and (ii) the extent to which it decreases the effort required to diagnose a defect. Our study highlights that: (1) defective commits are frequently composed of a mixture of defective and non-defective files, (2) our fine-grained model can accurately predict defective files with an AUC-ROC up to 82% and (3) our model would allow practitioners to save inspection efforts with respect to standard just-in-time techniques.

KW - Empirical Software Engineering

KW - Just-in-time defect prediction

KW - Mining software repositories

UR - http://www.scopus.com/inward/record.url?scp=85059617149&partnerID=8YFLogxK

U2 - 10.1016/j.jss.2018.12.001

DO - 10.1016/j.jss.2018.12.001

M3 - Article

AN - SCOPUS:85059617149

SN - 0164-1212

VL - 150

SP - 22

EP - 36

JO - Journal of Systems and Software

JF - Journal of Systems and Software

ER -

Fine-grained just-in-time defect prediction

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this