A Mixed Methods Approach to Mining Code Review Data: Examples and a Study of Multicommit Reviews and Pull Requests

Peter C. Rigby*, Alberto Bacchelli, Georgios Gousios, Murtuza Mukadam

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volume > Chapter > Scientific

7 Citations (Scopus)

Abstract

Software code review has been considered an important quality assurance mechanism for the last 35 years. The techniques for conducting modern code reviews have evolved along with the software industry, and have become progressively incremental and lightweight. We have studied code review in a number of contemporary settings, including Apache, Linux, KDE, Microsoft, Android, and GitHub. Code review is an inherently social activity, so we have used both quantitative and qualitative methods to understand the underlying parameters (or measures) of the process, as well as the rich interactions and motivations for doing code review. In this chapter, we describe how we have used a mixed methods approach to triangulate our findings on code review. We also describe how we use quantitative data to help us sample the most interesting cases from our data to be analyzed qualitatively. To illustrate code review research, we provide new results that contrast single-commit and multicommit reviews. We find that while multicommit reviews take longer and have more lines churned than single-commit reviews, the same number of people are involved in both types of review. To enrich and triangulate our findings, we qualitatively analyze the characteristics of multicommit reviews, and find that there are two types: reviews of branches and revisions of single commits. We also examine the reasons why commits on GitHub pull requests are rejected.

Original language: English
Title of host publication: The Art and Science of Analyzing Software Data
Place of Publication: Waltham
Publisher: Elsevier
Pages: 231-255
Number of pages: 25
ISBN (Electronic): 9780124115439
ISBN (Print): 978-0-12-411519-4
Publication status: Published - 1 Sept 2015

Keywords

  • Empirical software engineering
  • Inspection
  • Modern code review
