Blogs (1) >>
ICSE 2019
Sat 25 - Fri 31 May 2019 Montreal, QC, Canada
Fri 31 May 2019 11:10 - 11:30 at Laurier - Defect Prediction Chair(s): Burak Turhan

With the rise of the Mining Software Repositories (MSR) field, defect datasets extracted from software repositories play a foundational role in many empirical studies related to software quality. At the core of defect data preparation is the identification of post-release defects. Prior studies leverage many heuristics (e.g., keywords and issue IDs) to identify post-release defects. However, such the heuristic approach is based on several assumptions, which pose common threats to the validity of many studies. In this paper, we set out to investigate the nature of the difference of defect datasets generated by the heuristic approach and the realistic approach that leverages the earliest affected release that is realistically estimated by a software development team for a given defect. In addition, we investigate the impact of defect identification approaches on the predictive accuracy and the ranking of defective modules that are produced by defect models. Through a case study of defect datasets of 32 releases, we find that the heuristic approach has a large impact on both defect count datasets and binary defect datasets. Surprisingly, we find that the heuristic approach has a minimal impact on defect count models, suggesting that future work should not be too concerned about defect count models that are constructed using heuristic defect datasets. On the other hand, using defect datasets generated by the realistic approach leads to an improvement in the predictive accuracy of defect classification models.

Conference Day
Fri 31 May

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30
11:00
10m
Talk
Perceptions, Expectations, and Challenges in Defect PredictionJournal-First
Journal-First Papers
Zhiyuan WanZhejiang University, Xin XiaMonash University, Ahmed E. HassanQueen's University, David LoSingapore Management University, Jianwei Yin, Xiaohu Yang
11:10
20m
Talk
Mining Software Defects: Should We Consider Affected Releases?Artifacts AvailableArtifacts Evaluated ReusableTechnical Track
Technical Track
Suraj YatishThe University of Adelaide, Jirayus JiarpakdeeMonash University, Patanamon ThongtanunamThe University of Melbourne, Chakkrit TantithamthavornMonash University, Australia
11:30
20m
Talk
Class Imbalance Evolution and Verification Latency in Just-in-Time Software Defect PredictionTechnical Track
Technical Track
George CabralUniversity of Birmingham, Leandro Minku , Emad ShihabConcordia University, Suhaib MujahidConcordia University
11:50
10m
Talk
The Impact of Class Rebalancing Techniques on the Performance and Interpretation of Defect Prediction ModelsJournal-First
Journal-First Papers
Chakkrit TantithamthavornMonash University, Australia, Ahmed E. HassanQueen's University, Kenichi MatsumotoNara Institute of Science and Technology
Pre-print
12:00
10m
Talk
On the Relative Value of Data Resampling Approaches for Software Defect PredictionJournal-First
Journal-First Papers
Kwabena E. BenninBlekinge Institute of Technology, SERL Sweden, Jacky Keung, Akito Monden
Authorizer link
12:10
10m
Talk
Energy-Based Anomaly Detection A New Perspective for Predicting Software FailuresNIER Distinguished Paper AwardNIER
New Ideas and Emerging Results
Cristina MonniUniversità della Svizzera Italiana, Mauro PezzeUniversità della Svizzera italiana (USI) (Switzerland) and Università degli Studi di Milano Bicocca (Italy)
Pre-print
12:20
10m
Talk
Discussion Period
Papers