Blogs (1) >>
ICSE 2019
Sat 25 - Fri 31 May 2019 Montreal, QC, Canada
Thu 30 May 2019 14:50 - 15:10 at Laurier - Automated Repair 2 Chair(s): Hamid Bagheri

Current state-of-the-art automatic software repair (ASR) techniques rely heavily on incomplete specifications, or test suites, to generate repairs. This, however, may cause ASR tools to generate repairs that are incorrect and hard to generalize. To assess patch correctness, researchers have been following two methods separately: (1) Automated annotation, wherein patches are automatically labeled by an independent test suite (ITS) – a patch passing the ITS is regarded as correct or generalizable, and incorrect otherwise, (2) Author annotation, wherein authors of ASR techniques manually annotate the correctness labels of patches generated by their and competing tools. While automated annotation cannot ascertain that a patch is actually correct, author annotation is prone to subjectivity. This concern has caused an on-going debate on the appropriate ways to assess the effectiveness of numerous ASR techniques proposed recently.

In this work, we propose to assess reliability of author and automated annotations on patch correctness assessment. We do this by first constructing a gold set of correctness labels for 189 randomly selected patches generated by 8 state-of-the-art ASR techniques through a user study involving 35 professional developers as independent annotators. By measuring inter-rater agreement as a proxy for annotation quality – as commonly done in the literature – we demonstrate that our constructed gold set is on par with other high-quality gold sets. We then compare labels generated by author and automated annotations with this gold set to assess reliability of the patch assessment methodologies. We subsequently report several findings and highlight implications for future studies.

Thu 30 May

Displayed time zone: Eastern Time (US & Canada) change

14:00 - 15:30
Automated Repair 2Papers / Journal-First Papers / Software Engineering in Practice / Technical Track at Laurier
Chair(s): Hamid Bagheri University of Nebraska-Lincoln, USA
14:00
20m
Talk
SapFix: Automated End-to-End Repair at ScaleSEIPIndustry Program
Software Engineering in Practice
Alexandru Marginean University College London, UK, Johannes Bader Facebook, Satish Chandra Facebook, Mark Harman Facebook and University College London, Yue Jia University College London, Ke Mao Meta, Alexander Mols Facebook, Andrew Scott Facebook
14:20
20m
Talk
VFix: Value-Flow-Guided Precise Program Repair for Null Pointer DereferencesArtifacts Evaluated ReusableTechnical Track
Technical Track
Xuezheng Xu UNSW Sydney, Yulei Sui University of Technology Sydney, Australia, Hua Yan University of New South Wales, Jingling Xue UNSW Sydney
14:40
10m
Talk
ARJA: Automated Repair of Java Programs via Multi-Objective Genetic ProgrammingJournal-First
Journal-First Papers
Yuan Yuan Michigan State University, Wolfgang Banzhaf Michigan State University
14:50
20m
Talk
On Reliability of Patch Correctness AssessmentTechnical Track
Technical Track
Xuan Bach D. Le Carnegie Mellon University, Lingfeng Bao Zhejiang University City College, David Lo Singapore Management University, Xin Xia Monash University, Shanping Li , Corina S. Păsăreanu Carnegie Mellon University Silicon Valley, NASA Ames Research Center
15:10
10m
Talk
Alleviating Patch Overfitting with Automatic Test Generation: A Study of Feasibility and Effectiveness for the Nopol Repair SystemJournal-First
Journal-First Papers
Zhongxing Yu , Matias Martinez University of Valenciennes, Benjamin Danglot University Lille 1 and INRIA, Thomas Durieux INRIA, Martin Monperrus KTH Royal Institute of Technology
15:20
10m
Talk
Discussion Period
Papers