DefeXts: A Curated Dataset of Reproducible Real-World Bugs for Modern JVM LanguagesDemos
Software engineering studies, such as bug detection, localization, repair, and prediction, often require benchmark bug datasets for their experiments. Few publicly available reproducible bug datasets exist for research consumption. Such datasets which do exist tend to be only applicable towards traditional programming languages (e.g., Defects4J for Java and CoreBench for C). Thus, the creation and widespread usage of bug datasets for the popular modern JVM (Java Virtual Machine) programming languages serve to provide a vital resource for software research. This paper introduces DefeXts, a family of extendable bug datasets currently containing child bug datasets for Kotlin (DefeKts) and Groovy (DefeGts). Each dataset contains reproducible real-world bugs and their corresponding patches scraped from real-world projects. Our introductory versions of DefeKts and DefeGts include 225 Kotlin and 301 Groovy bugs patches. As development of DefeXts continues, we aim to include other modern JVM languages, notably Scala.
Wed 29 MayDisplayed time zone: Eastern Time (US & Canada) change
16:00 - 18:00 | SE Datasets, Research Infrastructure, and MethodologyJournal-First Papers / New Ideas and Emerging Results / Demonstrations / Papers / Technical Track at Viger Chair(s): Rashina Hoda The University of Auckland | ||
16:00 20mTalk | BugSwarm: Mining and Continuously Growing a Dataset of Reproducible Failures and FixesTechnical Track Technical Track Naji Dmeiri University of California, Davis, David A Tomassi University of California, Davis, Yichen Wang University of California, Davis, Antara Bhowmick University of California, Davis, Yen-Chuan Liu University of California, Davis, Prem Devanbu University of California, Bogdan Vasilescu Carnegie Mellon University, Cindy Rubio-González University of California, Davis Pre-print | ||
16:20 20mTalk | DefeXts: A Curated Dataset of Reproducible Real-World Bugs for Modern JVM LanguagesDemos Demonstrations Samuel Benton The University of Texas at Dallas, Ali Ghanbari The University of Texas at Dallas, Lingming Zhang | ||
16:40 10mTalk | Open Collaborative Data – using OSS principles to share data in SW engineeringNIER New Ideas and Emerging Results Per Runeson Lund University | ||
16:50 10mTalk | Leveraging Small Software Engineering Data Sets with Pre-trained Neural NetworksNIER New Ideas and Emerging Results | ||
17:00 20mTalk | ActionNet: Vision-based Workflow Action Recognition From Programming ScreencastsTechnical Track Technical Track Dehai Zhao , Zhenchang Xing Australia National University, Chunyang Chen Monash University, Xin Xia Monash University, Guoqiang Li Shanghai Jiao Tong University | ||
17:20 10mTalk | The ABC of Software Engineering ResearchJournal-First Journal-First Papers Klaas-Jan Stol University College Cork and Lero, Ireland, Brian Fitzgerald Lero - The Irish Software Research Centre and University of Limerick Link to publication DOI | ||
17:30 10mTalk | Mining Plausible Hypotheses from the Literature via Meta-AnalysisNIER New Ideas and Emerging Results Vladimir Ivanov , Giancarlo Succi Innopolis University, Jooyong Yi UNIST (Ulsan National Institute of Science and Technology) | ||
17:40 10mTalk | Analyzing Families of Experiments in SE: a Systematic Mapping StudyJournal-First Journal-First Papers Adrian Santos Parrilla , Omar Gomez Escuela Superior Politecnica de Chimborazo Riobamba, Natalia Juristo Universidad Politecnica de Madrid | ||
17:50 10mTalk | Discussion Period Papers |