Blogs (1) >>
ICSE 2019
Sat 25 - Fri 31 May 2019 Montreal, QC, Canada
Wed 29 May 2019 16:40 - 16:50 at Viger - SE Datasets, Research Infrastructure, and Methodology Chair(s): Rashina Hoda

Reliance on data for software systems engineering is increasing, e.g., to train machine learning applications. As a consequence, we foresee increasing costs for data collection and maintenance, leading to the risk of development budgets eaten up by commodity features, leaving little resources for differentiation and innovation. We therefore propose Open Collaborative Data (OCD) - a concept analogous to Open Source Software (OSS) - as a means to share commodity data. In contrast to Open Data (OD), which e.g., governmental agencies provide to catalyze innovation, OCD is shared in open collaboration between commercial organizations, similar to OSS. To achieve this, there is a need for technical infrastructure (e.g., tools for version and access control), licence models, and governance models, all of which have to be tailored for data. However, as data may be sensitive for privacy, anonymization and obfuscation of data is also a research challenge. In this paper, we define the concept of Open Collaborative Data, demonstrate it by the example of map data and image recognition examples, and outline a research agenda for OCD in software engineering as a basis for more efficient evolution of software systems.

Wed 29 May

Displayed time zone: Eastern Time (US & Canada) change

16:00 - 18:00
SE Datasets, Research Infrastructure, and MethodologyJournal-First Papers / New Ideas and Emerging Results / Demonstrations / Papers / Technical Track at Viger
Chair(s): Rashina Hoda The University of Auckland
16:00
20m
Talk
BugSwarm: Mining and Continuously Growing a Dataset of Reproducible Failures and FixesTechnical Track
Technical Track
Naji Dmeiri University of California, Davis, David A Tomassi University of California, Davis, Yichen Wang University of California, Davis, Antara Bhowmick University of California, Davis, Yen-Chuan Liu University of California, Davis, Prem Devanbu University of California, Bogdan Vasilescu Carnegie Mellon University, Cindy Rubio-González University of California, Davis
Pre-print
16:20
20m
Talk
DefeXts: A Curated Dataset of Reproducible Real-World Bugs for Modern JVM LanguagesDemos
Demonstrations
Samuel Benton The University of Texas at Dallas, Ali Ghanbari The University of Texas at Dallas, Lingming Zhang
16:40
10m
Talk
Open Collaborative Data – using OSS principles to share data in SW engineeringNIER
New Ideas and Emerging Results
Per Runeson Lund University
16:50
10m
Talk
Leveraging Small Software Engineering Data Sets with Pre-trained Neural NetworksNIER
New Ideas and Emerging Results
Andrea Janes , Romain Robbes Free University of Bozen-Bolzano
17:00
20m
Talk
ActionNet: Vision-based Workflow Action Recognition From Programming ScreencastsTechnical Track
Technical Track
Dehai Zhao , Zhenchang Xing Australia National University, Chunyang Chen Monash University, Xin Xia Monash University, Guoqiang Li Shanghai Jiao Tong University
17:20
10m
Talk
The ABC of Software Engineering ResearchJournal-First
Journal-First Papers
Klaas-Jan Stol University College Cork and Lero, Ireland, Brian Fitzgerald Lero - The Irish Software Research Centre and University of Limerick
Link to publication DOI
17:30
10m
Talk
Mining Plausible Hypotheses from the Literature via Meta-AnalysisNIER
New Ideas and Emerging Results
Vladimir Ivanov , Giancarlo Succi Innopolis University, Jooyong Yi UNIST (Ulsan National Institute of Science and Technology)
17:40
10m
Talk
Analyzing Families of Experiments in SE: a Systematic Mapping StudyJournal-First
Journal-First Papers
Adrian Santos Parrilla , Omar Gomez Escuela Superior Politecnica de Chimborazo Riobamba, Natalia Juristo Universidad Politecnica de Madrid
17:50
10m
Talk
Discussion Period
Papers