Tools and Benchmarks for Automated Log Parsing (SEIP, Industry Program)
Logs are imperative in the development and maintenance of many software systems. They record detailed runtime information during system operation, allowing developers and support engineers to monitor their systems and diagnose anomalous behaviors and errors. The increasing scale and complexity of modern software systems, however, make the volume of logs explode, rendering traditional manual log inspection infeasible. Many recent studies and industrial tools therefore resort to powerful text search and machine learning-based analytics solutions. Due to the unstructured nature of logs, a crucial first step is to parse log messages into structured data for subsequent analysis. In recent years, automated log parsing has been widely studied in both academia and industry, producing a series of log parsers based on different techniques. To better understand the characteristics of these log parsers, in this paper we present a comprehensive evaluation study on automated log parsing and release the tools and benchmarks to researchers and practitioners. More specifically, we evaluate 13 log parsers on a total of 16 log datasets spanning distributed systems, supercomputers, operating systems, mobile systems, server applications, and standalone software. We report benchmarking results in terms of accuracy, robustness, and efficiency, which are of practical importance when deploying automated log parsing in production. We also share success stories and lessons learned from an industrial application at Huawei. We believe that our work can serve as a basis for, and provide valuable guidance to, future research and technology transfer in automated log parsing.
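To make the core idea concrete: log parsing turns a raw, unstructured message into an event template (the constant text) plus its variable parameters. The sketch below is a deliberately minimal illustration of this step, not a reimplementation of any of the 13 parsers evaluated in the paper; the masking rules and the example HDFS-style messages are assumptions for demonstration only.

```python
import re
from collections import defaultdict

# Masking rules applied in order: IP addresses first, so the generic
# number rule does not fragment them into "<*>.<*>.<*>.<*>".
MASKS = [
    (re.compile(r"\b\d{1,3}(?:\.\d{1,3}){3}\b"), "<*>"),  # IPv4 addresses
    (re.compile(r"0x[0-9a-fA-F]+"), "<*>"),               # hex values
    (re.compile(r"\d+"), "<*>"),                          # other numbers
]

def parse(message: str) -> str:
    """Return the event template for one raw log message."""
    template = message
    for pattern, token in MASKS:
        template = pattern.sub(token, template)
    return template

def group_by_template(messages):
    """Group raw messages under their extracted event templates."""
    groups = defaultdict(list)
    for message in messages:
        groups[parse(message)].append(message)
    return dict(groups)

logs = [
    "Received block blk_3587 of size 67108864 from 10.251.42.84",
    "Received block blk_9212 of size 67108864 from 10.251.43.21",
]
# Both messages collapse into one template:
# "Received block blk_<*> of size <*> from <*>"
print(group_by_template(logs))
```

Real parsers such as those benchmarked in the paper infer templates from the data itself (e.g., by clustering or parse trees) rather than relying on hand-written masks, which is what makes the problem, and the evaluation of accuracy and robustness, non-trivial.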
Wed 29 May (Eastern Time, US & Canada)
14:00 - 15:30 | DevOps and Logging (Software Engineering in Practice / Technical Track / Papers) at Mansfield / Sherbrooke. Chair(s): Diomidis Spinellis, Athens University of Economics and Business
14:00, 20m Talk | An Empirical Investigation of Incident Triage for Online Service Systems (SEIP, Industry Program). Software Engineering in Practice. Junjie Chen (Peking University), Xiaoting He (Microsoft), Qingwei Lin (Microsoft Research, China), Yong Xu (Microsoft, China), Hongyu Zhang (The University of Newcastle), Dan Hao (Peking University), Feng Gao (Microsoft), Zhangwei Xu (Microsoft), Yingnong Dang (Microsoft Azure), Dongmei Zhang (Microsoft Research, China)
14:20, 20m Talk | Tools and Benchmarks for Automated Log Parsing (SEIP, Industry Program). Software Engineering in Practice. Jieming Zhu (Huawei Noah's Ark Lab), Shilin He (Chinese University of Hong Kong), Jinyang Liu (Sun Yat-Sen University), Pinjia He (Computer Science and Engineering, The Chinese University of Hong Kong), Qi Xie (Southwest Minzu University), Zibin Zheng (School of Data and Computer Science, Sun Yat-sen University), Michael Lyu
14:40, 20m Talk | Mining Historical Test Logs to Predict Bugs and Localize Faults in the Test Logs (Technical Track, Industry Program). Technical Track.
15:00, 20m Talk | DLFinder: Characterizing and Detecting Duplicate Logging Code Smells (Technical Track, Industry Program). Technical Track. Zhenhao Li (Concordia University), Tse-Hsun (Peter) Chen (Concordia University), Jinqiu Yang, Weiyi Shang (Concordia University, Canada)
15:20, 10m Talk | Discussion Period (Papers)