Mining Historical Test Logs to Predict Bugs and Localize Faults in the Test Logs
Technical TrackIndustry Program
Software testing is an integral part of modern software development. However, test runs can produce 1000’s of lines of logged output that make it difficult to find the cause of a fault in the logs. This problem is exacerbated by environmental failures that distract from product faults. In this paper we present techniques with the goal of capturing the maximum number of product faults, while flagging the minimum number of log lines for inspection.
We observe that the location of a fault in a log should be contained in the lines of a failing test log. In contrast, a passing test log should not contain the lines related to a failure. Lines that occur in both a passing and failing log introduce noise when attempting to find the fault in a failing log. We remove the lines that occur in the passing log from the failing log.
After removing these lines, we use information retrieval techniques to flag the most probable lines for investigation. We modify TF-IDF to identify the most relevant log lines related to past product failures. We then vectorize the logs and develop an exclusive version of KNN to identify which logs are likely to lead to product faults and which lines are the most probable indication of the failure.
Our best approach, LogFaultFlagger finds 89% of the total faults and flags less than 1% of the total failed log lines for inspection, which drastically outperforms previous work. Our tool makes daily predictions to Ericsson basestation testers.
Wed 29 May
|14:00 - 14:20|
|14:20 - 14:40|
Jieming ZhuHuawei Noah's Ark Lab, Shilin HeChinese University of Hong Kong, Jinyang LiuSun Yat-Sen University, Pinjia HeComputer Science and Engineering, The Chinese University of Hong Kong, Qi XieSouthwest Minzu University, Zibin ZhengSchool of Data and Computer Science, Sun Yat-sen University, Michael Lyu
|14:40 - 15:00|
Mining Historical Test Logs to Predict Bugs and Localize Faults in the Test LogsTechnical TrackIndustry Program
|15:00 - 15:20|
|15:20 - 15:30|