Authors
Qingwei Lin, Hongyu Zhang, Jian-Guang Lou, Yu Zhang, Xuewei Chen
Publication date
2016/5/14
Book
Proceedings of the 38th International Conference on Software Engineering Companion
Pages
102-111
Description
Logs play an important role in the maintenance of large-scale online service systems. When an online service fails, engineers need to examine recorded logs to gain insights into the failure and identify the potential problems. Traditionally, engineers perform simple keyword search (such as "error" and "exception") of logs that may be associated with the failures. Such an approach is often time consuming and error prone. Through our collaboration with Microsoft service product teams, we propose LogCluster, an approach that clusters the logs to ease log-based problem identification. LogCluster also utilizes a knowledge base to check if the log sequences occurred before. Engineers only need to examine a small number of previously unseen, representative log sequences extracted from the clusters to identify a problem, thus significantly reducing the number of logs that should be examined, meanwhile improving the …
Total citations
201720182019202020212022202320248154464918411356
Scholar articles
Q Lin, H Zhang, JG Lou, Y Zhang, X Chen - Proceedings of the 38th International Conference on …, 2016