Authors
Bo Zhang, Hongyu Zhang, Van-Hoang Le, Pablo Moscato, Aozhong Zhang
Publication date
2023/5
Journal
Automated Software Engineering
Volume
30
Issue
1
Pages
4
Publisher
Springer US
Description
Large-scale software-intensive systems often generate logs for troubleshooting purpose. The system logs are semi-structured text messages that record the internal status of a system at runtime. In this paper, we propose ADR (Anomaly Detection by workflow Relations), which can mine numerical relations from logs and then utilize the discovered relations to detect system anomalies. Firstly the raw log entries are parsed into sequences of log events and transformed to an extended event-count-matrix. The relations among the matrix columns represent the relations among the system events in workflows. Next, ADR evaluates the matrix’s nullspace that corresponds to the linearly dependent relations of the columns. Anomalies can be detected by evaluating whether or not the logs violate the mined relations. We design two types of ADR: sADR (for semi-supervised learning) and uADR (for unsupervised learning). We …
Total citations
2023202443
Scholar articles