Authors
Yingzhe Lyu, Gopi Krishnan Rajbahadur, Dayi Lin, Boyuan Chen, Zhen Ming Jiang
Publication date
2021/11/15
Journal
ACM Transactions on Software Engineering and Methodology (TOSEM)
Volume
31
Issue
1
Pages
1-38
Publisher
ACM
Description
Artificial Intelligence for IT Operations (AIOps) has been adopted in organizations in various tasks, including interpreting models to identify indicators of service failures. To avoid misleading practitioners, AIOps model interpretations should be consistent (i.e., different AIOps models on the same task agree with one another on feature importance). However, many AIOps studies violate established practices in the machine learning community when deriving interpretations, such as interpreting models with suboptimal performance, though the impact of such violations on the interpretation consistency has not been studied.
In this article, we investigate the consistency of AIOps model interpretation along three dimensions: internal consistency, external consistency, and time consistency. We conduct a case study on two AIOps tasks: predicting Google cluster job failures and Backblaze hard drive failures. We find that the …
Total citations
2022202320242115
Scholar articles
Y Lyu, GK Rajbahadur, D Lin, B Chen, ZM Jiang - ACM Transactions on Software Engineering and …, 2021