Khan Zanis Ali, Shin Donghwan, Bianculli Domenico, Briand Lionel C
University of Luxembourg, Esch-sur-Alzette, Luxembourg.
University of Sheffield, Sheffield, United Kingdom.
Empir Softw Eng. 2024;29(6):139. doi: 10.1007/s10664-024-10533-w. Epub 2024 Aug 17.
Software systems log massive amounts of data, recording important runtime information. Such logs are used, for example, for log-based anomaly detection, which aims to automatically detect abnormal behaviors of the system under analysis by processing the information recorded in its logs. Many log-based anomaly detection techniques based on deep learning models include a pre-processing step called log parsing. However, understanding the impact of log parsing on the accuracy of anomaly detection techniques has received surprisingly little attention so far. Investigating what are the key properties log parsing techniques should ideally have to help anomaly detection is therefore warranted. In this paper, we report on a comprehensive empirical study on the impact of log parsing on anomaly detection accuracy, using 13 log parsing techniques, seven anomly detection techniques (five based on deep learning and two based on traditional machine learning) on three publicly available log datasets. Our empirical results show that, despite what is widely assumed, there is no strong correlation between log parsing accuracy and anomaly detection accuracy, regardless of the metric used for measuring log parsing accuracy. Moreover, we experimentally confirm existing theoretical results showing that it is a property that we refer to as distinguishability in log parsing results-as opposed to their accuracy-that plays an essential role in achieving accurate anomaly detection.
软件系统会记录大量数据,记录重要的运行时信息。例如,此类日志用于基于日志的异常检测,其目的是通过处理分析系统日志中记录的信息来自动检测被分析系统的异常行为。许多基于深度学习模型的基于日志的异常检测技术都包括一个称为日志解析的预处理步骤。然而,到目前为止,了解日志解析对异常检测技术准确性的影响却出奇地受到很少关注。因此,有必要研究日志解析技术理想情况下应具备哪些关键属性以帮助进行异常检测。在本文中,我们报告了一项关于日志解析对异常检测准确性影响的全面实证研究,使用了13种日志解析技术、七种异常检测技术(五种基于深度学习,两种基于传统机器学习)以及三个公开可用的日志数据集。我们的实证结果表明,尽管人们普遍认为,但无论用于衡量日志解析准确性的指标如何,日志解析准确性与异常检测准确性之间都没有很强的相关性。此外,我们通过实验证实了现有的理论结果,即我们在日志解析结果中称为可区分性的属性——而不是其准确性——在实现准确的异常检测中起着至关重要的作用。