• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于日志的异常检测中机器学习技术的综合研究。

A comprehensive study of machine learning techniques for log-based anomaly detection.

作者信息

Ali Shan, Boufaied Chaima, Bianculli Domenico, Branco Paula, Briand Lionel

机构信息

University of Ottawa, Ottawa, Canada.

Prince Sultan University, Riyadh, Saudi Arabia.

出版信息

Empir Softw Eng. 2025;30(5):129. doi: 10.1007/s10664-025-10669-3. Epub 2025 Jun 23.

DOI:10.1007/s10664-025-10669-3
PMID:40567922
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12185583/
Abstract

Growth in system complexity increases the need for automated techniques dedicated to different log analysis tasks such as Log-based Anomaly Detection (LAD). The latter has been widely addressed in the literature, mostly by means of a variety of deep learning techniques. However, despite their many advantages, that focus on deep learning techniques is somewhat arbitrary as traditional Machine Learning (ML) techniques may perform well in many cases, depending on the context and datasets. In the same vein, semi-supervised techniques deserve the same attention as supervised techniques since the former have clear practical advantages. Further, current evaluations mostly rely on the assessment of detection accuracy. However, this is not enough to decide whether or not a specific ML technique is suitable to address the LAD problem in a given context. Other aspects to consider include training and prediction times as well as the sensitivity to hyperparameter tuning, which in practice matters to engineers. In this paper, we present a comprehensive empirical study, in which we evaluate a wide array of supervised and semi-supervised, traditional and deep ML techniques w.r.t. four evaluation criteria: detection accuracy, time performance, sensitivity of detection accuracy and time performance to hyperparameter tuning. Our goal is to provide much stronger and comprehensive evidence regarding the relative advantages and drawbacks of alternative techniques for LAD. The experimental results show that supervised traditional and deep ML techniques fare similarly in terms of their detection accuracy and prediction time on most of the benchmark datasets considered in our study. Moreover, overall, sensitivity analysis to hyperparameter tuning with respect to detection accuracy shows that supervised traditional ML techniques are less sensitive than deep learning techniques. Further, semi-supervised techniques yield significantly worse detection accuracy than supervised techniques.

摘要

系统复杂性的增加使得对专门用于不同日志分析任务(如基于日志的异常检测(LAD))的自动化技术的需求也随之增加。后者在文献中已得到广泛探讨,主要是通过各种深度学习技术。然而,尽管深度学习技术有诸多优点,但仅关注深度学习技术在某种程度上是武断的,因为传统机器学习(ML)技术在许多情况下可能表现良好,这取决于具体的上下文和数据集。同样,半监督技术与监督技术一样值得关注,因为前者具有明显的实际优势。此外,当前的评估大多依赖于检测准确率的评估。然而,这不足以确定特定的ML技术在给定上下文中是否适合解决LAD问题。其他需要考虑的方面包括训练和预测时间以及对超参数调整的敏感性,而在实践中这些对工程师来说很重要。在本文中,我们进行了一项全面的实证研究,在研究中我们根据四个评估标准:检测准确率、时间性能、检测准确率和时间性能对超参数调整的敏感性,对一系列监督和半监督、传统和深度ML技术进行了评估。我们的目标是为LAD替代技术的相对优缺点提供更有力和全面的证据。实验结果表明,在我们研究中考虑的大多数基准数据集上,监督传统ML技术和深度ML技术在检测准确率和预测时间方面表现相似。此外,总体而言,关于检测准确率的超参数调整敏感性分析表明,监督传统ML技术比深度学习技术更不敏感。此外,半监督技术产生的检测准确率明显低于监督技术。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/4158a1cfd8cd/10664_2025_10669_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/96e392e8fd94/10664_2025_10669_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/b17106e37b46/10664_2025_10669_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/481a64019325/10664_2025_10669_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/c2526037dabc/10664_2025_10669_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/9b4a6fcb84e1/10664_2025_10669_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/fa8c6f847f05/10664_2025_10669_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/1dd3e94586fe/10664_2025_10669_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/cb29750c12a7/10664_2025_10669_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/6ed149114851/10664_2025_10669_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/274098bf73f1/10664_2025_10669_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/b84cdfd06f27/10664_2025_10669_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/4158a1cfd8cd/10664_2025_10669_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/96e392e8fd94/10664_2025_10669_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/b17106e37b46/10664_2025_10669_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/481a64019325/10664_2025_10669_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/c2526037dabc/10664_2025_10669_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/9b4a6fcb84e1/10664_2025_10669_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/fa8c6f847f05/10664_2025_10669_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/1dd3e94586fe/10664_2025_10669_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/cb29750c12a7/10664_2025_10669_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/6ed149114851/10664_2025_10669_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/274098bf73f1/10664_2025_10669_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/b84cdfd06f27/10664_2025_10669_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/324a/12185583/4158a1cfd8cd/10664_2025_10669_Fig12_HTML.jpg

相似文献

1
A comprehensive study of machine learning techniques for log-based anomaly detection.基于日志的异常检测中机器学习技术的综合研究。
Empir Softw Eng. 2025;30(5):129. doi: 10.1007/s10664-025-10669-3. Epub 2025 Jun 23.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗
Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.
4
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
7
Uterotonic agents for preventing postpartum haemorrhage: a network meta-analysis.预防产后出血的宫缩剂:一项网状Meta分析
Cochrane Database Syst Rev. 2018 Apr 25;4(4):CD011689. doi: 10.1002/14651858.CD011689.pub2.
8
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
9
Weed Detection Using Deep Learning: A Systematic Literature Review.基于深度学习的杂草检测:系统文献综述
Sensors (Basel). 2023 Mar 31;23(7):3670. doi: 10.3390/s23073670.
10
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

本文引用的文献

1
Impact of log parsing on deep learning-based anomaly detection.日志解析对基于深度学习的异常检测的影响。
Empir Softw Eng. 2024;29(6):139. doi: 10.1007/s10664-024-10533-w. Epub 2024 Aug 17.
2
Estimating the support of a high-dimensional distribution.估计高维分布的支撑集。
Neural Comput. 2001 Jul;13(7):1443-71. doi: 10.1162/089976601750264965.
3
Long short-term memory.长短期记忆
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.