• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数据分布和自助法设置对过程质量控制中使用孤立森林进行异常检测的影响

Impact of Data Distribution and Bootstrap Setting on Anomaly Detection Using Isolation Forest in Process Quality Control.

作者信息

Choi Hyunyul, Jung Kihyo

机构信息

School of Industrial and Management Engineering, Korea University, Seoul 02841, Republic of Korea.

School of Industrial Engineering, University of Ulsan, Ulsan 44610, Republic of Korea.

出版信息

Entropy (Basel). 2025 Jul 18;27(7):761. doi: 10.3390/e27070761.

DOI:10.3390/e27070761
PMID:40724477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12294628/
Abstract

This study investigates the impact of data distribution and bootstrap resampling on the anomaly detection performance of the Isolation Forest (iForest) algorithm in statistical process control. Although iForest has received attention for its multivariate and ensemble-based nature, its performance under non-normal data distributions and varying bootstrap settings remains underexplored. To address this gap, a comprehensive simulation was performed across 18 scenarios involving log-normal, gamma, and -distributions with different mean shift levels and bootstrap configurations. The results show that iForest substantially outperforms the conventional Hotelling's T control chart, especially in non-Gaussian settings and under small-to-medium process shifts. Enabling bootstrap resampling led to marginal improvements across classification metrics, including accuracy, precision, recall, F1-score, and average run length (ARL). However, a key limitation of iForest was its reduced sensitivity to subtle process changes, such as a 1σ mean shift, highlighting an area for future enhancement.

摘要

本研究调查了数据分布和自助重采样对统计过程控制中孤立森林(iForest)算法异常检测性能的影响。尽管iForest因其多变量和基于集成的特性而受到关注,但其在非正态数据分布和不同自助设置下的性能仍未得到充分探索。为了填补这一空白,我们在18种场景下进行了全面模拟,这些场景涉及具有不同均值偏移水平和自助配置的对数正态分布、伽马分布和t分布。结果表明,iForest显著优于传统的霍特林T控制图,特别是在非高斯设置和中小过程偏移情况下。启用自助重采样在包括准确率、精确率、召回率、F1分数和平均运行长度(ARL)在内的分类指标上带来了边际改进。然而,iForest的一个关键限制是其对细微过程变化(如1σ均值偏移)的敏感性降低,这突出了未来需要改进的一个领域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/fc07d424a67c/entropy-27-00761-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/14d07a944b81/entropy-27-00761-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/22a879ec1acf/entropy-27-00761-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/6c90df09b54b/entropy-27-00761-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/7f6f57a7e61e/entropy-27-00761-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/e846f418d008/entropy-27-00761-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/99af6dc25b26/entropy-27-00761-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/fc07d424a67c/entropy-27-00761-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/14d07a944b81/entropy-27-00761-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/22a879ec1acf/entropy-27-00761-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/6c90df09b54b/entropy-27-00761-g003a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/7f6f57a7e61e/entropy-27-00761-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/e846f418d008/entropy-27-00761-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/99af6dc25b26/entropy-27-00761-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/fc07d424a67c/entropy-27-00761-g007.jpg

相似文献

1
Impact of Data Distribution and Bootstrap Setting on Anomaly Detection Using Isolation Forest in Process Quality Control.数据分布和自助法设置对过程质量控制中使用孤立森林进行异常检测的影响
Entropy (Basel). 2025 Jul 18;27(7):761. doi: 10.3390/e27070761.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
4
Short-Term Memory Impairment短期记忆障碍
5
Reminiscence therapy for dementia.痴呆症的回忆疗法
Cochrane Database Syst Rev. 2018 Mar 1;3(3):CD001120. doi: 10.1002/14651858.CD001120.pub3.
6
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
7
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
8
Implementation of a knowledge-based decision support system for treatment plan auditing through automation.通过自动化实现基于知识的治疗计划审核决策支持系统。
Med Phys. 2023 Nov;50(11):6978-6989. doi: 10.1002/mp.16472. Epub 2023 May 21.
9
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
10
Sequential versus standard triple first-line therapy for Helicobacter pylori eradication.用于根除幽门螺杆菌的序贯疗法与标准三联一线疗法对比
Cochrane Database Syst Rev. 2016 Jun 28;2016(6):CD009034. doi: 10.1002/14651858.CD009034.pub2.

本文引用的文献

1
Estimating the support of a high-dimensional distribution.估计高维分布的支撑集。
Neural Comput. 2001 Jul;13(7):1443-71. doi: 10.1162/089976601750264965.