Suppr超能文献

数据分布和自助法设置对过程质量控制中使用孤立森林进行异常检测的影响

Impact of Data Distribution and Bootstrap Setting on Anomaly Detection Using Isolation Forest in Process Quality Control.

作者信息

Choi Hyunyul, Jung Kihyo

机构信息

School of Industrial and Management Engineering, Korea University, Seoul 02841, Republic of Korea.

School of Industrial Engineering, University of Ulsan, Ulsan 44610, Republic of Korea.

出版信息

Entropy (Basel). 2025 Jul 18;27(7):761. doi: 10.3390/e27070761.

Abstract

This study investigates the impact of data distribution and bootstrap resampling on the anomaly detection performance of the Isolation Forest (iForest) algorithm in statistical process control. Although iForest has received attention for its multivariate and ensemble-based nature, its performance under non-normal data distributions and varying bootstrap settings remains underexplored. To address this gap, a comprehensive simulation was performed across 18 scenarios involving log-normal, gamma, and -distributions with different mean shift levels and bootstrap configurations. The results show that iForest substantially outperforms the conventional Hotelling's T control chart, especially in non-Gaussian settings and under small-to-medium process shifts. Enabling bootstrap resampling led to marginal improvements across classification metrics, including accuracy, precision, recall, F1-score, and average run length (ARL). However, a key limitation of iForest was its reduced sensitivity to subtle process changes, such as a 1σ mean shift, highlighting an area for future enhancement.

摘要

本研究调查了数据分布和自助重采样对统计过程控制中孤立森林(iForest)算法异常检测性能的影响。尽管iForest因其多变量和基于集成的特性而受到关注,但其在非正态数据分布和不同自助设置下的性能仍未得到充分探索。为了填补这一空白,我们在18种场景下进行了全面模拟,这些场景涉及具有不同均值偏移水平和自助配置的对数正态分布、伽马分布和t分布。结果表明,iForest显著优于传统的霍特林T控制图,特别是在非高斯设置和中小过程偏移情况下。启用自助重采样在包括准确率、精确率、召回率、F1分数和平均运行长度(ARL)在内的分类指标上带来了边际改进。然而,iForest的一个关键限制是其对细微过程变化(如1σ均值偏移)的敏感性降低,这突出了未来需要改进的一个领域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72c3/12294628/14d07a944b81/entropy-27-00761-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验