Park YeongHyeon, Kang Sungho, Kim Myung Jin, Lee Yeonho, Kim Hyeong Seok, Yi Juneho
Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, 16419, Republic of Korea.
SK Planet Co., Ltd., Seongnam, 13487, Republic of Korea.
Sci Rep. 2024 Aug 14;14(1):18872. doi: 10.1038/s41598-024-69698-5.
Due to scarcity of anomaly situations in the early manufacturing stage, an unsupervised anomaly detection (UAD) approach is widely adopted which only uses normal samples for training. This approach is based on the assumption that the trained UAD model will accurately reconstruct normal patterns but struggles with unseen anomalies. To enhance the UAD performance, reconstruction-by-inpainting based methods have recently been investigated, especially on the masking strategy of suspected defective regions. However, there are still issues to overcome: (1) time-consuming inference due to multiple masking, (2) output inconsistency by random masking, and (3) inaccurate reconstruction of normal patterns for large masked areas. Motivated by this, this study proposes a novel reconstruction-by-inpainting method, dubbed Excision And Recovery (EAR), that features single deterministic masking based on the ImageNet pre-trained DINO-ViT and visual obfuscation for hint-providing. Experimental results on the MVTec AD dataset show that deterministic masking by pre-trained attention effectively cuts out suspected defective regions and resolves the aforementioned issues 1 and 2. Also, hint-providing by mosaicing proves to enhance the performance than emptying those regions by binary masking, thereby overcomes issue 3. The proposed approach achieves a high performance without any change of the model structure. Promising results are shown through laboratory tests with public industrial datasets. To suggest EAR be possibly adopted in various industries as a practically deployable solution, future steps include evaluating its applicability in relevant manufacturing environments.
由于在制造早期异常情况稀缺,一种无监督异常检测(UAD)方法被广泛采用,该方法仅使用正常样本进行训练。这种方法基于这样的假设,即训练后的UAD模型将准确重建正常模式,但难以处理未见过的异常情况。为了提高UAD的性能,最近研究了基于图像修复的重建方法,特别是关于可疑缺陷区域的掩蔽策略。然而,仍有一些问题需要克服:(1)由于多次掩蔽导致推理耗时,(2)随机掩蔽导致输出不一致,以及(3)对于大的掩蔽区域,正常模式的重建不准确。受此启发,本研究提出了一种新颖的基于图像修复的重建方法,称为切除与恢复(EAR),其特点是基于ImageNet预训练的DINO-ViT进行单一确定性掩蔽,并通过视觉混淆提供提示。在MVTec AD数据集上的实验结果表明,通过预训练注意力进行确定性掩蔽有效地切除了可疑缺陷区域,并解决了上述问题1和2。此外,通过拼接提供提示被证明比通过二值掩蔽清空这些区域更能提高性能,从而克服了问题3。所提出的方法在不改变模型结构的情况下实现了高性能。通过使用公共工业数据集的实验室测试展示了有希望的结果。为了表明EAR可能作为一种实际可部署的解决方案在各个行业中被采用,未来的步骤包括评估其在相关制造环境中的适用性。