Liao Lin, Lu Congde, Gao Yujie, Yu Hao, Cai Biao
College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu 610059, China.
School of Mechanical and Electrical Engineering, Chengdu University of Technology, Chengdu 610059, China.
Sensors (Basel). 2025 May 20;25(10):3205. doi: 10.3390/s25103205.
In anomaly detection tasks, labeled defect data are often scarce. Unsupervised learning leverages only normal samples during training, making it particularly suitable for anomaly detection tasks. Among unsupervised methods, normalizing flow models have shown distinct advantages. They allow precise modeling of data distributions and enable direct computation of sample log-likelihoods. Recent work has largely focused on feature fusion strategies. However, most of the flow-based methods emphasize spatial information while neglecting the critical role of channel-wise features. To address this limitation, we propose GCAFlow, a novel flow-based model enhanced with a global context-aware channel attention mechanism. In addition, we design a hierarchical convolutional subnetwork to improve the probabilistic modeling capacity of the flow-based framework. This subnetwork supports more accurate estimation of data likelihoods and enhances anomaly detection performance. We evaluate GCAFlow on three benchmark anomaly detection datasets, and the results demonstrate that it consistently outperforms existing flow-based models in both accuracy and robustness. In particular, on the VisA dataset, GCAFlow achieves an image-level AUROC of 98.2% and a pixel-level AUROC of 99.0%.
在异常检测任务中,有标签的缺陷数据往往很稀缺。无监督学习在训练过程中仅利用正常样本,这使其特别适用于异常检测任务。在无监督方法中,归一化流模型已显示出明显优势。它们允许对数据分布进行精确建模,并能直接计算样本对数似然值。最近的工作主要集中在特征融合策略上。然而,大多数基于流的方法都强调空间信息,而忽略了通道级特征的关键作用。为解决这一局限性,我们提出了GCAFlow,这是一种通过全局上下文感知通道注意力机制增强的新型基于流的模型。此外,我们设计了一个分层卷积子网,以提高基于流的框架的概率建模能力。该子网支持更准确地估计数据似然值,并增强异常检测性能。我们在三个基准异常检测数据集上对GCAFlow进行了评估,结果表明,在准确性和鲁棒性方面,它始终优于现有的基于流的模型。特别是在VisA数据集上,GCAFlow实现了98.2%的图像级AUROC和99.0%的像素级AUROC。