Huang Zongyuan, Zhang Baohua, Hu Guoqiang, Li Longyuan, Xu Yanyan, Jin Yaohui
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14754-14769. doi: 10.1109/TNNLS.2023.3281501. Epub 2024 Oct 7.
Anomaly detection plays a crucial role in various real-world applications, including healthcare and finance systems. Owing to the limited number of anomaly labels in these complex systems, unsupervised anomaly detection methods have attracted great attention in recent years. Two major challenges faced by the existing unsupervised methods are as follows: 1) distinguishing between normal and abnormal data when they are highly mixed together and 2) defining an effective metric to maximize the gap between normal and abnormal data in a hypothesis space, which is built by a representation learner. To that end, this work proposes a novel scoring network with a score-guided regularization to learn and enlarge the anomaly score disparities between normal and abnormal data, enhancing the capability of anomaly detection. With such score-guided strategy, the representation learner can gradually learn more informative representation during the model training stage, especially for the samples in the transition field. Moreover, the scoring network can be incorporated into most of the deep unsupervised representation learning (URL)-based anomaly detection models and enhances them as a plug-in component. We next integrate the scoring network into an autoencoder (AE) and four state-of-the-art models to demonstrate the effectiveness and transferability of the design. These score-guided models are collectively called SG-Models. Extensive experiments on both synthetic and real-world datasets confirm the state-of-the-art performance of SG-Models.
异常检测在包括医疗保健和金融系统在内的各种实际应用中发挥着至关重要的作用。由于这些复杂系统中异常标签数量有限,无监督异常检测方法近年来备受关注。现有无监督方法面临的两个主要挑战如下:1)当正常数据和异常数据高度混合在一起时,区分它们;2)定义一个有效的度量标准,以最大化由表示学习器构建的假设空间中正常数据和异常数据之间的差距。为此,这项工作提出了一种具有分数引导正则化的新型评分网络,以学习和扩大正常数据和异常数据之间的异常分数差异,增强异常检测能力。通过这种分数引导策略,表示学习器可以在模型训练阶段逐渐学习到更多信息丰富的表示,特别是对于过渡区域中的样本。此外,评分网络可以集成到大多数基于深度无监督表示学习(URL)的异常检测模型中,并作为一个插件组件增强它们的性能。接下来,我们将评分网络集成到一个自动编码器(AE)和四个先进模型中,以证明该设计的有效性和可迁移性。这些分数引导模型统称为SG模型。在合成数据集和真实世界数据集上进行的大量实验证实了SG模型的领先性能。