IEEE Trans Med Imaging. 2023 Dec;42(12):3860-3870. doi: 10.1109/TMI.2023.3314318. Epub 2023 Nov 30.
Anomaly detection (AD) aims to determine if an instance has properties different from those seen in normal cases. The success of this technique depends on how well a neural network learns from normal instances. We observe that the learning difficulty scales exponentially with the input resolution, making it infeasible to apply AD to high-resolution images. Resizing them to a lower resolution is a compromising solution and does not align with clinical practice where the diagnosis could depend on image details. In this work, we propose to train the network and perform inference at the patch level, through the sliding window algorithm. This simple operation allows the network to receive high-resolution images but introduces additional training difficulties, including inconsistent image structure and higher variance. We address these concerns by setting the network's objective to learn augmentation-invariant features. We further study the augmentation function in the context of medical imaging. In particular, we observe that the resizing operation, a key augmentation in general computer vision literature, is detrimental to detection accuracy, and the inverting operation can be beneficial. We also propose a new module that encourages the network to learn from adjacent patches to boost detection performance. Extensive experiments are conducted on breast tomosynthesis and chest X-ray datasets and our method improves 8.03% and 5.66% AUC on image-level classification respectively over the current leading techniques. The experimental results demonstrate the effectiveness of our approach.
异常检测(AD)旨在确定实例是否具有与正常情况下看到的属性不同的属性。该技术的成功取决于神经网络从正常实例中学习的效果如何。我们观察到,学习难度随输入分辨率呈指数级增长,使得 AD 无法应用于高分辨率图像。将它们调整为较低的分辨率是一种妥协的解决方案,不符合临床实践,因为诊断可能取决于图像细节。在这项工作中,我们建议通过滑动窗口算法在补丁级别上训练网络并进行推理。这个简单的操作允许网络接收高分辨率图像,但会引入额外的训练困难,包括不一致的图像结构和更高的方差。我们通过将网络的目标设置为学习增强不变特征来解决这些问题。我们进一步研究了医学成像背景下的增强功能。特别是,我们观察到在一般计算机视觉文献中关键的增强操作,即调整大小操作,对检测精度不利,而反转操作可能是有益的。我们还提出了一个新的模块,鼓励网络从相邻的补丁中学习,以提高检测性能。我们在乳房断层合成和胸部 X 射线数据集上进行了广泛的实验,我们的方法在图像级分类上分别提高了 8.03%和 5.66%的 AUC,优于当前领先的技术。实验结果证明了我们方法的有效性。