Liu Fangcen, Gao Chenqiang, Chen Fang, Meng Deyu, Zuo Wangmeng, Gao Xinbo
IEEE Trans Image Process. 2023;32:5921-5932. doi: 10.1109/TIP.2023.3326396. Epub 2023 Nov 1.
The infrared small and dim (S&D) target detection is one of the key techniques in the infrared search and tracking system. Since the local regions similar to infrared S&D targets spread over the whole background, exploring the correlation amongst image features in large-range dependencies to mine the difference between the target and background is crucial for robust detection. However, existing deep learning-based methods are limited by the locality of convolutional neural networks, which impairs the ability to capture large-range dependencies. Additionally, the S&D appearance of the infrared target makes the detection model highly possible to miss detection. To this end, we propose a robust and general infrared S&D target detection method with the transformer. We adopt the self-attention mechanism of the transformer to learn the correlation of image features in a larger range. Moreover, we design a feature enhancement module to learn discriminative features of S&D targets to avoid miss-detections. After that, to avoid the loss of the target information, we adopt a decoder with the U-Net-like skip connection operation to contain more information of S&D targets. Finally, we get the detection result by a segmentation head. Extensive experiments on two public datasets show the obvious superiority of the proposed method over state-of-the-art methods, and the proposed method has a stronger generalization ability and better noise tolerance.
红外小目标与暗目标(S&D)检测是红外搜索与跟踪系统中的关键技术之一。由于类似于红外S&D目标的局部区域散布在整个背景中,探索大范围依赖关系中图像特征之间的相关性以挖掘目标与背景之间的差异对于稳健检测至关重要。然而,现有的基于深度学习的方法受到卷积神经网络局部性的限制,这削弱了捕获大范围依赖关系的能力。此外,红外目标的S&D外观使得检测模型极有可能漏检。为此,我们提出一种基于Transformer的稳健通用红外S&D目标检测方法。我们采用Transformer的自注意力机制在更大范围内学习图像特征的相关性。此外,我们设计了一个特征增强模块来学习S&D目标的判别特征以避免漏检。之后,为避免目标信息丢失,我们采用具有类似U-Net跳跃连接操作的解码器来包含更多S&D目标的信息。最后,我们通过一个分割头得到检测结果。在两个公共数据集上进行的大量实验表明,所提方法相对于现有方法具有明显优势,且所提方法具有更强的泛化能力和更好的噪声容忍度。