用于可见光-红外行人检测的轮廓信息引导多尺度特征检测方法

Contour Information-Guided Multi-Scale Feature Detection Method for Visible-Infrared Pedestrian Detection.

作者信息

Xu Xiaoyu, Zhan Weida, Zhu Depeng, Jiang Yichun, Chen Yu, Guo Jinxin

机构信息

National Demonstration Center for Experimental Electrical, School of Electronic and Information Engineering, Changchun University of Science and Technology, Changchun 130022, China.

出版信息

Entropy (Basel). 2023 Jul 4;25(7):1022. doi: 10.3390/e25071022.

DOI:10.3390/e25071022

PMID:37509969

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10378104/

Abstract

Infrared pedestrian target detection is affected by factors such as the low resolution and contrast of infrared pedestrian images, as well as the complexity of the background and the presence of multiple targets occluding each other, resulting in indistinct target features. To address these issues, this paper proposes a method to enhance the accuracy of pedestrian target detection by employing contour information to guide multi-scale feature detection. This involves analyzing the shapes and edges of the targets in infrared images at different scales to more accurately identify and differentiate them from the background and other targets. First, we propose a preprocessing method to suppress background interference and extract color information from visible images. Second, we propose an information fusion residual block combining a U-shaped structure and residual connection to form a feature extraction network. Then, we propose an attention mechanism based on a contour information-guided approach to guide the network to extract the depth features of pedestrian targets. Finally, we use the clustering method of mIoU to generate anchor frame sizes applicable to the KAIST pedestrian dataset and propose a hybrid loss function to enhance the network's adaptability to pedestrian targets. The extensive experimental results show that the method proposed in this paper outperforms other comparative algorithms in pedestrian detection, proving its superiority.

摘要

红外行人目标检测受到多种因素的影响，如红外行人图像的低分辨率和对比度、背景的复杂性以及多个目标相互遮挡的情况，导致目标特征不清晰。为了解决这些问题，本文提出了一种利用轮廓信息引导多尺度特征检测来提高行人目标检测准确性的方法。这包括在不同尺度下分析红外图像中目标的形状和边缘，以便更准确地将它们与背景和其他目标区分开来。首先，我们提出一种预处理方法来抑制背景干扰并从可见光图像中提取颜色信息。其次，我们提出一种结合U形结构和残差连接的信息融合残差块来形成特征提取网络。然后，我们提出一种基于轮廓信息引导的注意力机制，引导网络提取行人目标的深度特征。最后，我们使用mIoU的聚类方法生成适用于KAIST行人数据集的锚框尺寸，并提出一种混合损失函数来增强网络对行人目标的适应性。大量实验结果表明，本文提出的方法在行人检测方面优于其他对比算法，证明了其优越性。