Khor WeeLiam, Chen Yichen Kelly, Roberts Michael, Ciampa Francesco
Department of Mechanical Engineering Sciences, University of Surrey, Guildford, GU2 7XH, UK.
Department of Technology, Design and Environment, Oxford Brookes University, Wheatley, OX33 1HX, UK.
Sci Rep. 2024 Apr 9;14(1):8353. doi: 10.1038/s41598-024-56636-8.
This paper presents a study on the effectiveness of a convolutional neural network (CNN) in classifying infrared images for security scanning. Infrared thermography was explored as a non-invasive security scanner for stand-off and walk-through concealed object detection. Heat generated by human subjects radiates off the clothing surface, allowing detection by an infrared camera. However, infrared lacks in penetration capability compared to longer electromagnetic waves, leading to less obvious visuals on the clothing surface. ResNet-50 was used as the CNN model to automate the classification process of thermal images. The ImageNet database was used to pre-train the model, which was further fine-tuned using infrared images obtained from experiments. Four image pre-processing approaches were explored, i.e., raw infrared image, subject cropped region-of-interest (ROI) image, K-means, and Fuzzy-c clustered images. All these approaches were evaluated using the receiver operating characteristic curve on an internal holdout set, with an area-under-the-curve of 0.8923, 0.9256, 0.9485, and 0.9669 for the raw image, ROI cropped, K-means, and Fuzzy-c models, respectively. The CNN models trained using various image pre-processing approaches suggest that the prediction performance can be improved by the removal of non-decision relevant information and the visual highlighting of features.
本文介绍了一项关于卷积神经网络(CNN)在安全扫描红外图像分类中的有效性的研究。红外热成像技术被用作一种非侵入式安全扫描仪,用于远距离和步行通过式隐藏物体检测。人体散发的热量从衣物表面辐射出去,从而可以被红外摄像机检测到。然而,与更长的电磁波相比,红外的穿透能力较弱,导致衣物表面的视觉效果不那么明显。ResNet-50被用作CNN模型来自动化热图像的分类过程。使用ImageNet数据库对模型进行预训练,并使用从实验中获得的红外图像对其进行进一步微调。探索了四种图像预处理方法,即原始红外图像、主体裁剪后的感兴趣区域(ROI)图像、K均值图像和模糊C聚类图像。所有这些方法都在内部保留集上使用接收者操作特征曲线进行评估,原始图像、ROI裁剪图像、K均值图像和模糊C模型的曲线下面积分别为0.8923、0.9256、0.9485和0.9669。使用各种图像预处理方法训练的CNN模型表明,通过去除与决策无关的信息和突出显示特征的视觉效果,可以提高预测性能。