基于深度学习的远距离监测系统中短波红外目标检测：一种自动化的跨谱方法。

Deep Learning Based SWIR Object Detection in Long-Range Surveillance Systems: An Automated Cross-Spectral Approach.

机构信息

School of Electrical Engineering, University of Belgrade, Bul. Kralja Aleksandara 73, 11120 Belgrade, Serbia.

Vlatacom Institute of High Technologies, Milutina Milankovica 5, 11070 Belgrade, Serbia.

出版信息

Sensors (Basel). 2022 Mar 27;22(7):2562. doi: 10.3390/s22072562.

DOI:10.3390/s22072562

PMID:35408177

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9002380/

Abstract

SWIR imaging bears considerable advantages over visible-light (color) and thermal images in certain challenging propagation conditions. Thus, the SWIR imaging channel is frequently used in multi-spectral imaging systems (MSIS) for long-range surveillance in combination with color and thermal imaging to improve the probability of correct operation in various day, night and climate conditions. Integration of deep-learning (DL)-based real-time object detection in MSIS enables an increase in efficient utilization for complex long-range surveillance solutions such as border or critical assets control. Unfortunately, a lack of datasets for DL-based object detection models training for the SWIR channel limits their performance. To overcome this, by using the MSIS setting we propose a new cross-spectral automatic data annotation methodology for SWIR channel training dataset creation, in which the visible-light channel provides a source for detecting object types and bounding boxes which are then transformed to the SWIR channel. A mathematical image transformation that overcomes differences between the SWIR and color channel and their image distortion effects for various magnifications are explained in detail. With the proposed cross-spectral methodology, the goal of the paper is to improve object detection in SWIR images captured in challenging outdoor scenes. Experimental tests for two object types (cars and persons) using a state-of-the-art YOLOX model demonstrate that retraining with the proposed automatic cross-spectrally created SWIR image dataset significantly improves average detection precision. We achieved excellent improvements in detection performance in various variants of the YOLOX model (nano, tiny and x).

摘要

SWIR 成像在某些具有挑战性的传播条件下比可见光（彩色）和热像具有更大的优势。因此，SWIR 成像通道在多光谱成像系统（MSIS）中经常与彩色和热成像结合使用，以提高在各种白天、夜间和气候条件下正确运行的概率。在 MSIS 中集成基于深度学习（DL）的实时目标检测可以提高复杂远程监控解决方案（如边界或关键资产控制）的有效利用率。不幸的是，用于 SWIR 通道的基于 DL 的目标检测模型训练缺乏数据集，限制了它们的性能。为了克服这一问题，我们使用 MSIS 设置提出了一种新的跨光谱自动数据注释方法，用于创建 SWIR 通道训练数据集，其中可见光通道提供了检测目标类型和边界框的来源，然后将其转换到 SWIR 通道。详细解释了一种克服 SWIR 和彩色通道之间差异以及它们对各种放大倍数的图像失真效果的数学图像变换。通过提出的跨光谱方法，本文的目标是提高在具有挑战性的户外场景中捕获的 SWIR 图像中的目标检测。使用最先进的 YOLOX 模型对两种目标类型（汽车和人员）进行的实验测试表明，使用所提出的自动跨光谱创建的 SWIR 图像数据集进行重新训练可以显著提高平均检测精度。我们在 YOLOX 模型的各种变体（纳米、微小和 x）中实现了出色的检测性能改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1baf/9002380/f512d39d2884/sensors-22-02562-g0A1.jpg

相似文献

Deep Learning Based SWIR Object Detection in Long-Range Surveillance Systems: An Automated Cross-Spectral Approach.

Sensors (Basel). 2022 Mar 27;22(7):2562. doi: 10.3390/s22072562.

Multi-Object Tracking on SWIR Images for City Surveillance in an Edge-Computing Environment.

Sensors (Basel). 2023 Jul 13;23(14):6373. doi: 10.3390/s23146373.

Object recognition in medical images via anatomy-guided deep learning.

Med Image Anal. 2022 Oct;81:102527. doi: 10.1016/j.media.2022.102527. Epub 2022 Jun 25.

An Edge-Based Selection Method for Improving Regions-of-Interest Localizations Obtained Using Multiple Deep Learning Object-Detection Models in Breast Ultrasound Images.

Sensors (Basel). 2022 Sep 6;22(18):6721. doi: 10.3390/s22186721.

Short-wave infrared polarimetric image reconstruction using a deep convolutional neural network based on a high-frequency correlation.

Appl Opt. 2022 Aug 20;61(24):7163-7172. doi: 10.1364/AO.460752.

Automatic creation of annotations for chest radiographs based on the positional information extracted from radiographic image reports.

Comput Methods Programs Biomed. 2021 Sep;209:106331. doi: 10.1016/j.cmpb.2021.106331. Epub 2021 Aug 4.

GMLM-CNN: A Hybrid Solution to SWIR-VIS Face Verification with Limited Imagery.

Sensors (Basel). 2022 Dec 5;22(23):9500. doi: 10.3390/s22239500.

Combining deep learning with chemometrics when it is really needed: A case of real time object detection and spectral model application for spectral image processing.

Anal Chim Acta. 2022 Apr 15;1202:339668. doi: 10.1016/j.aca.2022.339668. Epub 2022 Mar 1.

Bacterial image analysis using multi-task deep learning approaches for clinical microscopy.

PeerJ Comput Sci. 2024 Aug 8;10:e2180. doi: 10.7717/peerj-cs.2180. eCollection 2024.

Bi-channel image registration and deep-learning segmentation (BIRDS) for efficient, versatile 3D mapping of mouse brain.

Elife. 2021 Jan 18;10:e63455. doi: 10.7554/eLife.63455.

引用本文的文献

Multi-Object Tracking on SWIR Images for City Surveillance in an Edge-Computing Environment.

Sensors (Basel). 2023 Jul 13;23(14):6373. doi: 10.3390/s23146373.

本文引用的文献

EDGE20: A Cross Spectral Evaluation Dataset for Multiple Surveillance Problems.

IEEE Winter Conf Appl Comput Vis. 2020 May 14;2020 IEEE Winter Conference on Applications of Computer Vision:2674-2683. doi: 10.1109/wacv45572.2020.9093573.

Signal Processing Platform for Long-Range Multi-Spectral Electro-Optical Systems.

Sensors (Basel). 2022 Feb 8;22(3):1294. doi: 10.3390/s22031294.

Thermal Imager Range: Predictions, Expectations, and Reality.

Sensors (Basel). 2019 Jul 28;19(15):3313. doi: 10.3390/s19153313.

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.

IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.

Pedestrian detection in far-infrared daytime images using a hierarchical codebook of SURF.

Sensors (Basel). 2015 Apr 13;15(4):8570-94. doi: 10.3390/s150408570.