基于全自动化 DCNN 的热图像注释，使用基于 RGB 数据预训练的神经网络。

Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data.

机构信息

Robotics and AI Research Group, Faculty of Electrical Engineering, Brno University of Technology, 61600 Brno, Czech Republic.

Cybernetics and Robotics Research Group, Central European Institute of Technology, Brno University of Technology, 61600 Brno, Czech Republic.

出版信息

Sensors (Basel). 2021 Feb 23;21(4):1552. doi: 10.3390/s21041552.

DOI:10.3390/s21041552

PMID:33672344

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7926581/

Abstract

One of the biggest challenges of training deep neural network is the need for massive data annotation. To train the neural network for object detection, millions of annotated training images are required. However, currently, there are no large-scale thermal image datasets that could be used to train the state of the art neural networks, while voluminous RGB image datasets are available. This paper presents a method that allows to create hundreds of thousands of annotated thermal images using the RGB pre-trained object detector. A dataset created in this way can be used to train object detectors with improved performance. The main gain of this work is the novel method for fully automatic thermal image labeling. The proposed system uses the RGB camera, thermal camera, 3D LiDAR, and the pre-trained neural network that detects objects in the RGB domain. Using this setup, it is possible to run the fully automated process that annotates the thermal images and creates the automatically annotated thermal training dataset. As the result, we created a dataset containing hundreds of thousands of annotated objects. This approach allows to train deep learning models with similar performance as the common human-annotation-based methods do. This paper also proposes several improvements to fine-tune the results with minimal human intervention. Finally, the evaluation of the proposed solution shows that the method gives significantly better results than training the neural network with standard small-scale hand-annotated thermal image datasets.

摘要

训练深度神经网络的最大挑战之一是需要大量的数据标注。为了训练用于目标检测的神经网络，需要数百万张标注的训练图像。然而，目前还没有可以用于训练最先进的神经网络的大规模热图像数据集，而大量的 RGB 图像数据集是可用的。本文提出了一种使用 RGB 预训练的目标检测器创建数十万张标注热图像的方法。以这种方式创建的数据集可用于训练性能得到提高的目标检测器。这项工作的主要成果是一种全新的全自动热图像标注方法。所提出的系统使用 RGB 相机、热相机、3D LiDAR 和在 RGB 域中检测物体的预训练神经网络。使用这种设置，可以运行全自动流程，对热图像进行标注，并创建自动标注的热训练数据集。结果，我们创建了一个包含数十万标注对象的数据集。这种方法可以训练出与常见的基于人工标注的方法具有相似性能的深度学习模型。本文还提出了几种改进方法，可以在最小的人工干预下微调结果。最后，对所提出的解决方案的评估表明，与使用标准的小规模手动标注热图像数据集训练神经网络相比，该方法的效果显著更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad56/7926581/404296f80616/sensors-21-01552-g001.jpg

相似文献

Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data.基于全自动化 DCNN 的热图像注释，使用基于 RGB 数据预训练的神经网络。

Sensors (Basel). 2021 Feb 23;21(4):1552. doi: 10.3390/s21041552.

Semi-supervised training of deep convolutional neural networks with heterogeneous data and few local annotations: An experiment on prostate histopathology image classification.基于异构数据和少量局部标注的深度卷积神经网络的半监督学习：前列腺组织病理学图像分类实验。

Med Image Anal. 2021 Oct;73:102165. doi: 10.1016/j.media.2021.102165. Epub 2021 Jul 14.

A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。

Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.

Mid-fusion of road scene polarization images on pretrained RGB neural networks.基于预训练 RGB 神经网络的道路场景极化图像中间融合。

J Opt Soc Am A Opt Image Sci Vis. 2021 Apr 1;38(4):515-525. doi: 10.1364/JOSAA.413604.

Ossification area localization in pediatric hand radiographs using deep neural networks for object detection.使用基于深度学习的目标检测方法对小儿手部 X 线片中的骨化区域进行定位。

PLoS One. 2018 Nov 16;13(11):e0207496. doi: 10.1371/journal.pone.0207496. eCollection 2018.

A paced multi-stage block-wise approach for object detection in thermal images.一种用于热图像目标检测的基于块的多阶段步长方法。

Vis Comput. 2023;39(6):2347-2363. doi: 10.1007/s00371-022-02445-x. Epub 2022 Apr 7.

Novel Transfer Learning Approach for Medical Imaging with Limited Labeled Data.用于有限标注数据的医学成像的新型迁移学习方法。

Cancers (Basel). 2021 Mar 30;13(7):1590. doi: 10.3390/cancers13071590.

Automated location invariant animal detection in camera trap images using publicly available data sources.利用公开可用数据源在相机陷阱图像中进行自动位置不变动物检测。

Ecol Evol. 2021 Mar 10;11(9):4494-4506. doi: 10.1002/ece3.7344. eCollection 2021 May.

RGB-D Object Recognition Using Multi-Modal Deep Neural Network and DS Evidence Theory.基于多模态深度神经网络和证据理论的 RGB-D 目标识别。

Sensors (Basel). 2019 Jan 27;19(3):529. doi: 10.3390/s19030529.

UAV-Based Image and LiDAR Fusion for Pavement Crack Segmentation.基于无人机的图像与激光雷达融合用于路面裂缝分割

Sensors (Basel). 2023 Nov 21;23(23):9315. doi: 10.3390/s23239315.

引用本文的文献

Auxiliary diagnosis of primary bone tumors based on Machine learning model.基于机器学习模型的原发性骨肿瘤辅助诊断

J Bone Oncol. 2024 Nov 9;49:100648. doi: 10.1016/j.jbo.2024.100648. eCollection 2024 Dec.

Brno urban dataset: Winter extension.布尔诺城市数据集：冬季扩展。

Data Brief. 2021 Dec 3;40:107667. doi: 10.1016/j.dib.2021.107667. eCollection 2022 Feb.

本文引用的文献

Imbalance Problems in Object Detection: A Review.目标检测中的不平衡问题：综述

IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3388-3415. doi: 10.1109/TPAMI.2020.2981890. Epub 2021 Sep 2.

CNN-Based Person Detection Using Infrared Images for Night-Time Intrusion Warning Systems.基于卷积神经网络的夜间入侵预警系统红外图像人体检测。

Sensors (Basel). 2019 Dec 19;20(1):34. doi: 10.3390/s20010034.

The ApolloScape Open Dataset for Autonomous Driving and Its Application.阿波罗景观开放数据集在自动驾驶中的应用

IEEE Trans Pattern Anal Mach Intell. 2020 Oct;42(10):2702-2719. doi: 10.1109/TPAMI.2019.2926463. Epub 2019 Jul 2.

Synthetic Data Generation for End-to-End Thermal Infrared Tracking.端到端热红外跟踪的合成数据生成。

IEEE Trans Image Process. 2019 Apr;28(4):1837-1850. doi: 10.1109/TIP.2018.2879249. Epub 2018 Nov 2.

Dermatologist-level classification of skin cancer with deep neural networks.基于深度神经网络的皮肤癌皮肤科医生级分类。

Nature. 2017 Feb 2;542(7639):115-118. doi: 10.1038/nature21056. Epub 2017 Jan 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于全自动化 DCNN 的热图像注释，使用基于 RGB 数据预训练的神经网络。

Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献