Object Detection With Deep Learning: A Review.

Publication information

IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3212-3232. doi: 10.1109/TNNLS.2018.2876865. Epub 2019 Jan 28.

DOI:10.1109/TNNLS.2018.2876865

Abstract

Due to object detection's close relationship with video analysis and image understanding, it has attracted much research attention in recent years. Traditional object detection methods are built on handcrafted features and shallow trainable architectures. Their performance easily stagnates, even when complex ensembles are constructed that combine multiple low-level image features with high-level context from object detectors and scene classifiers. With the rapid development of deep learning, more powerful tools, which are able to learn semantic, high-level, deeper features, have been introduced to address the problems existing in traditional architectures. These models differ in network architecture, training strategy, and optimization function. In this paper, we provide a review of deep learning-based object detection frameworks. Our review begins with a brief introduction to the history of deep learning and its representative tool, namely, the convolutional neural network. Then, we focus on typical generic object detection architectures along with some modifications and useful tricks to improve detection performance further. As distinct specific detection tasks exhibit different characteristics, we also briefly survey several specific tasks, including salient object detection, face detection, and pedestrian detection. Experimental analyses are also provided to compare various methods and draw some meaningful conclusions. Finally, several promising directions and tasks are provided to serve as guidelines for future work in both object detection and relevant neural network-based learning systems.
