Suppr超能文献

基于深度学习的目标检测研究综述。

Object Detection With Deep Learning: A Review.

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 Nov;30(11):3212-3232. doi: 10.1109/TNNLS.2018.2876865. Epub 2019 Jan 28.

Abstract

Due to object detection's close relationship with video analysis and image understanding, it has attracted much research attention in recent years. Traditional object detection methods are built on handcrafted features and shallow trainable architectures. Their performance easily stagnates by constructing complex ensembles that combine multiple low-level image features with high-level context from object detectors and scene classifiers. With the rapid development in deep learning, more powerful tools, which are able to learn semantic, high-level, deeper features, are introduced to address the problems existing in traditional architectures. These models behave differently in network architecture, training strategy, and optimization function. In this paper, we provide a review of deep learning-based object detection frameworks. Our review begins with a brief introduction on the history of deep learning and its representative tool, namely, the convolutional neural network. Then, we focus on typical generic object detection architectures along with some modifications and useful tricks to improve detection performance further. As distinct specific detection tasks exhibit different characteristics, we also briefly survey several specific tasks, including salient object detection, face detection, and pedestrian detection. Experimental analyses are also provided to compare various methods and draw some meaningful conclusions. Finally, several promising directions and tasks are provided to serve as guidelines for future work in both object detection and relevant neural network-based learning systems.

摘要

由于目标检测与视频分析和图像理解密切相关,近年来吸引了很多研究关注。传统的目标检测方法是基于手工制作的特征和浅层可训练的架构构建的。通过构建将多个低水平图像特征与来自目标检测器和场景分类器的高水平上下文相结合的复杂集成,它们的性能很容易停滞不前。随着深度学习的快速发展,引入了更强大的工具,这些工具能够学习语义、高级、更深层次的特征,以解决传统架构中存在的问题。这些模型在网络架构、训练策略和优化函数方面表现不同。在本文中,我们提供了一个基于深度学习的目标检测框架的综述。我们的综述首先简要介绍了深度学习的历史及其代表性工具,即卷积神经网络。然后,我们专注于典型的通用目标检测架构,以及一些改进和有用的技巧,以进一步提高检测性能。由于不同的特定检测任务具有不同的特点,我们还简要地调查了几个特定的任务,包括显著目标检测、人脸检测和行人检测。还提供了实验分析来比较各种方法,并得出一些有意义的结论。最后,提供了几个有前途的方向和任务,为目标检测和相关基于神经网络的学习系统的未来工作提供指导。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验