Suppr
超能文献

一种用于航空图像中车辆检测的特征融合深度投影卷积神经网络。

A feature fusion deep-projection convolution neural network for vehicle detection in aerial images.

机构信息

School of Communication and Information Engineering, Shanghai University, Shanghai, China.

Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, China.

出版信息

PLoS One. 2021 May 7;16(5):e0250782. doi: 10.1371/journal.pone.0250782. eCollection 2021.

DOI:10.1371/journal.pone.0250782

PMID:33961655

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8104367/

Abstract

With the rapid development of Unmanned Aerial Vehicles, vehicle detection in aerial images plays an important role in different applications. Comparing with general object detection problems, vehicle detection in aerial images is still a challenging research topic since it is plagued by various unique factors, e.g. different camera angle, small vehicle size and complex background. In this paper, a Feature Fusion Deep-Projection Convolution Neural Network is proposed to enhance the ability to detect small vehicles in aerial images. The backbone of the proposed framework utilizes a novel residual block named stepwise res-block to explore high-level semantic features as well as conserve low-level detail features at the same time. A specially designed feature fusion module is adopted in the proposed framework to further balance the features obtained from different levels of the backbone. A deep-projection deconvolution module is used to minimize the impact of the information contamination introduced by down-sampling/up-sampling processes. The proposed framework has been evaluated by UCAS-AOD, VEDAI, and DOTA datasets. According to the evaluation results, the proposed framework outperforms other state-of-the-art vehicle detection algorithms for aerial images.

摘要

随着无人机的快速发展，航空图像中的车辆检测在不同的应用中起着重要的作用。与一般的目标检测问题相比，航空图像中的车辆检测仍然是一个具有挑战性的研究课题，因为它受到各种独特因素的困扰，例如不同的相机角度、车辆尺寸小和复杂的背景。在本文中，提出了一种特征融合深度投影卷积神经网络，以提高在航空图像中检测小型车辆的能力。所提出框架的骨干网络利用一种名为逐步残差块的新型残差块来同时探索高层语义特征和保留低层细节特征。在所提出的框架中采用了一个特别设计的特征融合模块，以进一步平衡来自骨干网不同层次的特征。使用深度投影反卷积模块来最小化下采样/上采样过程引入的信息污染的影响。该框架已经在 UCAS-AOD、VEDAI 和 DOTA 数据集上进行了评估。根据评估结果，所提出的框架在航空图像车辆检测方面优于其他最先进的算法。