Suppr超能文献

DAR-Net:用于航空图像中车辆检测的密集注意力残差网络。

DAR-Net: Dense Attentional Residual Network for Vehicle Detection in Aerial Images.

机构信息

School of Communication and Information Engineering, Shanghai University, Shanghai, China.

出版信息

Comput Intell Neurosci. 2021 Nov 26;2021:6340823. doi: 10.1155/2021/6340823. eCollection 2021.

Abstract

With the rapid development of deep learning and the wide usage of Unmanned Aerial Vehicles (UAVs), CNN-based algorithms of vehicle detection in aerial images have been widely studied in the past several years. As a downstream task of the general object detection, there are some differences between the vehicle detection in aerial images and the general object detection in ground view images, e.g., larger image areas, smaller target sizes, and more complex background. In this paper, to improve the performance of this task, a Dense Attentional Residual Network (DAR-Net) is proposed. The proposed network employs a novel dense waterfall residual block (DW res-block) to effectively preserve the spatial information and extract high-level semantic information at the same time. A multiscale receptive field attention (MRFA) module is also designed to select the informative feature from the feature maps and enhance the ability of multiscale perception. Based on the DW res-block and MRFA module, to protect the spatial information, the proposed framework adopts a new backbone that only downsamples the feature map 3 times; i.e., the total downsampling ratio of the proposed backbone is 8. These designs could alleviate the degradation problem, improve the information flow, and strengthen the feature reuse. In addition, deep-projection units are used to reduce the impact of information loss caused by downsampling operations, and the identity mapping is applied to each stage of the proposed backbone to further improve the information flow. The proposed DAR-Net is evaluated on VEDAI, UCAS-AOD, and DOTA datasets. The experimental results demonstrate that the proposed framework outperforms other state-of-the-art algorithms.

摘要

随着深度学习的快速发展和无人机 (UAV) 的广泛应用,近年来基于卷积神经网络 (CNN) 的航空图像车辆检测算法得到了广泛的研究。作为一般目标检测的下游任务,航空图像中的车辆检测与地面视图图像中的一般目标检测存在一些差异,例如更大的图像区域、更小的目标尺寸和更复杂的背景。在本文中,为了提高该任务的性能,提出了一种密集注意残留网络 (DAR-Net)。所提出的网络采用新颖的密集瀑布残差块 (DW res-block) 来有效地保留空间信息并同时提取高层语义信息。还设计了一种多尺度感受野注意力 (MRFA) 模块,用于从特征图中选择有意义的特征并增强多尺度感知能力。基于 DW res-block 和 MRFA 模块,为了保护空间信息,所提出的框架采用了一种新的仅对特征图进行 3 次下采样的骨干网络;即,所提出的骨干网络的总下采样率为 8。这些设计可以缓解降级问题,改善信息流,并增强特征重用。此外,深度投影单元用于减少下采样操作引起的信息丢失的影响,并且身份映射应用于所提出的骨干网的每个阶段,以进一步改善信息流。所提出的 DAR-Net 在 VEDAI、UCAS-AOD 和 DOTA 数据集上进行了评估。实验结果表明,所提出的框架优于其他最先进的算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/757f/8642025/75552889a753/CIN2021-6340823.001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验