Sampling Equivariant Self-Attention Networks for Object Detection in Aerial Images.

Authors

Yang Guo-Ye, Li Xiang-Li, Xiao Zi-Kai, Mu Tai-Jiang, Martin Ralph R, Hu Shi-Min

Publication information

IEEE Trans Image Process. 2023;32:6413-6425. doi: 10.1109/TIP.2023.3327586. Epub 2023 Nov 28.

DOI:10.1109/TIP.2023.3327586

Abstract

Objects in aerial images show greater variations in scale and orientation than in other images, making them harder to detect using vanilla deep convolutional neural networks. Networks with sampling equivariance can adapt sampling from input feature maps to object transformation, allowing a convolutional kernel to extract effective object features under different transformations. However, methods such as deformable convolutional networks can only provide sampling equivariance under certain circumstances, as they sample by location. We propose sampling equivariant self-attention networks, which treat self-attention restricted to a local image patch as convolution sampling by masks instead of locations, and a transformation embedding module to improve the equivariant sampling further. We further propose a novel randomized normalization module to enhance network generalization and a quantitative evaluation metric to fairly evaluate the ability of sampling equivariance of different models. Experiments show that our model provides significantly better sampling equivariance than existing methods without additional supervision and can thus extract more effective image features. Our model achieves state-of-the-art results on the DOTA-v1.0, DOTA-v1.5, and HRSC2016 datasets without additional computations or parameters.
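The idea of treating local self-attention as "convolution sampling by masks instead of locations" can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `local_self_attention`, the projection matrices `w_q`, `w_k`, `w_v`, the `window` parameter (assumed odd), and the zero padding at image borders are all assumptions made for illustration, and the paper's transformation embedding and randomized normalization modules are omitted. The final check only covers an exact 90-degree rotation, a special case that holds by symmetry of the square window; it is not the quantitative metric proposed in the paper.

```python
import numpy as np


def local_self_attention(x, w_q, w_k, w_v, window=3):
    """Self-attention restricted to a window x window patch around each pixel.

    x has shape (H, W, C). Returns (output, masks), where masks[i, j] holds
    the window*window attention weights used to sample around pixel (i, j).
    Hypothetical helper for illustration only; window is assumed to be odd.
    """
    h, w, _ = x.shape
    r = window // 2
    q, k, v = x @ w_q, x @ w_k, x @ w_v  # per-pixel linear projections

    # Zero-pad so border pixels also see a full window (a simplification).
    pad = ((r, r), (r, r), (0, 0))
    k_pad, v_pad = np.pad(k, pad), np.pad(v, pad)

    # Gather every neighbour in the window: shape (H, W, window*window, D).
    offsets = [(dy, dx) for dy in range(window) for dx in range(window)]
    k_nbr = np.stack([k_pad[dy:dy + h, dx:dx + w] for dy, dx in offsets], axis=2)
    v_nbr = np.stack([v_pad[dy:dy + h, dx:dx + w] for dy, dx in offsets], axis=2)

    # Content-dependent sampling masks: softmax over query-key similarity.
    logits = np.einsum("hwd,hwnd->hwn", q, k_nbr) / np.sqrt(q.shape[-1])
    logits = logits - logits.max(axis=-1, keepdims=True)
    masks = np.exp(logits)
    masks = masks / masks.sum(axis=-1, keepdims=True)

    # Each output feature is a mask-weighted sum of neighbouring values.
    out = np.einsum("hwn,hwnd->hwd", masks, v_nbr)
    return out, masks


rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8, 4))
w_q, w_k, w_v = (rng.normal(size=(4, 4)) for _ in range(3))

out, masks = local_self_attention(x, w_q, w_k, w_v)
out_rot, _ = local_self_attention(np.rot90(x), w_q, w_k, w_v)

# Rotating the input by 90 degrees rotates the output the same way.
is_equivariant = np.allclose(out_rot, np.rot90(out))
```

Because the masks depend only on feature content and not on fixed sampling offsets, the weights follow the object when the input is rotated, which is the intuition behind sampling by masks in this toy setting.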

Similar articles

Sampling Equivariant Self-Attention Networks for Object Detection in Aerial Images.

IEEE Trans Image Process. 2023;32:6413-6425. doi: 10.1109/TIP.2023.3327586. Epub 2023 Nov 28.

A feature fusion deep-projection convolution neural network for vehicle detection in aerial images.

PLoS One. 2021 May 7;16(5):e0250782. doi: 10.1371/journal.pone.0250782. eCollection 2021.

VolterraNet: A Higher Order Convolutional Network With Group Equivariance for Homogeneous Manifolds.

IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):823-833. doi: 10.1109/TPAMI.2020.3035130. Epub 2022 Jan 7.

Learning Generalized Transformation Equivariant Representations Via AutoEncoding Transformations.

IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2045-2057. doi: 10.1109/TPAMI.2020.3029801. Epub 2022 Mar 4.

SIL-Net: A Semi-Isotropic L-shaped network for dermoscopic image segmentation.

Comput Biol Med. 2022 Nov;150:106146. doi: 10.1016/j.compbiomed.2022.106146. Epub 2022 Sep 27.

YOLOv4 with Deformable-Embedding-Transformer Feature Extractor for Exact Object Detection in Aerial Imagery.

Sensors (Basel). 2023 Feb 24;23(5):2522. doi: 10.3390/s23052522.

Augmented Equivariant Attention Networks for Microscopy Image Transformation.

IEEE Trans Med Imaging. 2022 Nov;41(11):3194-3206. doi: 10.1109/TMI.2022.3179665. Epub 2022 Oct 27.

Shared-Weight-Based Multi-Dimensional Feature Alignment Network for Oriented Object Detection in Remote Sensing Imagery.

Sensors (Basel). 2022 Dec 25;23(1):207. doi: 10.3390/s23010207.

Scale Enhancement Pyramid Network for Small Object Detection from UAV Images.

Entropy (Basel). 2022 Nov 21;24(11):1699. doi: 10.3390/e24111699.

Equivariant neural networks for inverse problems.

Inverse Probl. 2021 Aug;37(8):085006. doi: 10.1088/1361-6420/ac104f. Epub 2021 Jul 26.

Cited by

Crop Mapping Based on Sentinel-2 Images Using Semantic Segmentation Model of Attention Mechanism.

Sensors (Basel). 2023 Aug 7;23(15):7008. doi: 10.3390/s23157008.
