Yuan Shuai, Zhang Lixian, Dong Runmin, Xiong Jie, Zheng Juepeng, Fu Haohuan, Gong Peng
IEEE Trans Cybern. 2024 Oct;54(10):6118-6131. doi: 10.1109/TCYB.2024.3392474. Epub 2024 Oct 9.
In high-resolution remote sensing images (RSIs), complex composite object detection (e.g., coal-fired power plant detection and harbor detection) is challenging due to multiple discrete parts with variable layouts leading to complex weak inter-relationship and blurred boundaries, instead of a clearly defined single object. To address this issue, this article proposes an end-to-end framework, i.e., relational part-aware network (REPAN), to explore the semantic correlation and extract discriminative features among multiple parts. Specifically, we first design a part region proposal network (P-RPN) to locate discriminative yet subtle regions. With butterfly units (BFUs) embedded, feature-scale confusion problems stemming from aliasing effects can be largely alleviated. Second, a feature relation Transformer (FRT) plumbs the depths of the spatial relationships by part-and-global joint learning, exploring correlations between various parts to enhance significant part representation. Finally, a contextual detector (CD) classifies and detects parts and the whole composite object through multirelation-aware features, where part information guides to locate the whole object. We collect three remote sensing object detection datasets with four categories to evaluate our method. Consistently surpassing the performance of state-of-the-art methods, the results of extensive experiments underscore the effectiveness and superiority of our proposed method.
在高分辨率遥感图像(RSI)中,复杂复合物体检测(例如燃煤电厂检测和港口检测)具有挑战性,因为多个离散部分具有可变布局,导致复杂的弱相互关系和模糊边界,而不是一个定义明确的单个物体。为了解决这个问题,本文提出了一个端到端框架,即关系部分感知网络(REPAN),以探索语义相关性并提取多个部分之间的判别特征。具体而言,我们首先设计了一个部分区域提议网络(P-RPN)来定位有判别力但微妙的区域。通过嵌入蝶形单元(BFU),可以在很大程度上缓解由混叠效应引起的特征尺度混淆问题。其次,一个特征关系Transformer(FRT)通过部分和全局联合学习来探究空间关系的深度,探索各个部分之间的相关性以增强重要部分的表示。最后,一个上下文检测器(CD)通过多关系感知特征对部分和整个复合物体进行分类和检测,其中部分信息指导定位整个物体。我们收集了三个包含四类的遥感物体检测数据集来评估我们的方法。广泛实验的结果始终超过了现有方法的性能,突出了我们提出的方法的有效性和优越性。