Liu Wansi, Wang Huan, Duan Jiapeng, Cao Lixiang, Feng Teng, Tian Xiaomin
School of Remote Sensing and Information Engineering, North China Institute of Aerospace Engineering, Langfang 065000, China.
Hebei Collaborative Innovation Center for Aerospace Remote Sensing Information Processing and Application, Langfang 065000, China.
Sensors (Basel). 2025 Aug 1;25(15):4749. doi: 10.3390/s25154749.
Synthetic aperture radar (SAR), as an active microwave imaging system, has the capability of all-weather and all-time observation. In response to the challenges of aircraft detection in SAR images due to the complex background interference caused by the continuous scattering of airport buildings and the demand for real-time processing, this paper proposes a YOLOv7-MTI recognition model that combines the attention mechanism and involution. By integrating the MTCN module and involution, performance is enhanced. The Multi-TASP-Conv network (MTCN) module aims to effectively extract low-level semantic and spatial information using a shared lightweight attention gate structure to achieve cross-dimensional interaction between "channels and space" with very few parameters, capturing the dependencies among multiple dimensions and improving feature representation ability. Involution helps the model adaptively adjust the weights of spatial positions through dynamic parameterized convolution kernels, strengthening the discrete strong scattering points specific to aircraft and suppressing the continuous scattering of the background, thereby alleviating the interference of complex backgrounds. Experiments on the SAR-AIRcraft-1.0 dataset, which includes seven categories such as A220, A320/321, A330, ARJ21, Boeing737, Boeing787, and others, show that the mAP and mRecall of YOLOv7-MTI reach 93.51% and 96.45%, respectively, outperforming Faster R-CNN, SSD, YOLOv5, YOLOv7, and YOLOv8. Compared with the basic YOLOv7, mAP is improved by 1.47%, mRecall by 1.64%, and FPS by 8.27%, achieving an effective balance between accuracy and speed, providing research ideas for SAR aircraft recognition.
合成孔径雷达(SAR)作为一种有源微波成像系统,具备全天候、全时段观测能力。针对机场建筑物持续散射导致的复杂背景干扰以及SAR图像中飞机检测面临的实时处理需求等挑战,本文提出了一种结合注意力机制和内卷的YOLOv7-MTI识别模型。通过集成MTCN模块和内卷,性能得到提升。多任务自适应空间金字塔卷积网络(MTCN)模块旨在利用共享的轻量级注意力门结构有效提取低级语义和空间信息,以极少的参数实现“通道与空间”之间的跨维度交互,捕捉多维度之间的依赖关系并提高特征表示能力。内卷有助于模型通过动态参数化卷积核自适应调整空间位置权重,强化飞机特有的离散强散射点并抑制背景的连续散射,从而减轻复杂背景的干扰。在包含A220、A320/321、A330、ARJ21、波音737、波音787等七类的SAR-AIRcraft-1.0数据集上进行的实验表明,YOLOv7-MTI的平均精度均值(mAP)和平均召回率(mRecall)分别达到93.51%和96.45%,优于Faster R-CNN、SSD、YOLOv5、YOLOv7和YOLOv8。与基础的YOLOv7相比,mAP提高了1.47%,mRecall提高了1.64%,帧率(FPS)提高了8.27%,在精度和速度之间实现了有效平衡,为SAR飞机识别提供了研究思路。