• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MCFA:用于跨视图地理定位的多尺度级联与特征自适应对齐网络

MCFA: Multi-Scale Cascade and Feature Adaptive Alignment Network for Cross-View Geo-Localization.

作者信息

Hou Kaiji, Tong Qiang, Yan Na, Liu Xiulei, Hou Shoulu

机构信息

College of Computer Science, Beijing Information Science and Technology University, Beijing 102206, China.

出版信息

Sensors (Basel). 2025 Jul 21;25(14):4519. doi: 10.3390/s25144519.

DOI:10.3390/s25144519
PMID:40732647
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12299452/
Abstract

Cross-view geo-localization (CVGL) presents significant challenges due to the drastic variations in perspective and scene layout between unmanned aerial vehicle (UAV) and satellite images. Existing methods have made certain advancements in extracting local features from images. However, they exhibit limitations in modeling the interactions among local features and fall short in aligning cross-view representations accurately. To address these issues, we propose a Multi-Scale Cascade and Feature Adaptive Alignment (MCFA) network, which consists of a Multi-Scale Cascade Module (MSCM) and a Feature Adaptive Alignment Module (FAAM). The MSCM captures the features of the target's adjacent regions and enhances the model's robustness by learning key region information through association and fusion. The FAAM, with its dynamically weighted feature alignment module, adaptively adjusts feature differences across different viewpoints, achieving feature alignment between drone and satellite images. Our method achieves state-of-the-art (SOTA) performance on two public datasets, University-1652 and SUES-200. In generalization experiments, our model outperforms existing SOTA methods, with an average improvement of 1.52% in R@1 and 2.09% in AP, demonstrating its effectiveness and strong generalization in cross-view geo-localization tasks.

摘要

跨视角地理定位(CVGL)由于无人机(UAV)图像和卫星图像之间视角和场景布局的巨大差异而面临重大挑战。现有方法在从图像中提取局部特征方面取得了一定进展。然而,它们在对局部特征之间的相互作用进行建模时存在局限性,并且在准确对齐跨视角表示方面存在不足。为了解决这些问题,我们提出了一种多尺度级联和特征自适应对齐(MCFA)网络,它由一个多尺度级联模块(MSCM)和一个特征自适应对齐模块(FAAM)组成。MSCM捕获目标相邻区域的特征,并通过关联和融合学习关键区域信息来增强模型的鲁棒性。FAAM及其动态加权特征对齐模块可自适应调整不同视角之间的特征差异,实现无人机图像和卫星图像之间的特征对齐。我们的方法在两个公共数据集University-1652和SUES-200上取得了领先的(SOTA)性能。在泛化实验中,我们的模型优于现有的SOTA方法,在R@1上平均提高了1.52%,在AP上提高了2.09%,证明了其在跨视角地理定位任务中的有效性和强大的泛化能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/004ccbc7104e/sensors-25-04519-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/1ddab3a66c8e/sensors-25-04519-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/65a1fadf22b5/sensors-25-04519-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/77325574f7d6/sensors-25-04519-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/7fcc37789a6a/sensors-25-04519-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/c3cfc42a00de/sensors-25-04519-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/a632a94822b3/sensors-25-04519-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/1659f47e0aa4/sensors-25-04519-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/b02cea107de3/sensors-25-04519-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/e6e20bb583b3/sensors-25-04519-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/d83b7f4f8685/sensors-25-04519-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/9562b2068664/sensors-25-04519-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/aa6ef11f24e3/sensors-25-04519-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/004ccbc7104e/sensors-25-04519-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/1ddab3a66c8e/sensors-25-04519-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/65a1fadf22b5/sensors-25-04519-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/77325574f7d6/sensors-25-04519-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/7fcc37789a6a/sensors-25-04519-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/c3cfc42a00de/sensors-25-04519-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/a632a94822b3/sensors-25-04519-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/1659f47e0aa4/sensors-25-04519-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/b02cea107de3/sensors-25-04519-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/e6e20bb583b3/sensors-25-04519-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/d83b7f4f8685/sensors-25-04519-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/9562b2068664/sensors-25-04519-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/aa6ef11f24e3/sensors-25-04519-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe35/12299452/004ccbc7104e/sensors-25-04519-g013.jpg

相似文献

1
MCFA: Multi-Scale Cascade and Feature Adaptive Alignment Network for Cross-View Geo-Localization.MCFA:用于跨视图地理定位的多尺度级联与特征自适应对齐网络
Sensors (Basel). 2025 Jul 21;25(14):4519. doi: 10.3390/s25144519.
2
AGEN: Adaptive Error Control-Driven Cross-View Geo-Localization Under Extreme Weather Conditions.AGEN:极端天气条件下基于自适应错误控制驱动的跨视角地理定位
Sensors (Basel). 2025 Jun 15;25(12):3749. doi: 10.3390/s25123749.
3
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.
4
Video Instance Segmentation Through Hierarchical Offset Compensation and Temporal Memory Update for UAV Aerial Images.基于层次偏移补偿和时间记忆更新的无人机航空影像视频实例分割
Sensors (Basel). 2025 Jul 9;25(14):4274. doi: 10.3390/s25144274.
5
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
6
Image dehazing algorithm based on deep transfer learning and local mean adaptation.基于深度迁移学习和局部均值自适应的图像去雾算法
Sci Rep. 2025 Jul 31;15(1):27956. doi: 10.1038/s41598-025-13613-z.
7
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
8
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
9
Short-Term Memory Impairment短期记忆障碍
10
Diffusion semantic segmentation model: A generative model for medical image segmentation based on joint distribution.扩散语义分割模型:一种基于联合分布的医学图像分割生成模型。
Med Phys. 2025 Jul;52(7):e17928. doi: 10.1002/mp.17928. Epub 2025 Jun 8.

本文引用的文献

1
Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.在强化学习中用于神经网络函数逼近的 Sigmoid 加权线性单元。
Neural Netw. 2018 Nov;107:3-11. doi: 10.1016/j.neunet.2017.12.012. Epub 2018 Jan 11.