• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过可匹配关键点辅助图神经网络学习特征匹配

Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network.

作者信息

Li Zizhuo, Ma Jiayi

出版信息

IEEE Trans Image Process. 2024 Dec 11;PP. doi: 10.1109/TIP.2024.3512352.

DOI:10.1109/TIP.2024.3512352
PMID:40030484
Abstract

Accurately matching local features between a pair of images corresponding to the same 3D scene is a challenging computer vision task. Previous studies typically utilize attention-based graph neural networks (GNNs) with fully-connected graphs over keypoints within/across images for visual and geometric information reasoning. However, in the background of local feature matching, a significant number of keypoints are non-repeatable due to factors like occlusion and failure of the detector, and thus irrelevant for message passing. The connectivity with non-repeatable keypoints not only introduces redundancy, resulting in limited efficiency (quadratic computational complexity w.r.t. the keypoint number), but also interferes with the representation aggregation process, leading to limited accuracy. Aiming at the best of both worlds on accuracy and efficiency, we propose MaKeGNN, a sparse attention-based GNN architecture which bypasses non-repeatable keypoints and leverages matchable ones to guide compact and meaningful message passing. More specifically, our Bilateral Context-Aware Sampling (BCAS) Module first dynamically samples two small sets of well-distributed keypoints with high matchability scores from the image pair. Then, our Matchable Keypoint-Assisted Context Aggregation (MKACA) Module regards sampled informative keypoints as message bottlenecks and thus constrains each keypoint only to retrieve favorable contextual information from intra- and inter-matchable keypoints, evading the interference of irrelevant and redundant connectivity with non-repeatable ones. Furthermore, considering the potential noise in initial keypoints and sampled matchable ones, the MKACA module adopts a matchability-guided attentional aggregation operation for purer data-dependent context propagation. By these means, MaKeGNN outperforms the state-of-the-arts on multiple highly challenging benchmarks, while significantly reducing computational and memory complexity compared to typical attentional GNNs.

摘要

准确匹配对应于同一3D场景的一对图像之间的局部特征是一项具有挑战性的计算机视觉任务。先前的研究通常利用基于注意力的图神经网络(GNN),在图像内/图像间的关键点上使用全连接图进行视觉和几何信息推理。然而,在局部特征匹配的背景下,由于遮挡和检测器故障等因素,大量关键点是不可重复的,因此与消息传递无关。与不可重复关键点的连接不仅会引入冗余,导致效率有限(计算复杂度与关键点数量呈二次关系),还会干扰表示聚合过程,导致准确性受限。为了在准确性和效率两方面都达到最佳效果,我们提出了MaKeGNN,这是一种基于稀疏注意力的GNN架构,它绕过不可重复的关键点,并利用可匹配的关键点来指导紧凑且有意义的消息传递。更具体地说,我们的双边上下文感知采样(BCAS)模块首先从图像对中动态采样两组分布良好、具有高匹配性分数的小关键点集。然后,我们的可匹配关键点辅助上下文聚合(MKACA)模块将采样的信息丰富的关键点视为消息瓶颈,从而约束每个关键点仅从可匹配的关键点内和可匹配的关键点间检索有利的上下文信息,避免与不可重复关键点的无关和冗余连接的干扰。此外,考虑到初始关键点和采样的可匹配关键点中的潜在噪声,MKACA模块采用了一种基于匹配性的注意力聚合操作,以实现更纯净的数据依赖上下文传播。通过这些方法,MaKeGNN在多个极具挑战性的基准测试中优于现有技术,同时与典型的注意力GNN相比,显著降低了计算和内存复杂度。

相似文献

1
Learning Feature Matching via Matchable Keypoint-Assisted Graph Neural Network.通过可匹配关键点辅助图神经网络学习特征匹配
IEEE Trans Image Process. 2024 Dec 11;PP. doi: 10.1109/TIP.2024.3512352.
2
Dynamic Keypoint Detection Network for Image Matching.用于图像匹配的动态关键点检测网络
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14404-14419. doi: 10.1109/TPAMI.2023.3307889. Epub 2023 Nov 3.
3
Robust Estimation and Optimized Transmission of 3D Feature Points for Computer Vision on Mobile Communication Network.移动通讯网络上计算机视觉的三维特征点稳健估计与优化传输
Sensors (Basel). 2022 Nov 7;22(21):8563. doi: 10.3390/s22218563.
4
Detecting keypoints with semantic labels on skull point cloud for plastic surgery.在颅骨点云上检测带有语义标签的关键点用于整形手术。
Quant Imaging Med Surg. 2025 Apr 1;15(4):3501-3516. doi: 10.21037/qims-24-1358. Epub 2025 Mar 24.
5
PCSS: 3D Keypoint Detection for Point Clouds Using Structural Saliency.
IEEE Trans Image Process. 2025;34:2765-2780. doi: 10.1109/TIP.2025.3565380. Epub 2025 May 12.
6
ED-Pose++: Enhanced Explicit Box Detection for Conventional and Interactive Multi-Object Keypoint Detection.ED-Pose++:用于传统和交互式多目标关键点检测的增强型显式框检测
IEEE Trans Pattern Anal Mach Intell. 2025 Jul;47(7):5636-5654. doi: 10.1109/TPAMI.2025.3555527.
7
Learning Local Descriptors by Optimizing the Keypoint-Correspondence Criterion: Applications to Face Matching, Learning From Unlabeled Videos and 3D-Shape Retrieval.通过优化关键点对应准则学习局部描述符:在人脸匹配、无监督视频学习和 3D 形状检索中的应用。
IEEE Trans Image Process. 2019 Jan;28(1):279-290. doi: 10.1109/TIP.2018.2867270.
8
Learning 3D medical image keypoint descriptors with the triplet loss.使用三元组损失学习 3D 医学图像关键点描述符。
Int J Comput Assist Radiol Surg. 2022 Jan;17(1):141-146. doi: 10.1007/s11548-021-02481-3. Epub 2021 Aug 27.
9
Dynamic Graph Message Passing Networks.动态图消息传递网络
IEEE Trans Pattern Anal Mach Intell. 2023 May;45(5):5712-5730. doi: 10.1109/TPAMI.2022.3207500. Epub 2023 Apr 3.
10
BIK-BUS: biologically motivated 3D keypoint based on bottom-up saliency.BIK-BUS:基于自下而上显著度的生物启发式 3D 关键点。
IEEE Trans Image Process. 2015 Jan;24(1):163-75. doi: 10.1109/TIP.2014.2371532. Epub 2014 Nov 20.