IEEE Trans Image Process. 2023;32:2552-2567. doi: 10.1109/TIP.2023.3270032. Epub 2023 May 5.
Robust keypoint detection on omnidirectional images under large perspective variations is a key problem in many computer vision tasks. In this paper, we propose a perspectively equivariant keypoint learning framework, named OmniKL, to address this problem. Specifically, the framework is composed of a perspective module and a spherical module, each comprising a keypoint detector specific to the type of the input image and a shared descriptor that provides a uniform description for omnidirectional and perspective images. In these detectors, we propose a differentiable candidate position sorting operation for localizing keypoints, which directly sorts the scores of the candidate positions in a differentiable manner and returns the globally top-K keypoints on the image. This approach preserves the differentiability of the two modules, so they are end-to-end trainable. Moreover, we design a novel training strategy that combines self-supervised and co-supervised methods to train the framework without any labeled data. Extensive experiments on synthetic and real-world 360° image datasets demonstrate the effectiveness of OmniKL in detecting perspectively equivariant keypoints on omnidirectional images. Our source code is available online at https://github.com/vandeppce/sphkpt.
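The abstract's core technical idea, selecting the globally top-K keypoints through a differentiable sorting of candidate scores, can be illustrated with a generic soft-selection relaxation. The sketch below is an assumption for illustration only (it is not the exact OmniKL operation, whose details are in the paper): each of the K keypoints is a softmax-weighted average of candidate positions, and the winner of each round is suppressed before the next, so gradients flow through the selection.

```python
import numpy as np

def soft_top_k(scores, positions, k, tau=0.1):
    """Differentiable relaxation of global top-K keypoint selection.

    Hypothetical sketch: replaces the hard, non-differentiable argmax
    with a temperature-controlled softmax over candidate scores, then
    soft-suppresses the selected candidate between rounds.

    scores:    (N,) candidate keypoint scores
    positions: (N, 2) candidate (x, y) image positions
    returns:   (k, 2) soft-selected keypoint positions
    """
    s = scores.astype(float).copy()
    picked = []
    for _ in range(k):
        w = np.exp(s / tau)
        w /= w.sum()                  # soft one-hot over candidates
        picked.append(w @ positions)  # soft-selected (x, y) position
        s = s - 1e6 * w               # suppress the chosen candidate
    return np.stack(picked)

# As the temperature tau -> 0, the softmax weights approach hard
# one-hot vectors and the output approaches exact top-K selection.
keypoints = soft_top_k(np.array([0.1, 0.9, 0.5]),
                       np.array([[0., 0.], [1., 1.], [2., 2.]]),
                       k=2, tau=0.01)
```

At a small temperature this recovers the two highest-scoring candidates in score order; at larger temperatures the selection is smoother and more amenable to gradient-based training.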