Convolutional Neural Network Architecture for Geometric Matching.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2553-2567. doi: 10.1109/TPAMI.2018.2865351. Epub 2018 Aug 13.

Abstract

We address the problem of determining correspondences between two images in agreement with a geometric model such as an affine, homography or thin-plate spline transformation, and estimating its parameters. The contributions of this work are threefold. First, we propose a convolutional neural network architecture for geometric matching. The architecture is based on three main components that mimic the standard steps of feature extraction, matching, and simultaneous inlier detection and model parameter estimation, while being trainable end-to-end. Second, we demonstrate that the network parameters can be trained from synthetically generated imagery without the need for manual annotation, and that our matching layer significantly improves generalization to previously unseen images. Finally, we show that the same model can perform both instance-level and category-level matching, giving state-of-the-art results on the challenging PF, TSS and Caltech-101 datasets.
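The matching component described in the abstract can be illustrated with a small sketch. The abstract does not spell out how the matching layer is computed, so the code below assumes it is a dense correlation between two feature maps: every feature vector in one image is compared with every feature vector in the other, and the resulting scores are rectified and normalized. The function names `l2_normalize` and `correlation_matching`, the `(C, H, W)` layout, and the use of NumPy are illustrative choices, not details taken from the record.

```python
import numpy as np


def l2_normalize(x, axis, eps=1e-8):
    """Scale x to unit L2 norm along the given axis (eps avoids division by zero)."""
    norm = np.sqrt(np.sum(x * x, axis=axis, keepdims=True))
    return x / (norm + eps)


def correlation_matching(feat_a, feat_b):
    """Dense correlation between two feature maps (assumed form of the matching step).

    feat_a, feat_b: arrays of shape (C, H, W), e.g. CNN features of two images.
    Returns an array of shape (H*W, H, W). Entry [k, i, j] is the normalized
    similarity between the feature at flattened position k in image A and the
    feature at position (i, j) in image B.
    """
    c, h, w = feat_a.shape
    # Unit-normalize each C-dimensional descriptor, then flatten the grid.
    fa = l2_normalize(feat_a, axis=0).reshape(c, h * w)
    fb = l2_normalize(feat_b, axis=0).reshape(c, h * w)
    # Cosine similarity of every A position against every B position.
    corr = fa.T @ fb  # shape (H*W of A, H*W of B)
    # Keep only positive evidence, then normalize the scores per B position
    # so that ambiguous matches (many similar candidates) are down-weighted.
    corr = np.maximum(corr, 0.0)
    corr = l2_normalize(corr, axis=0)
    return corr.reshape(h * w, h, w)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    features = rng.standard_normal((16, 4, 5))
    scores = correlation_matching(features, features)
    print(scores.shape)
```

In a full pipeline of the kind the abstract outlines, a tensor like `scores` would be passed to a regression stage that outputs the parameters of the geometric model; that stage is not sketched here.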

Similar articles

NCNet: Neighbourhood Consensus Networks for Estimating Image Correspondences.

IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):1020-1034. doi: 10.1109/TPAMI.2020.3016711. Epub 2022 Jan 7.

Convolutional Hough Matching Networks for Robust and Efficient Visual Correspondence.

IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8159-8175. doi: 10.1109/TPAMI.2022.3233884. Epub 2023 Jun 5.

Gum-Net: Unsupervised Geometric Matching for Fast and Accurate 3D Subtomogram Image Alignment and Averaging.

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2020 Jun;2020:4072-4082. doi: 10.1109/cvpr42600.2020.00413. Epub 2020 Aug 5.

A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.

Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.

Deep feature descriptor based hierarchical dense matching for X-ray angiographic images.

Comput Methods Programs Biomed. 2019 Jul;175:233-242. doi: 10.1016/j.cmpb.2019.04.006. Epub 2019 Apr 22.

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-Segmentation.

IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3632-3647. doi: 10.1109/TPAMI.2020.2985395. Epub 2021 Sep 3.

DualRC: A Dual-Resolution Learning Framework With Neighbourhood Consensus for Visual Correspondences.

IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):236-249. doi: 10.1109/TPAMI.2023.3316770. Epub 2023 Dec 5.

Reducing the U-Net size for practical scenarios: Virus recognition in electron microscopy images.

Comput Methods Programs Biomed. 2019 Sep;178:31-39. doi: 10.1016/j.cmpb.2019.05.026. Epub 2019 Jun 1.

Robust Vehicle Detection in Aerial Images Based on Cascaded Convolutional Neural Networks.

Sensors (Basel). 2017 Nov 24;17(12):2720. doi: 10.3390/s17122720.

Cited by

IDNet: An inception-like deformable non-local network for projection compensation over non-flat textured surfaces.

PLoS One. 2025 May 20;20(5):e0318812. doi: 10.1371/journal.pone.0318812. eCollection 2025.

PViT-AIR: Puzzling vision transformer-based affine image registration for multi histopathology and faxitron images of breast tissue.

Med Image Anal. 2025 Jan;99:103356. doi: 10.1016/j.media.2024.103356. Epub 2024 Sep 30.

Automatic vectorization of historical maps: A benchmark.

PLoS One. 2024 Feb 15;19(2):e0298217. doi: 10.1371/journal.pone.0298217. eCollection 2024.

Deep learning-based affine medical image registration for multimodal minimal-invasive image-guided interventions - A comparative study on generalizability.

Z Med Phys. 2024 May;34(2):291-317. doi: 10.1016/j.zemedi.2023.05.003. Epub 2023 Jun 22.

Homologous point transformer for multi-modality prostate image registration.

PeerJ Comput Sci. 2022 Dec 1;8:e1155. doi: 10.7717/peerj-cs.1155. eCollection 2022.

Dance Action Recognition Model Using Deep Learning Network in Streaming Media Environment.

J Environ Public Health. 2022 Sep 12;2022:8955326. doi: 10.1155/2022/8955326. eCollection 2022.

Self-Supervised Rigid Registration for Multimodal Retinal Images.

IEEE Trans Image Process. 2022;31:5733-5747. doi: 10.1109/TIP.2022.3201476. Epub 2022 Sep 2.

Application of Mobile Virtual Reality Technology Combined with Neural Network in Facial Expression Recognition.

Comput Intell Neurosci. 2022 Aug 5;2022:4288187. doi: 10.1155/2022/4288187. eCollection 2022.

Texture Image Classification Based on Deep Learning and Wireless Sensor Technology.

Comput Intell Neurosci. 2022 May 24;2022:1761635. doi: 10.1155/2022/1761635. eCollection 2022.

A Cognitive Sample Consensus Method for the Stitching of Drone-Based Aerial Images Supported by a Generative Adversarial Network for False Positive Reduction.

Sensors (Basel). 2022 Mar 23;22(7):2474. doi: 10.3390/s22072474.
