基于深度神经网络的多模态视网膜图像两步配准。

Two-Step Registration on Multi-Modal Retinal Images via Deep Neural Networks.

出版信息

IEEE Trans Image Process. 2022;31:823-838. doi: 10.1109/TIP.2021.3135708. Epub 2022 Jan 4.

DOI:10.1109/TIP.2021.3135708

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8912939/

Abstract

Multi-modal retinal image registration plays an important role in the ophthalmological diagnosis process. The conventional methods lack robustness in aligning multi-modal images of various imaging qualities. Deep-learning methods have not been widely developed for this task, especially for the coarse-to-fine registration pipeline. To handle this task, we propose a two-step method based on deep convolutional networks, including a coarse alignment step and a fine alignment step. In the coarse alignment step, a global registration matrix is estimated by three sequentially connected networks for vessel segmentation, feature detection and description, and outlier rejection, respectively. In the fine alignment step, a deformable registration network is set up to find pixel-wise correspondence between a target image and a coarsely aligned image from the previous step to further improve the alignment accuracy. Particularly, an unsupervised learning framework is proposed to handle the difficulties of inconsistent modalities and lack of labeled training data for the fine alignment step. The proposed framework first changes multi-modal images into a same modality through modality transformers, and then adopts photometric consistency loss and smoothness loss to train the deformable registration network. The experimental results show that the proposed method achieves state-of-the-art results in Dice metrics and is more robust in challenging cases.

摘要

多模态视网膜图像配准在眼科诊断过程中起着重要作用。传统方法在对齐具有不同成像质量的多模态图像时缺乏鲁棒性。深度学习方法尚未广泛应用于这项任务，特别是对于粗到精的配准流水线。为了处理这项任务，我们提出了一种基于深度卷积网络的两步法，包括粗对准步骤和精对准步骤。在粗对准步骤中，通过三个依次连接的网络分别进行血管分割、特征检测和描述以及异常值剔除，来估计全局配准矩阵。在精对准步骤中，建立一个可变形配准网络，以在目标图像和前一步骤中粗对准的图像之间找到像素级对应关系，从而进一步提高对准精度。特别是，提出了一种无监督学习框架来处理精细对准步骤中模态不一致和缺乏标记训练数据的困难。该框架首先通过模态变换器将多模态图像转换为相同的模态，然后采用光度一致性损失和平滑度损失来训练可变形配准网络。实验结果表明，所提出的方法在 Dice 度量上达到了最先进的水平，并且在具有挑战性的情况下更具鲁棒性。

相似文献

Two-Step Registration on Multi-Modal Retinal Images via Deep Neural Networks.

IEEE Trans Image Process. 2022;31:823-838. doi: 10.1109/TIP.2021.3135708. Epub 2022 Jan 4.

FDRN: A fast deformable registration network for medical images.

Med Phys. 2021 Oct;48(10):6453-6463. doi: 10.1002/mp.15011. Epub 2021 Jul 6.

Robust Content-Adaptive Global Registration for Multimodal Retinal Images Using Weakly Supervised Deep-Learning Framework.

IEEE Trans Image Process. 2021;30:3167-3178. doi: 10.1109/TIP.2021.3058570. Epub 2021 Feb 25.

Adversarial learning for mono- or multi-modal registration.

Med Image Anal. 2019 Dec;58:101545. doi: 10.1016/j.media.2019.101545. Epub 2019 Aug 24.

Image synthesis-based multi-modal image registration framework by using deep fully convolutional networks.

Med Biol Eng Comput. 2019 May;57(5):1037-1048. doi: 10.1007/s11517-018-1924-y. Epub 2018 Dec 7.

Geometry-Consistent Adversarial Registration Model for Unsupervised Multi-Modal Medical Image Registration.

IEEE J Biomed Health Inform. 2023 Jul;27(7):3455-3466. doi: 10.1109/JBHI.2023.3270199. Epub 2023 Jun 30.

Unsupervised End-to-End Brain Tumor Magnetic Resonance Image Registration Using RBCNN: Rigid Transformation, B-Spline Transformation and Convolutional Neural Network.

Curr Med Imaging. 2022;18(4):387-397. doi: 10.2174/1573405617666210806125526.

Automated cardiac segmentation of cross-modal medical images using unsupervised multi-domain adaptation and spatial neural attention structure.

Med Image Anal. 2021 Aug;72:102135. doi: 10.1016/j.media.2021.102135. Epub 2021 Jun 17.

Memory-efficient 2.5D convolutional transformer networks for multi-modal deformable registration with weak label supervision applied to whole-heart CT and MRI scans.

Int J Comput Assist Radiol Surg. 2019 Nov;14(11):1901-1912. doi: 10.1007/s11548-019-02068-z. Epub 2019 Sep 19.

CMAN: Cascaded Multi-scale Spatial Channel Attention-guided Network for large 3D deformable registration of liver CT images.

Med Image Anal. 2024 Aug;96:103212. doi: 10.1016/j.media.2024.103212. Epub 2024 May 22.

引用本文的文献

Artificial intelligence technology in ophthalmology public health: current applications and future directions.

Front Cell Dev Biol. 2025 Apr 17;13:1576465. doi: 10.3389/fcell.2025.1576465. eCollection 2025.

A Rapid Head Organ Localization System Based on Clinically Realistic Images: A 3D Two Step Progressive Registration Method with CVH Anatomical Knowledge Mapping.

Bioengineering (Basel). 2024 Sep 1;11(9):891. doi: 10.3390/bioengineering11090891.

Medical image registration and its application in retinal images: a review.

Vis Comput Ind Biomed Art. 2024 Aug 21;7(1):21. doi: 10.1186/s42492-024-00173-8.

Enhanced multimodal medical image fusion via modified DWT with arithmetic optimization algorithm.

Sci Rep. 2024 Aug 20;14(1):19261. doi: 10.1038/s41598-024-69997-x.

ACCURATE REGISTRATION BETWEEN ULTRA-WIDE-FIELD AND NARROW ANGLE RETINA IMAGES WITH 3D EYEBALL SHAPE OPTIMIZATION.

Proc Int Conf Image Proc. 2023 Oct;2023:2750-2754. doi: 10.1109/icip49359.2023.10223163. Epub 2023 Sep 11.

MEMO: dataset and methods for robust multimodal retinal image registration with large or small vessel density differences.

Biomed Opt Express. 2024 Apr 30;15(5):3457-3479. doi: 10.1364/BOE.516481. eCollection 2024 May 1.

L2NLF: a novel linear-to-nonlinear framework for multi-modal medical image registration.

Biomed Eng Lett. 2024 Jan 10;14(3):497-509. doi: 10.1007/s13534-023-00344-1. eCollection 2024 May.

Ultra-wide field and new wide field composite retinal image registration with AI-enabled pipeline and 3D distortion correction algorithm.

Eye (Lond). 2024 Apr;38(6):1189-1195. doi: 10.1038/s41433-023-02868-3. Epub 2023 Dec 19.

A robust and interpretable deep learning framework for multi-modal registration via keypoints.

Med Image Anal. 2023 Dec;90:102962. doi: 10.1016/j.media.2023.102962. Epub 2023 Sep 13.

Efficacy and accuracy of artificial intelligence to overlay multimodal images from different optical instruments in patients with retinitis pigmentosa.

Clin Exp Ophthalmol. 2023 Jul;51(5):446-452. doi: 10.1111/ceo.14234. Epub 2023 Apr 26.

本文引用的文献

Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration.

Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:1360-1363. doi: 10.1109/EMBC44109.2020.9175613.

Weakly-Supervised Vessel Detection in Ultra-Widefield Fundus Photography via Iterative Multi-Modal Registration and Learning.

IEEE Trans Med Imaging. 2021 Oct;40(10):2748-2758. doi: 10.1109/TMI.2020.3027665. Epub 2021 Sep 30.

Multimodal affine registration for ICGA and MCSL fundus images of high myopia.

Biomed Opt Express. 2020 Jul 20;11(8):4443-4457. doi: 10.1364/BOE.393178. eCollection 2020 Aug 1.

Unsupervised 3D End-to-End Medical Image Registration With Volume Tweening Network.

IEEE J Biomed Health Inform. 2020 May;24(5):1394-1404. doi: 10.1109/JBHI.2019.2951024. Epub 2019 Nov 1.

Unsupervised learning of probabilistic diffeomorphic registration for images and surfaces.

Med Image Anal. 2019 Oct;57:226-236. doi: 10.1016/j.media.2019.07.006. Epub 2019 Jul 12.

Vessel Optimal Transport for Automated Alignment of Retinal Fundus Images.

IEEE Trans Image Process. 2019 Dec;28(12):6154-6168. doi: 10.1109/TIP.2019.2925287. Epub 2019 Jul 2.

VoxelMorph: A Learning Framework for Deformable Medical Image Registration.

IEEE Trans Med Imaging. 2019 Feb 4. doi: 10.1109/TMI.2019.2897538.

A deep learning framework for unsupervised affine and deformable image registration.

Med Image Anal. 2019 Feb;52:128-143. doi: 10.1016/j.media.2018.11.010. Epub 2018 Dec 8.

Convolutional Neural Network Architecture for Geometric Matching.

IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2553-2567. doi: 10.1109/TPAMI.2018.2865351. Epub 2018 Aug 13.

Multi-modal and multi-vendor retina image registration.

Biomed Opt Express. 2018 Jan 3;9(2):410-422. doi: 10.1364/BOE.9.000410. eCollection 2018 Feb 1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于深度神经网络的多模态视网膜图像两步配准。

Two-Step Registration on Multi-Modal Retinal Images via Deep Neural Networks.

出版信息

IEEE Trans Image Process. 2022;31:823-838. doi: 10.1109/TIP.2021.3135708. Epub 2022 Jan 4.

DOI:10.1109/TIP.2021.3135708

PMID:34932479

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8912939/

Abstract

摘要

基于深度神经网络的多模态视网膜图像两步配准。

Two-Step Registration on Multi-Modal Retinal Images via Deep Neural Networks.

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于深度神经网络的多模态视网膜图像两步配准。

Two-Step Registration on Multi-Modal Retinal Images via Deep Neural Networks.

出版信息