Computer Assisted Clinical Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany; Mannheim Institute for Intelligent Systems in Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany.
Computer Assisted Clinical Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany; Mannheim Institute for Intelligent Systems in Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany.
Z Med Phys. 2024 May;34(2):291-317. doi: 10.1016/j.zemedi.2023.05.003. Epub 2023 Jun 22.
Multimodal image registration is applied in medical image analysis as it allows the integration of complementary data from multiple imaging modalities. In recent years, various neural network-based approaches for medical image registration have been presented in papers, but due to the use of different datasets, a fair comparison is not possible. In this research 20 different neural networks for an affine registration of medical images were implemented. The networks' performance and the networks' generalizability to new datasets were evaluated using two multimodal datasets - a synthetic and a real patient dataset - of three-dimensional CT and MR images of the liver. The networks were first trained semi-supervised using the synthetic dataset and then evaluated on the synthetic dataset and the unseen patient dataset. Afterwards, the networks were finetuned on the patient dataset and subsequently evaluated on the patient dataset. The networks were compared using our own developed CNN as benchmark and a conventional affine registration with SimpleElastix as baseline. Six networks improved the pre-registration Dice coefficient of the synthetic dataset significantly (p-value < 0.05) and nine networks improved the pre-registration Dice coefficient of the patient dataset significantly and are therefore able to generalize to the new datasets used in our experiments. Many different machine learning-based methods have been proposed for affine multimodal medical image registration, but few are generalizable to new data and applications. It is therefore necessary to conduct further research in order to develop medical image registration techniques that can be applied more widely.
多模态图像配准在医学图像分析中得到了广泛应用,因为它允许整合来自多种成像模态的互补数据。近年来,已有许多基于神经网络的医学图像配准方法在论文中提出,但由于使用了不同的数据集,因此无法进行公平的比较。在这项研究中,我们实现了 20 种用于医学图像仿射配准的神经网络。我们使用两个多模态数据集——一个合成数据集和一个真实患者数据集——对三维 CT 和 MR 肝脏图像进行评估,以评估网络的性能和对新数据集的泛化能力。这些网络首先使用合成数据集进行半监督训练,然后在合成数据集和未见过的患者数据集上进行评估。之后,这些网络在患者数据集上进行微调,并在患者数据集上进行后续评估。我们使用自己开发的 CNN 作为基准,并使用 SimpleElastix 进行常规仿射配准,对这些网络进行了比较。有 6 个网络显著提高了合成数据集的预配准 Dice 系数(p 值<0.05),有 9 个网络显著提高了患者数据集的预配准 Dice 系数,因此能够泛化到我们实验中使用的新数据集。已经提出了许多基于机器学习的方法用于仿射多模态医学图像配准,但很少有方法可以推广到新的数据和应用。因此,有必要进一步研究以开发更广泛应用的医学图像配准技术。