Chen Boqi, Niethammer Marc
Department of Computer Science, University of North Carolina at Chapel Hill.
Med Image Comput Comput Assist Interv. 2023 Oct;14229:271-281. doi: 10.1007/978-3-031-43999-5_26. Epub 2023 Oct 1.
Multiple imaging modalities are often used for disease diagnosis, prediction, or population-based analyses. However, not all modalities might be available due to cost, different study designs, or changes in imaging technology. If the differences between the types of imaging are small, data harmonization approaches can be used; for larger changes, direct image synthesis approaches have been explored. In this paper, we develop an approach based on multi-modal metric learning to synthesize images of diverse modalities. We use metric learning via multi-modal image retrieval, resulting in embeddings that can relate images of different modalities. Given a large image database, the learned image embeddings allow us to use k-nearest neighbor (-NN) regression for image synthesis. Our driving medical problem is knee osteoarthritis (KOA), but our developed method is general after proper image alignment. We test our approach by synthesizing cartilage thickness maps obtained from 3D magnetic resonance (MR) images using 2D radiographs. Our experiments show that the proposed method outperforms direct image synthesis and that the synthesized thickness maps retain information relevant to downstream tasks such as progression prediction and Kellgren-Lawrence grading (KLG). Our results suggest that retrieval approaches can be used to obtain high-quality and meaningful image synthesis results given large image databases.
多种成像方式常用于疾病诊断、预测或基于人群的分析。然而,由于成本、不同的研究设计或成像技术的变化,并非所有成像方式都可用。如果成像类型之间的差异较小,可以使用数据协调方法;对于较大的变化,则探索了直接图像合成方法。在本文中,我们开发了一种基于多模态度量学习的方法来合成不同模态的图像。我们通过多模态图像检索使用度量学习,得到能够关联不同模态图像的嵌入。给定一个大型图像数据库,学习到的图像嵌入使我们能够使用k近邻(-NN)回归进行图像合成。我们的驱动医学问题是膝关节骨关节炎(KOA),但经过适当的图像对齐后,我们开发的方法具有通用性。我们通过使用二维X线照片合成从三维磁共振(MR)图像获得的软骨厚度图来测试我们的方法。我们的实验表明,所提出的方法优于直接图像合成,并且合成的厚度图保留了与下游任务(如进展预测和凯尔格伦-劳伦斯分级(KLG))相关的信息。我们的结果表明,在有大型图像数据库的情况下,检索方法可用于获得高质量且有意义的图像合成结果。