Xu Isaac R L, Van Booven Derek J, Goberdhan Sankalp, Breto Adrian, Porto Joao, Alhusseini Mohammad, Algohary Ahmad, Stoyanova Radka, Punnen Sanoj, Mahne Anton, Arora Himanshu
John P Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, FL 33136, USA.
College of Medicine, University of Central Florida, Orlando, FL 32816, USA.
J Pers Med. 2023 Mar 18;13(3):547. doi: 10.3390/jpm13030547.
The recent integration of open-source data with machine learning models, especially in the medical field, has opened new doors to studying disease progression and/or regression. However, the ability to use medical data for machine learning approaches is limited by the specificity of data for a particular medical condition. In this context, the most recent technologies, like generative adversarial networks (GANs), are being looked upon as a potential way to generate high-quality synthetic data that preserve the clinical variability of a condition. However, despite some success, GAN model usage remains largely minimal when depicting the heterogeneity of a disease such as prostate cancer. Previous studies from our group members have focused on automating the quantitative multi-parametric magnetic resonance imaging (mpMRI) using habitat risk scoring (HRS) maps on the prostate cancer patients in the BLaStM trial. In the current study, we aimed to use the images from the BLaStM trial and other sources to train the GAN models, generate synthetic images, and validate their quality. In this context, we used T2-weighted prostate MRI images as training data for Single Natural Image GANs (SinGANs) to make a generative model. A deep learning semantic segmentation pipeline trained the model to segment the prostate boundary on 2D MRI slices. Synthetic images with a high-level segmentation boundary of the prostate were filtered and used in the quality control assessment by participating scientists with varying degrees of experience (more than ten years, one year, or no experience) to work with MRI images. Results showed that the most experienced participating group correctly identified conventional vs. synthetic images with 67% accuracy, the group with one year of experience correctly identified the images with 58% accuracy, and the group with no prior experience reached 50% accuracy. Nearly half (47%) of the synthetic images were mistakenly evaluated as conventional. Interestingly, in a blinded quality assessment, a board-certified radiologist did not significantly differentiate between conventional and synthetic images in the context of the mean quality of synthetic and conventional images. Furthermore, to validate the usability of the generated synthetic images from prostate cancer MRIs, we subjected these to anomaly detection along with the original images. Importantly, the success rate of anomaly detection for quality control-approved synthetic data in phase one corresponded to that of the conventional images. In sum, this study shows promise that high-quality synthetic images from MRIs can be generated using GANs. Such an AI model may contribute significantly to various clinical applications which involve supervised machine-learning approaches.
开源数据与机器学习模型的近期整合,尤其是在医学领域,为研究疾病进展和/或消退打开了新的大门。然而,将医学数据用于机器学习方法的能力受到特定医学状况数据特异性的限制。在这种背景下,诸如生成对抗网络(GANs)等最新技术被视为生成高质量合成数据的潜在方法,这些数据能保留某种状况的临床变异性。然而,尽管取得了一些成功,但在描绘前列腺癌等疾病的异质性时,GAN模型的使用仍然很少。我们团队成员之前的研究专注于在BLaStM试验中,使用栖息地风险评分(HRS)图对前列腺癌患者的定量多参数磁共振成像(mpMRI)进行自动化处理。在当前研究中,我们旨在使用BLaStM试验及其他来源的图像来训练GAN模型、生成合成图像并验证其质量。在此背景下,我们使用T2加权前列腺MRI图像作为单自然图像GAN(SinGANs)的训练数据来构建生成模型。一个深度学习语义分割管道对该模型进行训练,以在二维MRI切片上分割前列腺边界。具有前列腺高级分割边界的合成图像经过筛选,并由具有不同经验程度(超过十年、一年或无经验)的参与研究的科学家用于质量控制评估,这些科学家均有处理MRI图像的经验。结果显示,经验最丰富的参与组正确识别传统图像与合成图像的准确率为67%,有一年经验的组正确识别图像的准确率为58%,而无经验的组准确率达到50%。近一半(47%)的合成图像被错误地评估为传统图像。有趣的是,在一项盲法质量评估中,一位获得委员会认证的放射科医生在合成图像和传统图像的平均质量方面,并未显著区分传统图像和合成图像。此外,为了验证从前列腺癌MRI生成的合成图像是否可用,我们将这些合成图像与原始图像一起进行异常检测。重要的是,在第一阶段,经质量控制批准的合成数据的异常检测成功率与传统图像相当。总之,这项研究表明使用GANs生成高质量MRI合成图像具有前景。这样的人工智能模型可能会对涉及监督式机器学习方法的各种临床应用做出重大贡献。