Wang Yida, Gao Ankang, Yang Hongxi, Bai Jie, Zhao Guohua, Zhang Huiting, Song Yang, Wang Chenglong, Zhang Yong, Cheng Jingliang, Yang Guang
Shanghai Key Laboratory of Magnetic Resonance, School of Physics and Electronic Science, East China Normal University, Shanghai, China.
Institute of Magnetic Resonance and Molecular Imaging in Medicine, East China Normal University, Shanghai, China.
Sci Rep. 2025 Jan 28;15(1):3591. doi: 10.1038/s41598-025-87778-y.
Prediction of isocitrate dehydrogenase (IDH) mutation status and epilepsy occurrence are important to glioma patients. Although machine learning models have been constructed for both issues, the correlation between them has not been explored. Our study aimed to exploit this correlation to improve the performance of both of the IDH mutation status identification and epilepsy diagnosis models in patients with glioma II-IV. 399 patients were retrospectively enrolled and divided into a training (n = 279) and an independent test (n = 120) cohort. Multi-center dataset (n = 228) from The Cancer Imaging Archive (TCIA) was used for external test for identification of IDH mutation status. Region of interests comprising the entire tumor and peritumoral edema were automatically segmented using a pre-trained deep learning model. Radiomic features were extracted from T1-weighted, T2-weighted, post-Gadolinium T1 weighted, and T2 fluid-attenuated inversion recovery images. We proposed an iterative approach derived from LASSO to select features shared by two tasks and features specific to each task, before using them to construct the final models. Receiver operating characteristic (ROC) analysis was employed to evaluate the model. The IDH mutation identification model achieved area under the ROC curve (AUC) values of 0.948, 0.946 and 0.860 on the training, internal test, and external test cohorts, respectively. The epilepsy diagnosis model achieved AUCs of 0.924 and 0.880 on the training and internal test cohorts, respectively. The proposed models can identify IDH status and epilepsy with fewer features, thus having better interpretability and lower risk of overfitting. This not only improves its chance of application in clinical settings, but also provides a new scheme to study multiple correlated clinical tasks.
预测异柠檬酸脱氢酶(IDH)突变状态和癫痫发作对胶质瘤患者很重要。尽管已经针对这两个问题构建了机器学习模型,但它们之间的相关性尚未得到探索。我们的研究旨在利用这种相关性来提高II-IV级胶质瘤患者IDH突变状态识别模型和癫痫诊断模型的性能。回顾性纳入399例患者,并将其分为训练队列(n = 279)和独立测试队列(n = 120)。来自癌症影像存档(TCIA)的多中心数据集(n = 228)用于IDH突变状态识别的外部测试。使用预训练的深度学习模型自动分割包含整个肿瘤和瘤周水肿的感兴趣区域。从T1加权、T2加权、钆增强T1加权和T2液体衰减反转恢复图像中提取放射组学特征。我们提出了一种源自套索回归的迭代方法,用于选择两个任务共有的特征以及每个任务特有的特征,然后使用它们构建最终模型。采用受试者操作特征(ROC)分析来评估模型。IDH突变识别模型在训练队列、内部测试队列和外部测试队列上的ROC曲线下面积(AUC)值分别为0.948、0.946和0.860。癫痫诊断模型在训练队列和内部测试队列上的AUC分别为0.924和0.880。所提出的模型可以用更少的特征识别IDH状态和癫痫,因此具有更好的可解释性和更低的过拟合风险。这不仅提高了其在临床环境中应用的机会,还为研究多个相关临床任务提供了一种新方案。