Liang Zhu, Li Jiamin, He Shuyan, Li Siyuan, Cai Runzhi, Chen Chunyuan, Zhang Yan, Deng Biao, Wu Yanxia
Department of Thoracic Surgery, Affiliated Hospital of Guangdong Medical University, Zhanjiang, Guangdong, China.
The First Clinical Medical College, Guangdong Medical University, Zhanjiang, Guangdong, China.
Med Phys. 2025 Jul;52(7):e17892. doi: 10.1002/mp.17892. Epub 2025 May 19.
Thymomas, though rare, present a wide range of clinical behaviors, from indolent to aggressive forms, making accurate risk stratification crucial for treatment planning. Traditional methods such as histopathology and radiological assessments often lack the ability to capture tumor heterogeneity, which can impact prognosis. Radiomics, combined with machine learning, provides a method to extract and analyze quantitative imaging features, offering the potential to improve tumor classification and risk prediction. By segmenting tumors into distinct habitat zones, it becomes possible to assess intratumoral heterogeneity more effectively. This study employs radiomics and machine learning techniques to enhance thymoma risk prediction, aiming to improve diagnostic consistency and reduce variability in radiologists' assessments.
This study aims to identify different habitat zones within thymomas through CT imaging feature analysis and to establish a predictive model to differentiate between high and low-risk thymomas. Additionally, the study explores how this model can assist radiologists.
We obtained CT imaging data from 133 patients with thymoma who were treated at the Affiliated Hospital of Guangdong Medical University from 2015 to 2023. Images from the plain scan phase, venous phase, arterial phase, and their differential images (subtracted images) were used. Tumor regions were segmented into three habitat zones using K-Means clustering. Imaging features from each habitat zone were extracted using the PyRadiomics (van Griethuysen, 2017) library. The 28 most distinguishing features were selected through Mann-Whitney U tests (Mann, 1947) and Spearman's correlation analysis (Spearman, 1904). Five predictive models were built using the same machine learning algorithm (Support Vector Machine [SVM]): Habitat1, Habitat2, Habitat3 (trained on features from individual tumor habitat regions), Habitat All (trained on combined features from all regions), and Intra (trained on intratumoral features), and their performances were evaluated for comparison. The models' diagnostic outcomes were compared with the diagnoses of four radiologists (two junior and two experienced physicians).
The AUC (area under curve) for habitat zone 1 was 0.818, for habitat zone 2 was 0.732, and for habitat zone 3 was 0.763. The comprehensive model, which combined data from all habitat zones, achieved an AUC of 0.960, outperforming the model based on traditional radiomic features (AUC of 0.720). The model significantly improved the diagnostic accuracy of all four radiologists. The AUCs for junior radiologists 1 and 2 increased from 0.747 and 0.775 to 0.932 and 0.972, respectively, while for experienced radiologists 1 and 2, the AUCs increased from 0.932 and 0.859 to 0.977 and 0.972, respectively.
This study successfully identified distinct habitat zones within thymomas through CT imaging feature analysis and developed an efficient predictive model that significantly improved diagnostic accuracy. This model offers a novel tool for risk assessment of thymomas and can aid in guiding clinical decision-making.
胸腺瘤虽罕见,但呈现出广泛的临床行为,从惰性到侵袭性形式都有,这使得准确的风险分层对于治疗规划至关重要。传统方法如组织病理学和放射学评估往往缺乏捕捉肿瘤异质性的能力,而肿瘤异质性会影响预后。放射组学与机器学习相结合,提供了一种提取和分析定量影像特征的方法,有望改善肿瘤分类和风险预测。通过将肿瘤分割成不同的生境区域,可以更有效地评估肿瘤内异质性。本研究采用放射组学和机器学习技术来增强胸腺瘤风险预测,旨在提高诊断一致性并减少放射科医生评估中的变异性。
本研究旨在通过CT影像特征分析识别胸腺瘤内不同的生境区域,并建立一个预测模型以区分高风险和低风险胸腺瘤。此外,该研究还探索了此模型如何协助放射科医生。
我们获取了2015年至2023年在广东医科大学附属医院接受治疗的133例胸腺瘤患者的CT影像数据。使用平扫期、静脉期、动脉期图像及其差值图像(相减图像)。利用K均值聚类将肿瘤区域分割为三个生境区域。使用PyRadiomics(van Griethuysen,2017)库提取每个生境区域的影像特征。通过曼-惠特尼U检验(Mann,1947)和斯皮尔曼相关性分析(Spearman,1904)选择28个最具区分性的特征。使用相同的机器学习算法(支持向量机[SVM])构建五个预测模型:生境1、生境2、生境(3在单个肿瘤生境区域的特征上进行训练)、全生境(在所有区域的组合特征上进行训练)和瘤内(在肿瘤内特征上进行训练),并对它们的性能进行评估以作比较。将模型的诊断结果与四位放射科医生(两位初级医生和两位经验丰富的医生)的诊断进行比较。
生境区域1的曲线下面积(AUC)为0.818,生境区域2为0.732,生境区域3为0.763。结合所有生境区域数据的综合模型的AUC为0.960,优于基于传统放射组学特征的模型(AUC为0.720)。该模型显著提高了所有四位放射科医生的诊断准确性。初级放射科医生1和2的AUC分别从0.747和0.775提高到0.932和0.972,而经验丰富的放射科医生1和2的AUC分别从0.932和0.859提高到0.977和0.972。
本研究通过CT影像特征分析成功识别了胸腺瘤内不同的生境区域,并开发了一个有效提高诊断准确性的预测模型。该模型为胸腺瘤风险评估提供了一种新工具,有助于指导临床决策。