Respiratory Department, Zhoupu Hospital, Shanghai University of Medicine and Health Sciences, Shanghai, China.
College of Medical Instrumentation and Collaborative Innovation Canter, Shanghai University of Medicine and Health Sciences, Shanghai, China.
J Imaging Inform Med. 2024 Oct;37(5):2135-2148. doi: 10.1007/s10278-024-00967-5. Epub 2024 Apr 2.
This study aimed to develop an interpretable diagnostic model for subtyping of pulmonary adenocarcinoma, including minimally invasive adenocarcinoma (MIA), adenocarcinoma in situ (AIS), and invasive adenocarcinoma (IAC), by integrating 3D-radiomic features and clinical data. Data from multiple hospitals were collected, and 10 key features were selected from 1600 3D radiomic signatures and 11 radiological features. Diverse decision rules were extracted using ensemble learning methods (gradient boosting, random forest, and AdaBoost), fused, ranked, and selected via RuleFit and SHAP to construct a rule-based diagnostic model. The model's performance was evaluated using AUC, precision, accuracy, recall, and F1-score and compared with other models. The rule-based diagnostic model exhibited excellent performance in the training, testing, and validation cohorts, with AUC values of 0.9621, 0.9529, and 0.8953, respectively. This model outperformed counterparts relying solely on selected features and previous research models. Specifically, the AUC values for the previous research models in the three cohorts were 0.851, 0.893, and 0.836. It is noteworthy that individual models employing GBDT, random forest, and AdaBoost demonstrated AUC values of 0.9391, 0.8681, and 0.9449 in the training cohort, 0.9093, 0.8722, and 0.9363 in the testing cohort, and 0.8440, 0.8640, and 0.8750 in the validation cohort, respectively. These results highlight the superiority of the rule-based diagnostic model in the assessment of lung adenocarcinoma subtypes, while also providing insights into the performance of individual models. Integrating diverse decision rules enhanced the accuracy and interpretability of the diagnostic model for lung adenocarcinoma subtypes. This approach bridges the gap between complex predictive models and clinical utility, offering valuable support to healthcare professionals and patients.
本研究旨在通过整合 3D 放射组学特征和临床数据,开发一种用于肺腺癌亚型分类的可解释诊断模型,包括微浸润性腺癌(MIA)、原位腺癌(AIS)和浸润性腺癌(IAC)。该研究从多家医院收集数据,从 1600 个 3D 放射组学特征和 11 个影像学特征中选择了 10 个关键特征。使用集成学习方法(梯度提升、随机森林和 AdaBoost)提取不同的决策规则,通过 RuleFit 和 SHAP 融合、排序和选择,构建基于规则的诊断模型。使用 AUC、精度、准确性、召回率和 F1 评分评估模型性能,并与其他模型进行比较。基于规则的诊断模型在训练、测试和验证队列中表现出优异的性能,AUC 值分别为 0.9621、0.9529 和 0.8953。该模型优于仅依赖于选定特征和先前研究模型的模型。具体来说,三个队列中先前研究模型的 AUC 值分别为 0.851、0.893 和 0.836。值得注意的是,在训练队列中,单个模型使用 GBDT、随机森林和 AdaBoost 分别表现出 0.9391、0.8681 和 0.9449 的 AUC 值,在测试队列中,分别表现出 0.9093、0.8722 和 0.9363 的 AUC 值,在验证队列中,分别表现出 0.8440、0.8640 和 0.8750 的 AUC 值。这些结果突出了基于规则的诊断模型在评估肺腺癌亚型方面的优越性,同时也提供了对单个模型性能的洞察。整合不同的决策规则提高了肺腺癌亚型诊断模型的准确性和可解释性。这种方法弥合了复杂预测模型和临床实用性之间的差距,为医疗保健专业人员和患者提供了有价值的支持。