Zhan Yuxin, Song Feipeng, Zhang Wenjia, Gong Tong, Zhao Shuai, Lv Fajin
School of Science, Chongqing University of Technology, Chongqing, China.
Department of Radiology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China.
Front Immunol. 2024 Nov 20;15:1446511. doi: 10.3389/fimmu.2024.1446511. eCollection 2024.
The aim of this study was to develop and validate a prediction model for classification of pulmonary nodules based on preoperative CT imaging.
A data set of Centers 1 (training set: 2633; internal testing set: 1129); Center 2 and Center 3 (external testing set: 218) of patients with pulmonary nodule cases was retrospectively collected. Handcrafted features were extracted from noncontrast chest CT scans by three senior radiologists. A total of 22 clinically handcrafted parameters (age, gender, L-RADS, and PNI-GARS et al.) were used to construct machine learning models (random forest, gradient boosting, and explainable boosting) for the classification of preoperative pulmonary nodules, and the parameters of the model were adjusted to achieve optimal performance. To evaluate the prediction capacity of each model. Both 5-fold cross-validation and 10-fold cross-validation were used to test the robustness of the models.
The explainable boosting model had the best performance on our constructed data. The model achieves an accuracy of 89.9%, a precision of 97.48%, a specificity of 89.5%, a sensitivity of 91.1%, and an AUC of 90.3%. In human-machine comparison, the AUC of machine learning models (90.4%, 95% CI: 85.5%-94.8%) was significantly improved compared to radiologists (60%, 95% CI: 50%-71.4%).
The explainable boosting model exhibited superior performance on our dataset, achieving high accuracy and precision in the diagnosis of pulmonary nodules compared to experienced radiologists.
本研究旨在开发并验证一种基于术前CT影像的肺结节分类预测模型。
回顾性收集了中心1(训练集:2633例;内部测试集:1129例)、中心2和中心3(外部测试集:218例)的肺结节患者数据集。由三位资深放射科医生从非增强胸部CT扫描中提取手工特征。共使用22个临床手工参数(年龄、性别、L-RADS和PNI-GARS等)构建用于术前肺结节分类的机器学习模型(随机森林、梯度提升和可解释提升),并调整模型参数以实现最佳性能。为评估每个模型的预测能力,采用5折交叉验证和10折交叉验证来测试模型的稳健性。
可解释提升模型在我们构建的数据上表现最佳。该模型的准确率为89.9%,精确率为97.48%,特异性为89.5%,灵敏度为91.1%,AUC为90.3%。在人机比较中,机器学习模型的AUC(90.4%,95%CI:85.5%-94.8%)相比放射科医生(60%,95%CI:50%-71.4%)有显著提高。
可解释提升模型在我们的数据集中表现出卓越性能,与经验丰富的放射科医生相比,在肺结节诊断中实现了高精度和高精确率。