Department of Thoracic Surgery, The Affiliated Hospital of Qingdao University, Qingdao, China.
Health management center, The Affiliated Hospital of Qingdao University, Qingdao, China.
Thorac Cancer. 2021 Dec;12(23):3130-3140. doi: 10.1111/1759-7714.14140. Epub 2021 Oct 28.
To develop and validate a risk prediction nomogram based on a deep learning convolutional neural networks (CNN) model and epidemiological characteristics for lung cancer screening in patients with small pulmonary nodules (SPN).
This study included three data sets. First, a CNN model was developed and tested on data set 1. Then, a hybrid prediction model was developed on data set 2 by multivariable binary logistic regression analysis. We combined the CNN model score and the selected epidemiological risk factors, and a risk prediction nomogram was presented. An independent multicenter cohort was used for model external validation. The performance of the nomogram was assessed with respect to its calibration and discrimination.
The final hybrid model included the CNN model score and the screened risk factors included age, gender, smoking status and family history of cancer. The nomogram showed good discrimination and calibration with an area under the curve (AUC) of 91.6% (95% CI: 89.4%-93.5%), compare with the CNN model, the improvement was significance. The performance of the nomogram still showed good discrimination and good calibration in the multicenter validation cohort, with an AUC of 88.3% (95% CI: 83.1%-92.3%).
Our study showed that epidemiological characteristics should be considered in lung cancer screening, which can significantly improve the efficiency of the artificial intelligence (AI) model alone. We combined the CNN model score with Asian lung cancer epidemiological characteristics to develop a new nomogram to facilitate and accurately perform individualized lung cancer screening, especially for Asians.
为了开发和验证一种基于深度学习卷积神经网络(CNN)模型和流行病学特征的肺癌筛查小肺结节(SPN)风险预测列线图。
本研究包括三个数据集。首先,在数据集 1 上开发和测试了 CNN 模型。然后,通过多变量二分类逻辑回归分析在数据集 2 上开发了混合预测模型。我们结合了 CNN 模型评分和选定的流行病学危险因素,并提出了风险预测列线图。一个独立的多中心队列用于模型外部验证。通过校准和区分来评估列线图的性能。
最终的混合模型包括 CNN 模型评分和筛查的危险因素,包括年龄、性别、吸烟状况和癌症家族史。该列线图具有良好的判别力和校准度,曲线下面积(AUC)为 91.6%(95%CI:89.4%-93.5%),与 CNN 模型相比,差异有统计学意义。该列线图在多中心验证队列中仍具有良好的判别力和良好的校准度,AUC 为 88.3%(95%CI:83.1%-92.3%)。
本研究表明,在肺癌筛查中应考虑流行病学特征,这可以显著提高人工智能(AI)模型的效率。我们结合 CNN 模型评分和亚洲肺癌流行病学特征,开发了一种新的列线图,以方便和准确地进行个体化肺癌筛查,特别是针对亚洲人。