OWL：一种基于英国生物银行、PLCO 和 NLST 人群的肺癌筛查的优化和独立验证的机器学习预测模型。

OWL: an optimized and independently validated machine learning prediction model for lung cancer screening based on the UK Biobank, PLCO, and NLST populations.

机构信息

Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, China.

Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA; Pulmonary and Critical Care Division, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02114, USA.

出版信息

EBioMedicine. 2023 Feb;88:104443. doi: 10.1016/j.ebiom.2023.104443. Epub 2023 Jan 24.

DOI:10.1016/j.ebiom.2023.104443

PMID:36701900

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9881220/

Abstract

BACKGROUND

A reliable risk prediction model is critically important for identifying individuals with high risk of developing lung cancer as candidates for low-dose chest computed tomography (LDCT) screening. Leveraging a cutting-edge machine learning technique that accommodates a wide list of questionnaire-based predictors, we sought to optimize and validate a lung cancer prediction model.

METHODS

We developed an Optimized early Warning model for Lung cancer risk (OWL) using the XGBoost algorithm with 323,344 participants from the England area in UK Biobank (training set), and independently validated it with 93,227 participants from UKB Scotland and Wales area (validation set 1), as well as 70,605 and 66,231 participants in the Prostate, Lung, Colorectal, and Ovarian cancer screening trial (PLCO) control and intervention subpopulations, respectively (validation sets 2 & 3) and 23,138 and 18,669 participants in the United States National Lung Screening Trial (NLST) control and intervention subpopulations, respectively (validation sets 4 & 5). By comparing with three competitive prediction models, i.e., PLCO modified 2012 (PLCO), PLCO modified 2014 (PLCO), and the Liverpool Lung cancer Project risk model version 3 (LLPv3), we assessed the discrimination of OWL by the area under receiver operating characteristic curve (AUC) at the designed time point. We further evaluated the calibration using relative improvement in the ratio of expected to observed lung cancer cases (RI), and illustrated the clinical utility by the decision curve analysis.

FINDINGS

For general population, with validation set 1, OWL (AUC = 0.855, 95% CI: 0.829-0.880) presented a better discriminative capability than PLCO (AUC = 0.821, 95% CI: 0.794-0.848) (p < 0.001); with validation sets 2 & 3, AUC of OWL was comparable to PLCO (AUC-AUC < 1%). For ever-smokers, OWL outperformed PLCO and PLCO among ever-smokers in validation set 1 (AUC = 0.842, 95% CI: 0.814-0.871; AUC = 0.792, 95% CI: 0.760-0.823; AUC = 0.791, 95% CI: 0.760-0.822, all p < 0.001). OWL remained comparable to PLCO and PLCO in discrimination (AUC difference from -0.014 to 0.008) among the ever-smokers in validation sets 2 to 5. In all the validation sets, OWL outperformed LLPv3 among the general population and the ever-smokers. Of note, OWL showed significantly better calibration than PLCO, PLCO (RI from 43.1% to 92.3%, all p < 0.001), and LLPv3 (RI from 41.4% to 98.7%, all p < 0.001) in most cases. For clinical utility, OWL exhibited significant improvement in average net benefits (NB) over PLCO in validation set 1 (NB improvement: 32, p < 0.001); among ever smokers of validation set 1, OWL (average NB = 289) retained significant improvement over PLCO (average NB = 213) (p < 0.001). OWL had equivalent NBs with PLCO and PLCO in PLCO and NLST populations, while outperforming LLPv3 in the three populations.

INTERPRETATION

OWL, with a high degree of predictive accuracy and robustness, is a general framework with scientific justifications and clinical utility that can aid in screening individuals with high risks of lung cancer.

FUNDING

National Natural Science Foundation of China, the US NIH.

摘要

背景

对于识别出患有肺癌风险较高的个体，并将其作为低剂量胸部计算机断层扫描（LDCT）筛查的候选者，一个可靠的风险预测模型至关重要。通过利用一种先进的机器学习技术，该技术可以容纳基于问卷的广泛预测因子列表，我们旨在优化和验证肺癌预测模型。

方法

我们使用 XGBoost 算法在 UK Biobank 的英格兰地区（训练集）的 323344 名参与者中开发了一种名为 Optimized early Warning model for Lung cancer risk（OWL）的模型，并在 UKB 苏格兰和威尔士地区（验证集 1）的 93227 名参与者、前列腺癌、肺癌、结直肠癌和卵巢癌筛查试验（PLCO）对照组和干预组（验证集 2 和 3）中的 70605 名和 66231 名参与者以及美国国家肺癌筛查试验（NLST）对照组和干预组（验证集 4 和 5）中的 23138 名和 18669 名参与者中进行了独立验证。通过与三个有竞争力的预测模型，即 PLCO modified 2012（PLCO）、PLCO modified 2014（PLCO）和 Liverpool Lung cancer Project risk model version 3（LLPv3）进行比较，我们评估了 OWL 在预定时间点的接收者操作特征曲线（ROC）下面积（AUC）的区分能力。我们还使用预期与观察到的肺癌病例的比值的相对改善（RI）来评估校准情况，并通过决策曲线分析说明了临床实用性。

结果

对于一般人群，在验证集 1 中，OWL（AUC=0.855，95%CI：0.829-0.880）在区分能力方面优于 PLCO（AUC=0.821，95%CI：0.794-0.848）（p<0.001）；在验证集 2 和 3 中，OWL 的 AUC 与 PLCO 相当（AUC-AUC<1%）。对于一直吸烟者，OWL 在验证集 1 中优于 PLCO 和 PLCO（ever-smokers）（AUC=0.842，95%CI：0.814-0.871；AUC=0.792，95%CI：0.760-0.823；AUC=0.791，95%CI：0.760-0.822，均 p<0.001）。在验证集 2 到 5 中，OWL 在区分能力方面与 PLCO 和 PLCO 相当（AUC 差值在 0.014 到 0.008 之间）。在所有验证集中，OWL 在一般人群和一直吸烟者中的表现均优于 LLPv3。值得注意的是，OWL 在大多数情况下都显示出比 PLCO 和 PLCO 更好的校准（RI 从 43.1%到 92.3%，均 p<0.001）和 LLPv3（RI 从 41.4%到 98.7%，均 p<0.001）。在临床实用性方面，OWL 在验证集 1 中表现出比 PLCO 显著更高的平均净收益（NB）（NB 提高：32，p<0.001）；在验证集 1 中的一直吸烟者中，OWL（平均 NB=289）与 PLCO（平均 NB=213）相比保持了显著的提高（p<0.001）。OWL 在 PLCO 和 NLST 人群中与 PLCO 和 PLCO 具有相当的 NB，而在这三个人群中均优于 LLPv3。

解释

OWL 具有高度的预测准确性和稳健性，是一种具有科学依据和临床实用性的通用框架，可以帮助筛选出患有肺癌风险较高的个体。

资金

国家自然科学基金、美国 NIH。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/915e/9881220/61ba78a9aea1/gr1.jpg

相似文献

OWL: an optimized and independently validated machine learning prediction model for lung cancer screening based on the UK Biobank, PLCO, and NLST populations.OWL：一种基于英国生物银行、PLCO 和 NLST 人群的肺癌筛查的优化和独立验证的机器学习预测模型。

EBioMedicine. 2023 Feb;88:104443. doi: 10.1016/j.ebiom.2023.104443. Epub 2023 Jan 24.

Deep Learning Using Chest Radiographs to Identify High-Risk Smokers for Lung Cancer Screening Computed Tomography: Development and Validation of a Prediction Model.利用胸部X光片进行深度学习以识别肺癌筛查计算机断层扫描的高危吸烟者：预测模型的开发与验证

Ann Intern Med. 2020 Nov 3;173(9):704-713. doi: 10.7326/M20-1868. Epub 2020 Sep 1.

Assessing eligibility for lung cancer screening using parsimonious ensemble machine learning models: A development and validation study.采用简约集成机器学习模型评估肺癌筛查的资格：一项开发和验证研究。

PLoS Med. 2023 Oct 3;20(10):e1004287. doi: 10.1371/journal.pmed.1004287. eCollection 2023 Oct.

Risk prediction models for selection of lung cancer screening candidates: A retrospective validation study.用于选择肺癌筛查候选者的风险预测模型：一项回顾性验证研究。

PLoS Med. 2017 Apr 4;14(4):e1002277. doi: 10.1371/journal.pmed.1002277. eCollection 2017 Apr.

Predicting the future risk of lung cancer: development, and internal and external validation of the CanPredict (lung) model in 19·67 million people and evaluation of model performance against seven other risk prediction models.预测肺癌未来风险：CanPredict（肺部）模型在 1967 万人中的开发、内部和外部验证以及该模型与其他七个风险预测模型的性能评估。

Lancet Respir Med. 2023 Aug;11(8):685-697. doi: 10.1016/S2213-2600(23)00050-4. Epub 2023 Apr 5.

Selection criteria for lung-cancer screening.肺癌筛查的选择标准。

N Engl J Med. 2013 Feb 21;368(8):728-36. doi: 10.1056/NEJMoa1211776.

Evaluation of the lung cancer risks at which to screen ever- and never-smokers: screening rules applied to the PLCO and NLST cohorts.评估曾经吸烟和从不吸烟人群的肺癌筛查风险：应用于PLCO和NLST队列的筛查规则

PLoS Med. 2014 Dec 2;11(12):e1001764. doi: 10.1371/journal.pmed.1001764. eCollection 2014 Dec.

Implications of Nine Risk Prediction Models for Selecting Ever-Smokers for Computed Tomography Lung Cancer Screening.九种风险预测模型对选择持续吸烟者进行计算机断层扫描肺癌筛查的影响。

Ann Intern Med. 2018 Jul 3;169(1):10-19. doi: 10.7326/M17-2701. Epub 2018 May 15.

Identifying high risk individuals for targeted lung cancer screening: Independent validation of the PLCO risk prediction tool.确定高危人群进行有针对性的肺癌筛查：PLCO 风险预测工具的独立验证。

Int J Cancer. 2017 Jul 15;141(2):242-253. doi: 10.1002/ijc.30673. Epub 2017 Apr 21.

Evaluation of risk prediction models to select lung cancer screening participants in Europe: a prospective cohort consortium analysis.评估风险预测模型以选择欧洲的肺癌筛查参与者：一项前瞻性队列联盟分析。

Lancet Digit Health. 2024 Sep;6(9):e614-e624. doi: 10.1016/S2589-7500(24)00123-7.

引用本文的文献

External Validation of Lung Cancer Prediction Models Combining Epidemiological Predictors in Chinese Ever and Never Smokers: Guangzhou Biobank Cohort Study.结合中国曾经吸烟者和从不吸烟者的流行病学预测因素的肺癌预测模型的外部验证：广州生物样本库队列研究

Cancer Med. 2025 Aug;14(15):e71104. doi: 10.1002/cam4.71104.

Assessment and recalibration of seventeen lung cancer risk prediction models in approximately one million Chinese population utilising healthcare big data: a retrospective cohort analysis.利用医疗大数据对约100万中国人群中的17种肺癌风险预测模型进行评估与重新校准：一项回顾性队列分析

Lancet Reg Health West Pac. 2025 May 16;58:101575. doi: 10.1016/j.lanwpc.2025.101575. eCollection 2025 May.

Improving Lung Cancer Risk Prediction Using Machine Learning: A Comparative Analysis of Stacking Models and Traditional Approaches.使用机器学习改善肺癌风险预测：堆叠模型与传统方法的比较分析

Cancers (Basel). 2025 May 13;17(10):1651. doi: 10.3390/cancers17101651.

A systematic review and meta-analysis of lung cancer risk prediction models.肺癌风险预测模型的系统评价与荟萃分析

Acta Oncol. 2025 May 12;64:661-671. doi: 10.2340/1651-226X.2025.42529.

Predictive performance of risk prediction models for lung cancer incidence in Western and Asian countries: a systematic review and meta-analysis.西方国家和亚洲国家肺癌发病风险预测模型的预测性能：一项系统评价和荟萃分析。

Sci Rep. 2025 Mar 4;15(1):4259. doi: 10.1038/s41598-024-83875-6.

Head-to-head comparisons of risk discrimination by questionnaire-based lung cancer risk prediction models: a systematic review and meta-analysis.基于问卷的肺癌风险预测模型风险辨别能力的直接比较：一项系统评价和荟萃分析。

EClinicalMedicine. 2025 Jan 30;80:103075. doi: 10.1016/j.eclinm.2025.103075. eCollection 2025 Feb.

Gut Microbiota as Mediator and Moderator Between Hepatitis B Virus and Hepatocellular Carcinoma: A Prospective Study.肠道微生物群作为乙型肝炎病毒与肝细胞癌之间的介导者和调节者：一项前瞻性研究

Cancer Med. 2024 Dec;13(24):e70454. doi: 10.1002/cam4.70454.

Interpretable machine learning model for digital lung cancer prescreening in Chinese populations with missing data.用于中国人群中存在缺失数据的数字肺癌预筛查的可解释机器学习模型。

NPJ Digit Med. 2024 Nov 19;7(1):327. doi: 10.1038/s41746-024-01309-z.

Lancet Digit Health. 2024 Sep;6(9):e614-e624. doi: 10.1016/S2589-7500(24)00123-7.

Radiomics based on multiple machine learning methods for diagnosing early bone metastases not visible on CT images.基于多种机器学习方法的放射组学用于诊断CT图像上不可见的早期骨转移。

Skeletal Radiol. 2025 Feb;54(2):335-343. doi: 10.1007/s00256-024-04752-x. Epub 2024 Jul 19.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

OWL：一种基于英国生物银行、PLCO 和 NLST 人群的肺癌筛查的优化和独立验证的机器学习预测模型。

OWL: an optimized and independently validated machine learning prediction model for lung cancer screening based on the UK Biobank, PLCO, and NLST populations.

机构信息

出版信息

BACKGROUND

METHODS

FINDINGS

INTERPRETATION

FUNDING

背景

方法

结果

解释

资金

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献