基于常规临床和实验室数据的肺癌早期识别的机器学习。

Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data.

机构信息

Department of Health Systems Science, Kaiser Permanente Bernard J. Tyson School of Medicine, Pasadena, California.

Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, California.

出版信息

Am J Respir Crit Care Med. 2021 Aug 15;204(4):445-453. doi: 10.1164/rccm.202007-2791OC.

DOI:10.1164/rccm.202007-2791OC

PMID:33823116

Abstract

Most lung cancers are diagnosed at an advanced stage. Presymptomatic identification of high-risk individuals can prompt earlier intervention and improve long-term outcomes. To develop a model to predict a future diagnosis of lung cancer on the basis of routine clinical and laboratory data by using machine learning. We assembled data from 6,505 case patients with non-small cell lung cancer (NSCLC) and 189,597 contemporaneous control subjects and compared the accuracy of a novel machine learning model with a modified version of the well-validated 2012 Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial risk model (mPLCOm2012), by using the area under the receiver operating characteristic curve (AUC), sensitivity, and diagnostic odds ratio (OR) as measures of model performance. Among ever-smokers in the test set, a machine learning model was more accurate than the mPLCOm2012 for identifying NSCLC 9-12 months before clinical diagnosis ( < 0.00001) and demonstrated an AUC of 0.86, a diagnostic OR of 12.3, and a sensitivity of 40.1% at a predefined specificity of 95%. In comparison, the mPLCOm2012 demonstrated an AUC of 0.79, an OR of 7.4, and a sensitivity of 27.9% at the same specificity. The machine learning model was more accurate than standard eligibility criteria for lung cancer screening and more accurate than the mPLCOm2012 when applied to a screening-eligible population. Influential model variables included known risk factors and novel predictors such as white blood cell and platelet counts. A machine learning model was more accurate for early diagnosis of NSCLC than either standard eligibility criteria for screening or the mPLCOm2012, demonstrating the potential to help prevent lung cancer deaths through early detection.

摘要

大多数肺癌在晚期才被诊断出来。对高危人群进行无症状识别可以促使更早的干预，并改善长期预后。我们使用机器学习方法，基于常规临床和实验室数据来建立预测未来肺癌诊断的模型。我们汇集了 6505 例非小细胞肺癌（NSCLC）病例患者和 189597 例同期对照患者的数据，通过接受者操作特征曲线（ROC）下面积（AUC）、敏感性和诊断比值比（OR）来比较新型机器学习模型和经过改良的、经过充分验证的 2012 年前列腺癌、肺癌、结直肠癌和卵巢癌筛查试验风险模型（mPLCOm2012）的准确性，以评估模型性能。在测试集中的既往吸烟者中，与 mPLCOm2012 相比，机器学习模型在临床诊断前 9-12 个月识别 NSCLC 的准确性更高（ < 0.00001），其 AUC 为 0.86，诊断 OR 为 12.3，特异性为 95%时的敏感性为 40.1%。相比之下，mPLCOm2012 的 AUC 为 0.79，OR 为 7.4，特异性为 95%时的敏感性为 27.9%。与标准的肺癌筛查入选标准相比，该机器学习模型更准确，与应用于筛查合格人群的 mPLCOm2012 相比，该模型更准确。有影响力的模型变量包括已知的危险因素和新的预测指标，如白细胞和血小板计数。与标准的筛查入选标准或 mPLCOm2012 相比，机器学习模型更能准确地预测早期 NSCLC，这表明通过早期发现有可能预防肺癌死亡。

相似文献

Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data.基于常规临床和实验室数据的肺癌早期识别的机器学习。

Am J Respir Crit Care Med. 2021 Aug 15;204(4):445-453. doi: 10.1164/rccm.202007-2791OC.

Assessing eligibility for lung cancer screening using parsimonious ensemble machine learning models: A development and validation study.采用简约集成机器学习模型评估肺癌筛查的资格：一项开发和验证研究。

PLoS Med. 2023 Oct 3;20(10):e1004287. doi: 10.1371/journal.pmed.1004287. eCollection 2023 Oct.

Diagnostic Value of Serum miR-182, miR-183, miR-210, and miR-126 Levels in Patients with Early-Stage Non-Small Cell Lung Cancer.血清miR-182、miR-183、miR-210和miR-126水平在早期非小细胞肺癌患者中的诊断价值

PLoS One. 2016 Apr 19;11(4):e0153046. doi: 10.1371/journal.pone.0153046. eCollection 2016.

Early Colorectal Cancer Detected by Machine Learning Model Using Gender, Age, and Complete Blood Count Data.利用性别、年龄和全血细胞计数数据的机器学习模型检测早期结直肠癌

Dig Dis Sci. 2017 Oct;62(10):2719-2727. doi: 10.1007/s10620-017-4722-8. Epub 2017 Aug 23.

Risk prediction models for selection of lung cancer screening candidates: A retrospective validation study.用于选择肺癌筛查候选者的风险预测模型：一项回顾性验证研究。

PLoS Med. 2017 Apr 4;14(4):e1002277. doi: 10.1371/journal.pmed.1002277. eCollection 2017 Apr.

Machine learning-based radiomics strategy for prediction of cell proliferation in non-small cell lung cancer.基于机器学习的放射组学策略预测非小细胞肺癌细胞增殖。

Eur J Radiol. 2019 Sep;118:32-37. doi: 10.1016/j.ejrad.2019.06.025. Epub 2019 Jun 28.

Machine Learning for Early Discrimination Between Lung Cancer and Benign Nodules Using Routine Clinical and Laboratory Data.基于常规临床和实验室数据的机器学习早期鉴别肺癌与良性结节。

Ann Surg Oncol. 2024 Nov;31(12):7738-7749. doi: 10.1245/s10434-024-15762-3. Epub 2024 Jul 16.

OWL: an optimized and independently validated machine learning prediction model for lung cancer screening based on the UK Biobank, PLCO, and NLST populations.OWL：一种基于英国生物银行、PLCO 和 NLST 人群的肺癌筛查的优化和独立验证的机器学习预测模型。

EBioMedicine. 2023 Feb;88:104443. doi: 10.1016/j.ebiom.2023.104443. Epub 2023 Jan 24.

Blood test shows high accuracy in detecting stage I non-small cell lung cancer.血液检测显示出在检测 I 期非小细胞肺癌方面的高度准确性。

BMC Cancer. 2020 Feb 21;20(1):137. doi: 10.1186/s12885-020-6625-x.

Serum lipidomic biomarkers for non-small cell lung cancer in nonsmoking female patients.非吸烟女性非小细胞肺癌的血清脂质组学生物标志物。

J Pharm Biomed Anal. 2020 Jun 5;185:113220. doi: 10.1016/j.jpba.2020.113220. Epub 2020 Feb 29.

引用本文的文献

The Impact of Artificial Intelligence on Lung Cancer Diagnosis and Personalized Treatment.人工智能对肺癌诊断及个性化治疗的影响

Int J Mol Sci. 2025 Aug 31;26(17):8472. doi: 10.3390/ijms26178472.

Development and Validation of a Clinlabomics-Based Nomogram for Predicting the Prognosis of Small Cell Lung Cancer in China: A Multicenter, Retrospective Cohort Study.基于临床实验室组学的列线图预测中国小细胞肺癌预后的开发与验证：一项多中心回顾性队列研究

Cancer Med. 2025 Sep;14(17):e71180. doi: 10.1002/cam4.71180.

ESR Essentials: lung cancer screening with low-dose CT-practice recommendations by the European Society of Thoracic Imaging.红细胞沉降率要点：欧洲胸部影像学会关于低剂量CT肺癌筛查的实践建议

Eur Radiol. 2025 Aug 23. doi: 10.1007/s00330-025-11910-9.

Progress and challenges of artificial intelligence in lung cancer clinical translation.人工智能在肺癌临床转化中的进展与挑战

NPJ Precis Oncol. 2025 Jul 1;9(1):210. doi: 10.1038/s41698-025-00986-7.

T cell receptor profiling of blood to detect lung cancer.通过血液进行T细胞受体分析以检测肺癌。

Cancer Immunol Res. 2025 Jul 1. doi: 10.1158/2326-6066.CIR-24-1109.

Clinical Prediction Models Incorporating Blood Test Trend for Cancer Detection: Systematic Review, Meta-Analysis, and Critical Appraisal.纳入血液检测趋势用于癌症检测的临床预测模型：系统评价、荟萃分析和批判性评估

JMIR Cancer. 2025 Jun 27;11:e70275. doi: 10.2196/70275.

Optimizing Strategy for Lung Cancer Screening: From Risk Prediction to Clinical Decision Support.肺癌筛查的优化策略：从风险预测到临床决策支持

JCO Clin Cancer Inform. 2025 May;9:e2400291. doi: 10.1200/CCI-24-00291. Epub 2025 May 7.

AutoCOPD-A novel and practical machine learning model for COPD detection using whole-lung inspiratory quantitative CT measurements: a retrospective, multicenter study.AutoCOPD——一种使用全肺吸气定量CT测量进行慢性阻塞性肺疾病（COPD）检测的新型实用机器学习模型：一项回顾性多中心研究

EClinicalMedicine. 2025 Apr 3;82:103166. doi: 10.1016/j.eclinm.2025.103166. eCollection 2025 Apr.

Advancements in Diagnostics and Therapeutics for Cancer of Unknown Primary in the Era of Precision Medicine.精准医学时代未知原发灶癌症的诊断与治疗进展

MedComm (2020). 2025 Apr 16;6(5):e70161. doi: 10.1002/mco2.70161. eCollection 2025 May.

Deep learning-based identification of patients at increased risk of cancer using routine laboratory markers.利用常规实验室指标，基于深度学习识别患癌风险增加的患者。

Sci Rep. 2025 Apr 12;15(1):12661. doi: 10.1038/s41598-025-97331-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于常规临床和实验室数据的肺癌早期识别的机器学习。

Machine Learning for Early Lung Cancer Identification Using Routine Clinical and Laboratory Data.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献