Machine Learning for Early Discrimination Between Lung Cancer and Benign Nodules Using Routine Clinical and Laboratory Data.

Affiliation

Department of Laboratory Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.

Publication information

Ann Surg Oncol. 2024 Nov;31(12):7738-7749. doi: 10.1245/s10434-024-15762-3. Epub 2024 Jul 16.

DOI:10.1245/s10434-024-15762-3

PMID:39014163

Abstract

BACKGROUND

Lung cancer poses a global health threat necessitating early detection and precise staging for improved patient outcomes. This study focuses on developing and validating a machine learning-based risk model for early lung cancer screening and staging, using routine clinical data.

METHODS

Observational, retrospective studies were conducted at two medical centers, involving 2312 patients with lung cancer and 653 patients with benign nodules. Machine learning techniques, including differential analysis and feature selection, were employed to identify key factors for modeling. The study focused on variables such as nodule density, carcinoembryonic antigen (CEA), age, and lifestyle habits. A logistic regression model was used for early diagnosis, and an XGBoost model was used for staging based on the selected features.
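As a rough illustration of the kind of logistic regression classifier described above, the following minimal sketch fits one by batch gradient descent using only the Python standard library. It is not the authors' pipeline: the two standardized features, the toy values, and the names `fit_logistic` and `predict_proba` are all hypothetical, and no feature selection is performed.

```python
import math
from typing import List, Sequence, Tuple


def sigmoid(z: float) -> float:
    """Logistic function mapping a real-valued score to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(w: Sequence[float], b: float, x: Sequence[float]) -> float:
    """Predicted probability of the positive class for one feature vector."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def fit_logistic(X: Sequence[Sequence[float]], y: Sequence[int],
                 lr: float = 0.1, epochs: int = 2000) -> Tuple[List[float], float]:
    """Fit weights and bias by batch gradient descent on the mean log-loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(w, b, xi) - yi  # derivative of log-loss w.r.t. the score
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Hypothetical, already-standardized toy features (e.g. two scaled measurements).
# These values are invented for illustration and are not study data.
X = [[1.0, 0.5], [0.5, 1.0], [0.8, 0.2],
     [-1.0, -0.5], [-0.5, -1.0], [-0.8, -0.2]]
y = [1, 1, 1, 0, 0, 0]

w, b = fit_logistic(X, y)
probs = [predict_proba(w, b, x) for x in X]
```

In practice an established library implementation with regularization and cross-validation would normally be preferred; the hand-written loop is shown only to make the model's mechanics explicit.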

RESULTS

For early diagnoses, the Logistic Regression model achieved an area under the curve (AUC) of 0.716 (95% confidence interval [CI] 0.607-0.826), with 0.703 sensitivity and 0.654 specificity. The XGBoost model excelled in distinguishing late-stage from early-stage lung cancer, exhibiting an AUC of 0.913 (95% CI 0.862-0.963), with 0.909 sensitivity and 0.814 specificity. These findings highlight the model's potential for enhancing diagnostic accuracy and staging in lung cancer.
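For readers unfamiliar with the reported metrics, the sketch below shows how AUC, sensitivity, and specificity can be computed from predicted scores. It assumes binary labels (1 = malignant, 0 = benign), a hypothetical decision threshold of 0.5, and invented toy scores; it does not reproduce the study's data or its confidence intervals.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(labels: Sequence[int], scores: Sequence[float],
                            threshold: float) -> Tuple[float, float]:
    """Return (sensitivity, specificity) when scores >= threshold are called positive."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy scores, invented for illustration only.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

print(auc(labels, scores))                           # 0.8125
print(sensitivity_specificity(labels, scores, 0.5))  # (0.75, 0.75)
```

Sensitivity and specificity depend on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is why the two kinds of figures are reported separately.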

CONCLUSION

This study introduces a novel machine learning-based risk model for early lung cancer screening and staging, leveraging routine clinical information and laboratory data. The model shows promise in enhancing accuracy, mitigating overdiagnosis, and improving patient outcomes.
