Lung Cancer Detection Using Bayesian Networks: A Retrospective Development and Validation Study on a Danish Population of High-Risk Individuals.

Authors

Henriksen Margrethe Bang, Van Daalen Florian, Wee Leonard, Hansen Torben Frøstrup, Jensen Lars Henrik, Brasen Claus Lohman, Hilberg Ole, Bermejo Inigo

Affiliations

Department of Oncology, Vejle University Hospital, Vejle, Denmark.

Institute of Regional Health Research, University of Southern Denmark, Odense, Denmark.

Publication information

Cancer Med. 2025 Feb;14(3):e70458. doi: 10.1002/cam4.70458.

DOI:10.1002/cam4.70458

PMID:39887592

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11783238/

Abstract

BACKGROUND

Lung cancer (LC) is the top cause of cancer deaths globally, prompting many countries to adopt LC screening programs. While screening typically relies on age and smoking intensity, more efficient risk models exist. We devised a Bayesian network (BN) for LC detection, testing its resilience with varying degrees of missing data and comparing it to a prior machine learning (ML) model.

METHODS

We analyzed data from 9940 patients referred for LC assessment in Southern Denmark from 2009 to 2018. Variables included age, sex, smoking, and lab results. Our experiments varied missing data (0%-30%), BN structure (expert-based vs. data-driven), and discretization method (standard vs. data-driven).
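
To illustrate why a Bayesian network can tolerate missing inputs, the following is a minimal sketch, not the study's model. It assumes a toy network in which a single LungCancer node is the parent of three discretized variables. The structure, the cut-points in `discretize_age`, the variable names, and every probability are invented for illustration. In a network of this shape, an unobserved child node sums to 1 over its values, so it can simply be left out of the product.

```python
# Toy discrete Bayesian network: LungCancer -> {smoking, age_group, crp}.
# ASSUMPTION: structure, cut-points, and all probabilities are made up
# for illustration; none of them come from the study.

PRIOR = {True: 0.2, False: 0.8}  # hypothetical P(LungCancer)

# Hypothetical conditional probability tables: P(value | LungCancer state)
CPT = {
    "smoking": {
        True: {"current": 0.5, "former": 0.4, "never": 0.1},
        False: {"current": 0.3, "former": 0.4, "never": 0.3},
    },
    "age_group": {
        True: {"<60": 0.2, "60-70": 0.4, ">70": 0.4},
        False: {"<60": 0.4, "60-70": 0.35, ">70": 0.25},
    },
    "crp": {
        True: {"normal": 0.4, "elevated": 0.6},
        False: {"normal": 0.7, "elevated": 0.3},
    },
}


def discretize_age(age):
    """Map age in years to a bin using fixed (hypothetical) cut-points."""
    if age is None:
        return None  # keep missing values missing
    if age < 60:
        return "<60"
    if age <= 70:
        return "60-70"
    return ">70"


def posterior_cancer(evidence):
    """Return P(LungCancer=True | evidence).

    Variables that are absent from `evidence` or set to None are
    marginalized out, which here means skipping their factor.
    """
    joint = {}
    for state in (True, False):
        p = PRIOR[state]
        for feature, value in evidence.items():
            if value is None:
                continue  # missing child node contributes a factor of 1
            p *= CPT[feature][state][value]
        joint[state] = p
    return joint[True] / (joint[True] + joint[False])


full = {"smoking": "current", "age_group": discretize_age(72), "crp": "elevated"}
partial = {"smoking": "current", "age_group": discretize_age(72), "crp": None}

print(round(posterior_cancer(full), 3))     # all three variables observed
print(round(posterior_cancer(partial), 3))  # lab value missing
print(round(posterior_cancer({}), 3))       # nothing observed: falls back to the prior
```

With no evidence the function returns the prior, and each additional observation updates it. This is the behavior that lets such a model produce a risk estimate for a patient with incomplete records, without a separate imputation step.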

RESULTS

Across all missing data levels, area under the curve (AUC) remained steady, ranging from 0.737 to 0.757, compared to the ML model's AUC of 0.77. BN structure and discretization method had minimal impact on performance. BNs were well calibrated overall, with a net benefit in decision curve analysis when predicted risk exceeded 5%.
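
As a sketch of how net benefit in decision curve analysis is usually computed, the example below uses the standard formula: net benefit = TP/n - (FP/n) * pt/(1 - pt), where pt is the risk threshold. The labels and predicted risks are invented, and the function names are hypothetical. The abstract does not describe the authors' implementation, so this should be read as a generic illustration.

```python
# Net benefit at a single risk threshold, as used in decision curve analysis.
# ASSUMPTION: the data below are invented; they are not the study's results.


def net_benefit(y_true, y_prob, threshold):
    """Net benefit of referring every patient whose predicted risk >= threshold."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    # False positives are weighted by the odds of the threshold.
    return tp / n - (fp / n) * threshold / (1 - threshold)


def net_benefit_treat_all(y_true, threshold):
    """Net benefit of the reference strategy that refers everyone."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Hypothetical outcomes (1 = lung cancer) and predicted risks for ten patients
y_true = [1, 1, 0, 0, 0, 1, 0, 0, 0, 0]
y_prob = [0.9, 0.6, 0.4, 0.1, 0.03, 0.3, 0.2, 0.02, 0.7, 0.04]

threshold = 0.05  # the 5% risk level mentioned in the results
model_nb = net_benefit(y_true, y_prob, threshold)
treat_all_nb = net_benefit_treat_all(y_true, threshold)

print(round(model_nb, 4), round(treat_all_nb, 4))
```

A model is considered clinically useful at a given threshold when its net benefit exceeds both the refer-everyone strategy and the refer-no-one strategy, whose net benefit is 0. A decision curve repeats this calculation over a range of thresholds.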

CONCLUSION

BN models showed resilience with up to 30% missing values. Moreover, these BNs exhibited similar performance, calibration, and clinical utility compared to the machine learning model developed using the same dataset. Considering their effectiveness in handling missing data, BNs emerge as a relevant method for the development of future lung cancer detection models.
