• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的肺结节良恶性辅助诊断:一项回顾性研究

Auxiliary Diagnosis of Pulmonary Nodules' Benignancy and Malignancy Based on Machine Learning: A Retrospective Study.

作者信息

Wang Wanling, Yang Bingqing, Wu Huan, Che Hebin, Tong Yue, Zhang Bozun, Liu Hongwu, Chen Yuanyuan

机构信息

Medical Innovation Research Department of PLA General Hospital, Beijing, People's Republic of China.

Goodwill Hessian Health Technology Co. Ltd, Beijing, People's Republic of China.

出版信息

J Multidiscip Healthc. 2025 Jun 27;18:3735-3748. doi: 10.2147/JMDH.S518166. eCollection 2025.

DOI:10.2147/JMDH.S518166
PMID:40600198
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12212436/
Abstract

BACKGROUND

Lung cancer, one of the most lethal malignancies globally, often presents insidiously as pulmonary nodules. Its nonspecific clinical presentation and heterogeneous imaging characteristics hinder accurate differentiation between benign and malignant lesions, while biopsy's invasiveness and procedural constraints underscore the critical need for non-invasive early diagnostic approaches.

METHODS

In this retrospective study, we analyzed outpatient and inpatient records from the First Medical Center of Chinese PLA General Hospital between 2011 and 2021, focusing on pulmonary nodules measuring 5-30mm on CT scans without overt signs of malignancy. Pathological examination served as the reference standard. Comparative experiments evaluated SVM, RF, XGBoost, FNN, and Atten_FNN using five-fold cross-validation to assess AUC, sensitivity, and specificity. The dataset was split 70%/30%, and stratified five-fold cross-validation was applied to the training set. The optimal model was interpreted with SHAP to identify the most influential predictive features.

RESULTS

This study enrolled 3355 patients, including 1156 with benign and 2199 with malignant pulmonary nodules. The Atten_FNN model demonstrated superior performance in five-fold cross-validation, achieving an AUC of 0.82, accuracy of 0.75, sensitivity of 0.77, and F1 score of 0.80. SHAP analysis revealed key predictive factors: demographic variables (age, sex, BMI), CT-derived features (maximum nodule diameter, morphology, density, calcification, ground-glass opacity), and laboratory biomarkers (neuroendocrine markers, carcinoembryonic antigen).

CONCLUSION

This study integrates electronic medical records and pathology data to predict pulmonary nodule malignancy using machine/deep learning models. SHAP-based interpretability analysis uncovered key clinical determinants. Acknowledging limitations in cross-center generalizability, we propose the development of a multimodal diagnostic systems that combines CT imaging and radiomics, to be validated in multi-center prospective cohorts to facilitate clinical translation. This framework establishes a novel paradigm for early precision diagnosis of lung cancer.

摘要

背景

肺癌是全球最致命的恶性肿瘤之一,常隐匿表现为肺结节。其非特异性临床表现和异质性影像学特征阻碍了良性和恶性病变的准确鉴别,而活检的侵入性和操作限制凸显了对非侵入性早期诊断方法的迫切需求。

方法

在这项回顾性研究中,我们分析了解放军总医院第一医学中心2011年至2021年的门诊和住院记录,重点关注CT扫描中直径为5 - 30mm且无明显恶性征象的肺结节。病理检查作为参考标准。采用五折交叉验证的比较实验评估支持向量机(SVM)、随机森林(RF)、极端梯度提升(XGBoost)、全连接神经网络(FNN)和注意力全连接神经网络(Atten_FNN),以评估曲线下面积(AUC)、敏感性和特异性。数据集按70%/30%划分,并对训练集应用分层五折交叉验证。使用SHAP对最优模型进行解释,以识别最具影响力的预测特征。

结果

本研究纳入3355例患者,其中良性肺结节1156例,恶性肺结节2199例。Atten_FNN模型在五折交叉验证中表现优异,AUC为0.82,准确率为0.75,敏感性为0.77,F1评分为0.80。SHAP分析揭示了关键预测因素:人口统计学变量(年龄、性别、体重指数);CT衍生特征(最大结节直径、形态、密度、钙化、磨玻璃影);以及实验室生物标志物(神经内分泌标志物、癌胚抗原)。

结论

本研究整合电子病历和病理数据,使用机器学习/深度学习模型预测肺结节的恶性程度。基于SHAP的可解释性分析揭示了关键临床决定因素。认识到跨中心可推广性的局限性,我们建议开发一种结合CT成像和放射组学的多模态诊断系统,并在多中心前瞻性队列中进行验证,以促进临床转化。该框架为肺癌的早期精准诊断建立了一种新范式。

相似文献

1
Auxiliary Diagnosis of Pulmonary Nodules' Benignancy and Malignancy Based on Machine Learning: A Retrospective Study.基于机器学习的肺结节良恶性辅助诊断:一项回顾性研究
J Multidiscip Healthc. 2025 Jun 27;18:3735-3748. doi: 10.2147/JMDH.S518166. eCollection 2025.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Development and interpretation of machine learning-based prognostic models for predicting high-risk prognostic pathological components in pulmonary nodules: integrating clinical features, serum tumor marker and imaging features.基于机器学习的预测肺结节高危预后病理成分的预后模型的开发与解读:整合临床特征、血清肿瘤标志物和影像特征
J Cancer Res Clin Oncol. 2025 Jun 17;151(6):190. doi: 10.1007/s00432-025-06241-7.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Non-contrast computed tomography radiomics model to predict benign and malignant thyroid nodules with lobe segmentation: A dual-center study.用于预测甲状腺良恶性结节并进行叶分割的非增强计算机断层扫描影像组学模型:一项双中心研究。
World J Radiol. 2025 Jun 28;17(6):106682. doi: 10.4329/wjr.v17.i6.106682.
6
Construction and validation of HBV-ACLF bacterial infection diagnosis model based on machine learning.基于机器学习的HBV-ACLF细菌感染诊断模型的构建与验证
BMC Infect Dis. 2025 Jul 1;25(1):847. doi: 10.1186/s12879-025-11199-5.
7
AI-based Hepatic Steatosis Detection and Integrated Hepatic Assessment from Cardiac CT Attenuation Scans Enhances All-cause Mortality Risk Stratification: A Multi-center Study.基于人工智能的心脏CT衰减扫描检测肝脂肪变性及综合肝脏评估可增强全因死亡风险分层:一项多中心研究
medRxiv. 2025 Jun 11:2025.06.09.25329157. doi: 10.1101/2025.06.09.25329157.
8
Intraoperative frozen section analysis for the diagnosis of early stage ovarian cancer in suspicious pelvic masses.术中冰冻切片分析用于诊断可疑盆腔肿块中的早期卵巢癌。
Cochrane Database Syst Rev. 2016 Mar 1;3(3):CD010360. doi: 10.1002/14651858.CD010360.pub2.
9
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
10
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.

本文引用的文献

1
Evaluation of Pulmonary Nodules by Radiologists vs. Radiomics in Stand-Alone and Complementary CT and MRI.放射科医生与影像组学在独立及辅助CT和MRI中对肺结节的评估
Diagnostics (Basel). 2024 Feb 23;14(5):483. doi: 10.3390/diagnostics14050483.
2
Analyzing prehospital delays in recurrent acute ischemic stroke: Insights from interpretable machine learning.分析复发性急性缺血性脑卒中的院前延误:可解释机器学习的新见解。
Patient Educ Couns. 2024 Jun;123:108228. doi: 10.1016/j.pec.2024.108228. Epub 2024 Mar 4.
3
Global cancer burden growing, amidst mounting need for services.
全球癌症负担不断增加,对服务的需求也日益迫切。
Saudi Med J. 2024 Mar;45(3):326-327.
4
Pilot intervention to increase uptake of lung cancer screening through the emergency department.通过急诊科提高肺癌筛查接受率的试点干预措施。
Am J Emerg Med. 2024 May;79:157-160. doi: 10.1016/j.ajem.2024.02.019. Epub 2024 Feb 20.
5
Radiomics analysis to predict pulmonary nodule malignancy using machine learning approaches.基于机器学习的影像组学分析预测肺结节良恶性。
Thorax. 2024 Mar 15;79(4):307-315. doi: 10.1136/thorax-2023-220226.
6
Associations of long-term exposure to air pollution with prevalence of pulmonary nodules: A cross-sectional study in Shijiazhuang, China.长期暴露于空气污染与肺结节患病率的关联:中国石家庄的一项横断面研究。
Ecotoxicol Environ Saf. 2023 Jul 31;262:115311. doi: 10.1016/j.ecoenv.2023.115311.
7
On the Analyses of Medical Images Using Traditional Machine Learning Techniques and Convolutional Neural Networks.关于使用传统机器学习技术和卷积神经网络对医学图像进行分析
Arch Comput Methods Eng. 2023;30(5):3173-3233. doi: 10.1007/s11831-023-09899-9. Epub 2023 Apr 4.
8
CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization.CircSSNN:使用带有预归一化的序列自注意力神经网络进行 circRNA 结合位点预测。
BMC Bioinformatics. 2023 May 30;24(1):220. doi: 10.1186/s12859-023-05352-7.
9
Multi-omics integrated circulating cell-free DNA genomic signatures enhanced the diagnostic performance of early-stage lung cancer and postoperative minimal residual disease.多组学整合循环无细胞 DNA 基因组特征提高了早期肺癌和术后微小残留病的诊断性能。
EBioMedicine. 2023 May;91:104553. doi: 10.1016/j.ebiom.2023.104553. Epub 2023 Apr 5.
10
Artificial intelligence in lung cancer diagnosis and prognosis: Current application and future perspective.人工智能在肺癌诊断和预后中的应用:现状与未来展望。
Semin Cancer Biol. 2023 Feb;89:30-37. doi: 10.1016/j.semcancer.2023.01.006. Epub 2023 Jan 20.