• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测肺癌发病的1年风险:使用缅因州电子健康记录的前瞻性研究

Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine.

作者信息

Wang Xiaofang, Zhang Yan, Hao Shiying, Zheng Le, Liao Jiayu, Ye Chengyin, Xia Minjie, Wang Oliver, Liu Modi, Weng Ching Ho, Duong Son Q, Jin Bo, Alfreds Shaun T, Stearns Frank, Kanov Laura, Sylvester Karl G, Widen Eric, McElhinney Doff B, Ling Xuefeng B

机构信息

Shandong Provincial Key Laboratory of Network Based Intelligent Computing, University of Jinan, Jinan, China.

Department of Surgery, Stanford University, Stanford, CA, United States.

出版信息

J Med Internet Res. 2019 May 16;21(5):e13260. doi: 10.2196/13260.

DOI:10.2196/13260
PMID:31099339
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6542253/
Abstract

BACKGROUND

Lung cancer is the leading cause of cancer death worldwide. Early detection of individuals at risk of lung cancer is critical to reduce the mortality rate.

OBJECTIVE

The aim of this study was to develop and validate a prospective risk prediction model to identify patients at risk of new incident lung cancer within the next 1 year in the general population.

METHODS

Data from individual patient electronic health records (EHRs) were extracted from the Maine Health Information Exchange network. The study population consisted of patients with at least one EHR between April 1, 2016, and March 31, 2018, who had no history of lung cancer. A retrospective cohort (N=873,598) and a prospective cohort (N=836,659) were formed for model construction and validation. An Extreme Gradient Boosting (XGBoost) algorithm was adopted to build the model. It assigned a score to each individual to quantify the probability of a new incident lung cancer diagnosis from October 1, 2016, to September 31, 2017. The model was trained with the clinical profile in the retrospective cohort from the preceding 6 months and validated with the prospective cohort to predict the risk of incident lung cancer from April 1, 2017, to March 31, 2018.

RESULTS

The model had an area under the curve (AUC) of 0.881 (95% CI 0.873-0.889) in the prospective cohort. Two thresholds of 0.0045 and 0.01 were applied to the predictive scores to stratify the population into low-, medium-, and high-risk categories. The incidence of lung cancer in the high-risk category (579/53,922, 1.07%) was 7.7 times higher than that in the overall cohort (1167/836,659, 0.14%). Age, a history of pulmonary diseases and other chronic diseases, medications for mental disorders, and social disparities were found to be associated with new incident lung cancer.

CONCLUSIONS

We retrospectively developed and prospectively validated an accurate risk prediction model of new incident lung cancer occurring in the next 1 year. Through statistical learning from the statewide EHR data in the preceding 6 months, our model was able to identify statewide high-risk patients, which will benefit the population health through establishment of preventive interventions or more intensive surveillance.

摘要

背景

肺癌是全球癌症死亡的主要原因。早期发现肺癌高危个体对于降低死亡率至关重要。

目的

本研究的目的是开发并验证一种前瞻性风险预测模型,以识别普通人群中未来1年内有新发肺癌风险的患者。

方法

从缅因州健康信息交换网络中提取个体患者电子健康记录(EHR)的数据。研究人群包括在2016年4月1日至2018年3月31日期间至少有一份EHR且无肺癌病史的患者。形成了一个回顾性队列(N = 873,598)和一个前瞻性队列(N = 836,659)用于模型构建和验证。采用极端梯度提升(XGBoost)算法构建模型。该模型为每个个体分配一个分数,以量化从2016年10月1日至2017年9月31日新发肺癌诊断的概率。该模型以前6个月回顾性队列中的临床特征进行训练,并以前瞻性队列进行验证,以预测2017年4月1日至2018年3月31日期间肺癌发生的风险。

结果

该模型在前瞻性队列中的曲线下面积(AUC)为0.881(95%CI 0.873 - 0.889)。将预测分数的两个阈值0.0045和0.01应用于对人群进行分层,分为低、中、高风险类别。高风险类别中的肺癌发病率(579/53,922,1.07%)比总体队列中的发病率(1167/836,659,0.14%)高7.7倍。发现年龄、肺部疾病和其他慢性疾病史、精神障碍用药以及社会差异与新发肺癌有关。

结论

我们回顾性开发并前瞻性验证了一个准确的未来1年内新发肺癌风险预测模型。通过对前6个月全州EHR数据的统计学习,我们的模型能够识别全州的高危患者,这将通过建立预防干预措施或更密集的监测使人群健康受益。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/22256cd945f0/jmir_v21i5e13260_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/fe3c38661a56/jmir_v21i5e13260_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/195492f427af/jmir_v21i5e13260_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/ddb70d1b7ea9/jmir_v21i5e13260_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/22256cd945f0/jmir_v21i5e13260_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/fe3c38661a56/jmir_v21i5e13260_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/195492f427af/jmir_v21i5e13260_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/ddb70d1b7ea9/jmir_v21i5e13260_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8d7/6542253/22256cd945f0/jmir_v21i5e13260_fig4.jpg

相似文献

1
Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine.预测肺癌发病的1年风险:使用缅因州电子健康记录的前瞻性研究
J Med Internet Res. 2019 May 16;21(5):e13260. doi: 10.2196/13260.
2
Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning.预测未来一年内高血压的发病情况:一项使用全州电子健康记录和机器学习的前瞻性研究。
J Med Internet Res. 2018 Jan 30;20(1):e22. doi: 10.2196/jmir.9268.
3
Assessing Statewide All-Cause Future One-Year Mortality: Prospective Study With Implications for Quality of Life, Resource Utilization, and Medical Futility.评估全州全因未来一年死亡率:一项对生活质量、资源利用和医疗无效性有影响的前瞻性研究。
J Med Internet Res. 2018 Jun 4;20(6):e10311. doi: 10.2196/10311.
4
Performance of a Machine Learning Algorithm Using Electronic Health Record Data to Identify and Estimate Survival in a Longitudinal Cohort of Patients With Lung Cancer.基于电子健康记录数据的机器学习算法在肺癌纵向队列患者中识别和估计生存的性能。
JAMA Netw Open. 2021 Jul 1;4(7):e2114723. doi: 10.1001/jamanetworkopen.2021.14723.
5
Comment on "Prediction of the 1-Year Risk of Incident Lung Cancer: Prospective Study Using Electronic Health Records from the State of Maine".关于《预测肺癌发病的1年风险:使用缅因州电子健康记录的前瞻性研究》的评论
J Med Internet Res. 2020 Sep 15;22(9):e14944. doi: 10.2196/14944.
6
Estimating One-Year Risk of Incident Chronic Kidney Disease: Retrospective Development and Validation Study Using Electronic Medical Record Data From the State of Maine.估算慢性肾脏病发病的一年风险:利用缅因州电子病历数据进行的回顾性开发与验证研究
JMIR Med Inform. 2017 Jul 26;5(3):e21. doi: 10.2196/medinform.7954.
7
Predicting the future risk of lung cancer: development, and internal and external validation of the CanPredict (lung) model in 19·67 million people and evaluation of model performance against seven other risk prediction models.预测肺癌未来风险:CanPredict(肺部)模型在 1967 万人中的开发、内部和外部验证以及该模型与其他七个风险预测模型的性能评估。
Lancet Respir Med. 2023 Aug;11(8):685-697. doi: 10.1016/S2213-2600(23)00050-4. Epub 2023 Apr 5.
8
Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation.机器学习预测纽约市新冠肺炎患者队列中的死亡率和危急事件:模型开发与验证
J Med Internet Res. 2020 Nov 6;22(11):e24018. doi: 10.2196/24018.
9
Identification of elders at higher risk for fall with statewide electronic health records and a machine learning algorithm.利用全州范围的电子健康记录和机器学习算法识别高风险跌倒的老年人。
Int J Med Inform. 2020 May;137:104105. doi: 10.1016/j.ijmedinf.2020.104105. Epub 2020 Mar 3.
10
Identification of patients at risk of new onset heart failure: Utilizing a large statewide health information exchange to train and validate a risk prediction model.利用大型全州健康信息交换系统识别新发心力衰竭风险患者:训练和验证风险预测模型。
PLoS One. 2021 Dec 10;16(12):e0260885. doi: 10.1371/journal.pone.0260885. eCollection 2021.

引用本文的文献

1
Constructing multicancer risk cohorts using national data from medical helplines and secondary care.利用医疗求助热线和二级医疗保健的国家数据构建多癌风险队列。
NPJ Digit Med. 2025 Aug 27;8(1):551. doi: 10.1038/s41746-025-01855-0.
2
Ethical and secure evidence generation from regionwide clinical data through a collaborative environment for advancing predictive care.通过协作环境从区域范围内的临床数据中生成符合伦理且安全的证据,以推进预测性医疗。
Front Public Health. 2025 Aug 8;13:1630351. doi: 10.3389/fpubh.2025.1630351. eCollection 2025.
3
Predictive performance of risk prediction models for lung cancer incidence in Western and Asian countries: a systematic review and meta-analysis.

本文引用的文献

1
Assessing Statewide All-Cause Future One-Year Mortality: Prospective Study With Implications for Quality of Life, Resource Utilization, and Medical Futility.评估全州全因未来一年死亡率:一项对生活质量、资源利用和医疗无效性有影响的前瞻性研究。
J Med Internet Res. 2018 Jun 4;20(6):e10311. doi: 10.2196/10311.
2
Cancer screening in the United States, 2018: A review of current American Cancer Society guidelines and current issues in cancer screening.美国 2018 年癌症筛查:对当前美国癌症协会指南和癌症筛查中当前问题的综述。
CA Cancer J Clin. 2018 Jul;68(4):297-316. doi: 10.3322/caac.21446. Epub 2018 May 30.
3
Updated statistics of lung and bronchus cancer in United States (2018).
西方国家和亚洲国家肺癌发病风险预测模型的预测性能:一项系统评价和荟萃分析。
Sci Rep. 2025 Mar 4;15(1):4259. doi: 10.1038/s41598-024-83875-6.
4
Lung Cancer Detection Using Bayesian Networks: A Retrospective Development and Validation Study on a Danish Population of High-Risk Individuals.使用贝叶斯网络进行肺癌检测:一项针对丹麦高危人群的回顾性开发与验证研究。
Cancer Med. 2025 Feb;14(3):e70458. doi: 10.1002/cam4.70458.
5
Pulmonologists-level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach.基于标准血液检测结果和吸烟状况,采用可解释的机器学习方法进行肺科医生水平的肺癌检测。
Sci Rep. 2024 Dec 24;14(1):30630. doi: 10.1038/s41598-024-82093-4.
6
Natural language processing and expert follow-up establishes tachycardia association with CDKL5 deficiency disorder.自然语言处理和专家随访确定了心动过速与CDKL5缺乏症之间的关联。
Genet Med Open. 2023 Nov 18;2:100842. doi: 10.1016/j.gimo.2023.100842. eCollection 2024.
7
An Artificial Intelligence (AI)-Integrated Approach to Enhance Early Detection and Personalized Treatment Strategies in Lung Cancer Among Smokers: A Literature Review.一种人工智能(AI)整合方法,用于加强吸烟者肺癌的早期检测和个性化治疗策略:文献综述
Cureus. 2024 Aug 12;16(8):e66688. doi: 10.7759/cureus.66688. eCollection 2024 Aug.
8
Development of Lung Cancer Risk Prediction Machine Learning Models for Equitable Learning Health System: Retrospective Study.用于公平学习健康系统的肺癌风险预测机器学习模型的开发:回顾性研究
JMIR AI. 2024 Sep 11;3:e56590. doi: 10.2196/56590.
9
Prediction of Cancer Symptom Trajectory Using Longitudinal Electronic Health Record Data and Long Short-Term Memory Neural Network.使用纵向电子健康记录数据和长短时记忆神经网络预测癌症症状轨迹。
JCO Clin Cancer Inform. 2024 Mar;8:e2300039. doi: 10.1200/CCI.23.00039.
10
Machine learning-based prediction models for parathyroid carcinoma using pre-surgery cognitive function and clinical features.基于机器学习的甲状旁腺癌术前认知功能和临床特征预测模型。
Sci Rep. 2023 Nov 3;13(1):19007. doi: 10.1038/s41598-023-46294-7.
美国肺癌和支气管癌的最新统计数据(2018年)。
J Thorac Dis. 2018 Mar;10(3):1158-1161. doi: 10.21037/jtd.2018.03.15.
4
A Validated Clinical Risk Prediction Model for Lung Cancer in Smokers of All Ages and Exposure Types: A HUNT Study.一种适用于所有年龄段和吸烟暴露类型的吸烟者肺癌的验证临床风险预测模型:一项 HUNT 研究。
EBioMedicine. 2018 May;31:36-46. doi: 10.1016/j.ebiom.2018.03.027. Epub 2018 Mar 30.
5
Lung cancer in connective tissue disease-associated interstitial lung disease: clinical features and impact on outcomes.结缔组织病相关间质性肺疾病中的肺癌:临床特征及对预后的影响
J Thorac Dis. 2018 Feb;10(2):799-807. doi: 10.21037/jtd.2017.12.134.
6
Lung cancer and socioeconomic status in a pooled analysis of case-control studies.病例对照研究汇总分析中的肺癌与社会经济地位
PLoS One. 2018 Feb 20;13(2):e0192999. doi: 10.1371/journal.pone.0192999. eCollection 2018.
7
Cancer risk associated with chronic diseases and disease markers: prospective cohort study.与慢性病和疾病标志物相关的癌症风险:前瞻性队列研究。
BMJ. 2018 Jan 31;360:k134. doi: 10.1136/bmj.k134.
8
Prediction of Incident Hypertension Within the Next Year: Prospective Study Using Statewide Electronic Health Records and Machine Learning.预测未来一年内高血压的发病情况:一项使用全州电子健康记录和机器学习的前瞻性研究。
J Med Internet Res. 2018 Jan 30;20(1):e22. doi: 10.2196/jmir.9268.
9
Development of a risk prediction model for lung cancer: The Japan Public Health Center-based Prospective Study.肺癌风险预测模型的开发:基于日本公共卫生中心的前瞻性研究。
Cancer Sci. 2018 Mar;109(3):854-862. doi: 10.1111/cas.13509. Epub 2018 Feb 21.
10
Cancer statistics, 2018.癌症统计数据,2018 年。
CA Cancer J Clin. 2018 Jan;68(1):7-30. doi: 10.3322/caac.21442. Epub 2018 Jan 4.