Consistency of variety of machine learning and statistical models in predicting clinical risks of individual patients: longitudinal cohort study using cardiovascular disease as exemplar.

Authors

Li Yan, Sperrin Matthew, Ashcroft Darren M, van Staa Tjeerd Pieter

Affiliations

Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK.

Centre for Pharmacoepidemiology and Drug Safety, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK.

Publication information

BMJ. 2020 Nov 4;371:m3919. doi: 10.1136/bmj.m3919.

PMID:33148619

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7610202/

Abstract

OBJECTIVE

To assess the consistency of machine learning and statistical techniques in predicting individual level and population level risks of cardiovascular disease and the effects of censoring on risk predictions.

DESIGN

Longitudinal cohort study from 1 January 1998 to 31 December 2018.

SETTING AND PARTICIPANTS

3.6 million patients from the Clinical Practice Research Datalink registered at 391 general practices in England with linked hospital admission and mortality records.

MAIN OUTCOME MEASURES

Model performance including discrimination, calibration, and consistency of individual risk prediction for the same patients among models with comparable model performance. Nineteen prediction techniques were applied: 12 families of machine learning models (each grid searched for its best model), three Cox proportional hazards models (locally fitted, QRISK3, and Framingham), three parametric survival models, and one logistic model.
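As an illustration of the discrimination measure named above, the following minimal sketch computes a Harrell-style C statistic for censored follow-up data. It is not the study's code: the function name `harrell_c`, the toy follow-up times, event indicators, and predicted risks are all assumptions made for the example.

```python
def harrell_c(times, events, risks):
    """Harrell-style concordance for right-censored data.

    A pair (i, j) is comparable when patient i has an observed event
    strictly before patient j's follow-up time. The pair is concordant
    when the patient with the earlier event was given the higher risk.
    Tied risks count as half concordant.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored patient cannot anchor a comparable pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy cohort: follow-up in years, 1 = event observed, 0 = censored.
toy_times = [2, 4, 6, 8]
toy_events = [1, 0, 1, 0]
toy_risks = [0.9, 0.3, 0.05, 0.1]

toy_c = harrell_c(toy_times, toy_events, toy_risks)
print(f"C statistic on toy data: {toy_c:.2f}")
```

Because the statistic depends only on how patients are ranked, two models can share the same C statistic while assigning quite different absolute risks to the same patient.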

RESULTS

The various models had similar population level performance (C statistics of about 0.87 and similar calibration). However, the predictions for individual risks of cardiovascular disease varied widely between and within different types of machine learning and statistical models, especially in patients with higher risks. A patient with a risk of 9.5-10.5% predicted by QRISK3 had a risk of 2.9-9.2% in a random forest and 2.4-7.2% in a neural network. The differences in predicted risks between QRISK3 and a neural network ranged between -23.2% and 0.1% (95% range). Models that ignored censoring (that is, assumed censored patients to be event free) substantially underestimated risk of cardiovascular disease. Of the 223 815 patients with a cardiovascular disease risk above 7.5% with QRISK3, 57.8% would be reclassified below 7.5% when using another model.
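The underestimation caused by ignoring censoring can be illustrated with a small sketch that compares a naive event proportion (censored patients counted as event free) with one minus the Kaplan-Meier survival estimate. The toy data, the 10 year horizon, and the function names are assumptions for illustration only and do not come from the study.

```python
def naive_risk(times, events, horizon):
    """Share of all patients with an observed event by `horizon`.

    Censored patients are treated as event free, which is the
    assumption the abstract warns against.
    """
    observed = sum(1 for t, e in zip(times, events) if e and t <= horizon)
    return observed / len(times)


def kaplan_meier_risk(times, events, horizon):
    """Cumulative risk by `horizon` as 1 minus the Kaplan-Meier survival."""
    survival = 1.0
    event_times = sorted({t for t, e in zip(times, events) if e and t <= horizon})
    for t in event_times:
        # Patients still under follow-up at time t (censored patients
        # leave the risk set after their censoring time).
        at_risk = sum(1 for u in times if u >= t)
        deaths = sum(1 for u, e in zip(times, events) if e and u == t)
        survival *= 1.0 - deaths / at_risk
    return 1.0 - survival


# Hypothetical cohort of 10 patients; 1 = event observed, 0 = censored.
toy_times = [2, 3, 4, 5, 6, 10, 10, 10, 10, 10]
toy_events = [1, 0, 0, 1, 0, 0, 0, 0, 0, 0]

naive = naive_risk(toy_times, toy_events, horizon=10)
km = kaplan_meier_risk(toy_times, toy_events, horizon=10)
print(f"naive risk: {naive:.3f}, Kaplan-Meier risk: {km:.3f}")
```

In this toy cohort the naive estimate is lower than the Kaplan-Meier estimate, because the three patients censored early are kept in the denominator for the full 10 years even though they were observed for only part of that time.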

CONCLUSIONS

A variety of models predicted risks for the same patients very differently despite similar model performances. The logistic models and commonly used machine learning models should not be directly applied to the prediction of long term risks without considering censoring. Survival models that consider censoring and that are explainable, such as QRISK3, are preferable. The level of consistency within and between models should be routinely assessed before they are used for clinical decision making.
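One way such a consistency check could look in practice is sketched below: for the same patients, compare the predictions of two models by the 95% range of their differences and by the share of patients who cross a treatment threshold. The function names, the toy predictions, and the linear-interpolation percentile are assumptions for illustration; only the 7.5% threshold is taken from the abstract.

```python
def percentile(values, q):
    """Percentile by linear interpolation, with q between 0 and 100."""
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (position - lower)


def difference_range(reference, other, lower=2.5, upper=97.5):
    """Central 95% range of (other - reference) risk differences."""
    diffs = [o - r for r, o in zip(reference, other)]
    return percentile(diffs, lower), percentile(diffs, upper)


def reclassified_below(reference, other, threshold=0.075):
    """Share of patients above `threshold` under the reference model
    whose risk falls below it under the other model."""
    above = [o for r, o in zip(reference, other) if r > threshold]
    if not above:
        return 0.0
    return sum(1 for o in above if o < threshold) / len(above)


# Hypothetical 10 year risks for the same four patients from two models.
reference_risks = [0.10, 0.08, 0.05, 0.20]
other_risks = [0.06, 0.09, 0.04, 0.07]

low, high = difference_range(reference_risks, other_risks)
share = reclassified_below(reference_risks, other_risks)
print(f"95% range of differences: {low:.3f} to {high:.3f}")
print(f"reclassified below threshold: {share:.1%}")
```

A wide difference range or a large reclassified share would indicate that the choice of model, rather than the patient's characteristics, is driving the treatment decision.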
