文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Consistency of variety of machine learning and statistical models in predicting clinical risks of individual patients: longitudinal cohort study using cardiovascular disease as exemplar.

作者信息

Li Yan, Sperrin Matthew, Ashcroft Darren M, van Staa Tjeerd Pieter

机构信息

Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, Manchester M13 9PL, UK.

Centre for Pharmacoepidemiology and Drug Safety, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK.

出版信息

BMJ. 2020 Nov 4;371:m3919. doi: 10.1136/bmj.m3919.


DOI:10.1136/bmj.m3919
PMID:33148619
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7610202/
Abstract

OBJECTIVE: To assess the consistency of machine learning and statistical techniques in predicting individual level and population level risks of cardiovascular disease and the effects of censoring on risk predictions. DESIGN: Longitudinal cohort study from 1 January 1998 to 31 December 2018. SETTING AND PARTICIPANTS: 3.6 million patients from the Clinical Practice Research Datalink registered at 391 general practices in England with linked hospital admission and mortality records. MAIN OUTCOME MEASURES: Model performance including discrimination, calibration, and consistency of individual risk prediction for the same patients among models with comparable model performance. 19 different prediction techniques were applied, including 12 families of machine learning models (grid searched for best models), three Cox proportional hazards models (local fitted, QRISK3, and Framingham), three parametric survival models, and one logistic model. RESULTS: The various models had similar population level performance (C statistics of about 0.87 and similar calibration). However, the predictions for individual risks of cardiovascular disease varied widely between and within different types of machine learning and statistical models, especially in patients with higher risks. A patient with a risk of 9.5-10.5% predicted by QRISK3 had a risk of 2.9-9.2% in a random forest and 2.4-7.2% in a neural network. The differences in predicted risks between QRISK3 and a neural network ranged between -23.2% and 0.1% (95% range). Models that ignored censoring (that is, assumed censored patients to be event free) substantially underestimated risk of cardiovascular disease. Of the 223 815 patients with a cardiovascular disease risk above 7.5% with QRISK3, 57.8% would be reclassified below 7.5% when using another model. CONCLUSIONS: A variety of models predicted risks for the same patients very differently despite similar model performances. The logistic models and commonly used machine learning models should not be directly applied to the prediction of long term risks without considering censoring. Survival models that consider censoring and that are explainable, such as QRISK3, are preferable. The level of consistency within and between models should be routinely assessed before they are used for clinical decision making.

摘要
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/f2009fb43635/liy055296.f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/33ae29ec74c9/liy055296.f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/0eb05fd539fa/liy055296.f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/2dcbbd8d2abc/liy055296.f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/f2009fb43635/liy055296.f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/33ae29ec74c9/liy055296.f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/0eb05fd539fa/liy055296.f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/2dcbbd8d2abc/liy055296.f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0329/7610202/f2009fb43635/liy055296.f4.jpg

相似文献

[1]
Consistency of variety of machine learning and statistical models in predicting clinical risks of individual patients: longitudinal cohort study using cardiovascular disease as exemplar.

BMJ. 2020-11-4

[2]
Do population-level risk prediction models that use routinely collected health data reliably predict individual risks?

Sci Rep. 2019-8-2

[3]
Consistency of ranking was evaluated as new measure for prediction model stability: longitudinal cohort study.

J Clin Epidemiol. 2021-10

[4]
Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study.

BMJ. 2017-5-23

[5]
Examining the impact of data quality and completeness of electronic health records on predictions of patients' risks of cardiovascular disease.

Int J Med Inform. 2019-11-11

[6]
Development and internal-external validation of statistical and machine learning models for breast cancer prognostication: cohort study.

BMJ. 2023-5-10

[7]
Predicting the risk of emergency admission with machine learning: Development and validation using linked electronic health records.

PLoS Med. 2018-11-20

[8]
The uncertainty with using risk prediction models for individual decision making: an exemplar cohort study examining the prediction of cardiovascular disease in English primary care.

BMC Med. 2019-7-17

[9]
Effect of competing mortality risks on predictive performance of the QRISK3 cardiovascular risk prediction tool in older people and those with comorbidity: external validation population cohort study.

Lancet Healthy Longev. 2021-6

[10]
Pre-existing and machine learning-based models for cardiovascular risk prediction.

Sci Rep. 2021-4-26

引用本文的文献

[1]
Machine learning prediction of clinical pregnancy in endometriosis patients following fresh IVF/ICSI-ET.

Eur J Med Res. 2025-9-3

[2]
Development and validation of an interpretable multi-task model to predict outcomes in patients with rhabdomyolysis: a multicenter retrospective cohort study.

EClinicalMedicine. 2025-8-21

[3]
An easily machine learning-based tool for preliminary risk assessment of microvascular invasion in hepatocellular carcinoma.

Surg Endosc. 2025-8-27

[4]
Development and validation of interpretable machine learning models for predicting AKI risk in patients treated with PD-1/PD-L1: a retrospective study.

BMC Med Inform Decis Mak. 2025-8-8

[5]
Identification and validation of an explainable machine learning model for vascular depression diagnosis in the older adults: a multicenter cohort study.

BMC Med. 2025-7-31

[6]
A comparison of modeling approaches for static and dynamic prediction of central line-associated bloodstream infections using electronic health records (part 2): random forest models.

Diagn Progn Res. 2025-7-21

[7]
Predicting 14-day readmission in middle-aged and elderly patients with pneumonia using emergency department data: a multicentre retrospective cohort study with a survival machine learning approach.

BMJ Open. 2025-6-17

[8]
Improving Lung Cancer Risk Prediction Using Machine Learning: A Comparative Analysis of Stacking Models and Traditional Approaches.

Cancers (Basel). 2025-5-13

[9]
Cardiovascular Risk Estimation in Colombia Using Artificial Intelligence Techniques.

Cardiol Res Pract. 2025-5-11

[10]
Identification and Validation of an Explainable Prediction Model of Sepsis in Patients With Intracerebral Hemorrhage: Multicenter Retrospective Study.

J Med Internet Res. 2025-4-28

本文引用的文献

[1]
Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness.

BMJ. 2020-3-20

[2]
Comparison of Machine Learning Methods With Traditional Models for Use of Administrative Claims With Electronic Medical Records to Predict Heart Failure Outcomes.

JAMA Netw Open. 2020-1-3

[3]
Potential Liability for Physicians Using Artificial Intelligence.

JAMA. 2019-11-12

[4]
Do population-level risk prediction models that use routinely collected health data reliably predict individual risks?

Sci Rep. 2019-8-2

[5]
The uncertainty with using risk prediction models for individual decision making: an exemplar cohort study examining the prediction of cardiovascular disease in English primary care.

BMC Med. 2019-7-17

[6]
Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants.

PLoS One. 2019-5-15

[7]
A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models.

J Clin Epidemiol. 2019-2-11

[8]
Evaluating Artificial Intelligence Applications in Clinical Settings.

JAMA Netw Open. 2018-9-7

[9]
Machine Learning Outperforms ACC / AHA CVD Risk Calculator in MESA.

J Am Heart Assoc. 2018-11-20

[10]
Questions for Artificial Intelligence in Health Care.

JAMA. 2019-1-1

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索