Can we develop real-world prognostic models using observational healthcare data? Large-scale experiment to investigate model sensitivity to database and phenotypes.

Author information

Reps Jenna M, Rijnbeek Peter R, Ryan Patrick B

Affiliations

Johnson & Johnson, Raritan, NJ, USA.

Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands.

Publication information

Diagn Progn Res. 2025 Apr 17;9(1):10. doi: 10.1186/s41512-025-00191-x.

DOI:10.1186/s41512-025-00191-x

PMID:40247385

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12004590/

Abstract

BACKGROUND

Large observational healthcare databases are frequently used to develop models to be implemented in real-world clinical practice populations. For example, these databases were used to develop COVID severity models that guided interventions such as who to prioritize vaccinating during the pandemic. However, the clinical setting and observational databases often differ in the types of patients (case mix), and it is a nontrivial process to identify patients with medical conditions (phenotyping) in these databases. In this study, we investigate how sensitive a model's performance is to the choice of development database, population, and outcome phenotype.

METHODS

We developed more than 450 logistic regression models for nine prediction tasks across seven databases, using a range of suitable population and outcome phenotypes. Performance stability within each task was calculated by applying each model to data created by permuting the database, population, or outcome phenotype. We investigated the stability of model performance (AUROC, scaled Brier score, and calibration-in-the-large) and of individual risk estimates.
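The three performance measures named above can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: the function names and the toy data are hypothetical, the scaled Brier score is taken as one minus the ratio of the Brier score to that of a model that always predicts the observed outcome rate, and calibration-in-the-large is taken as mean predicted risk minus observed outcome rate, which is one common definition and may differ from the one used in the paper.

```python
from typing import Sequence


def auroc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Chance that a random case gets a higher risk than a random non-case (ties count 0.5)."""
    cases = [p for y, p in zip(y_true, y_prob) if y == 1]
    non_cases = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not cases or not non_cases:
        raise ValueError("need at least one case and one non-case")
    wins = sum((c > n) + 0.5 * (c == n) for c in cases for n in non_cases)
    return wins / (len(cases) * len(non_cases))


def brier(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Mean squared difference between predicted risk and observed outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def scaled_brier(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """1 - Brier / reference, where the reference always predicts the outcome rate."""
    rate = sum(y_true) / len(y_true)
    reference = rate * (1 - rate)  # undefined (zero) if every outcome is identical
    return 1 - brier(y_true, y_prob) / reference


def calibration_in_the_large(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Assumed definition: mean predicted risk minus observed outcome rate."""
    return sum(y_prob) / len(y_prob) - sum(y_true) / len(y_true)


# Hypothetical toy data: observed outcomes and predicted risks for eight patients.
outcomes = [0, 0, 1, 0, 1, 1, 0, 1]
risks = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.2, 0.9]

print(f"AUROC: {auroc(outcomes, risks):.3f}")
print(f"Scaled Brier: {scaled_brier(outcomes, risks):.3f}")
print(f"Calibration-in-the-large: {calibration_in_the_large(outcomes, risks):.3f}")
```

Under these definitions, a calibration-in-the-large value near zero means the average predicted risk matches the observed outcome rate, and a scaled Brier score near zero means the model is no better than predicting the outcome rate for everyone.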

RESULTS

In general, changing the outcome definition or population phenotype had little impact on validation discrimination. However, validation discrimination was unstable when the database changed. Calibration and Brier score were unstable when the population, outcome definition, or database changed. This instability may be problematic if a model developed using observational data is implemented in a real-world setting.
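One reason discrimination and calibration can diverge is that AUROC depends only on how patients are ranked, whereas calibration depends on the actual risk values. The sketch below illustrates this with hypothetical toy risks: shifting every prediction by a constant on the log-odds scale (a stand-in for a change in baseline risk, not something the study did) leaves the patient ranking untouched while changing the mean predicted risk. The function names and the shift of -1.5 are assumptions chosen for illustration.

```python
import math
from typing import List, Sequence


def shift_log_odds(p: float, delta: float) -> float:
    """Move a risk estimate by `delta` on the log-odds scale (requires 0 < p < 1)."""
    logit = math.log(p / (1 - p)) + delta
    return 1 / (1 + math.exp(-logit))


def ranking(values: Sequence[float]) -> List[int]:
    """Patient indices ordered from lowest to highest predicted risk."""
    return sorted(range(len(values)), key=lambda i: values[i])


def mean_risk(values: Sequence[float]) -> float:
    return sum(values) / len(values)


# Hypothetical predicted risks for eight patients.
risks = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.2, 0.9]
shifted = [shift_log_odds(p, -1.5) for p in risks]

# Same ordering of patients, so any rank-based measure such as AUROC is unchanged.
print("Ranking unchanged:", ranking(shifted) == ranking(risks))

# The average predicted risk has moved, so calibration against the same outcomes differs.
print(f"Mean risk before: {mean_risk(risks):.3f}, after: {mean_risk(shifted):.3f}")
```

A validation that reports only AUROC would not detect such a shift, which is consistent with the recommendation to evaluate calibration and Brier score as well.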

CONCLUSIONS

These results highlight the importance of validating a model developed using observational data in the clinical setting prior to using it for decision-making. Calibration and Brier score should be evaluated to prevent miscalibrated risk estimates being used to aid clinical decisions.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb7a/12004590/25d100e8ec74/41512_2025_191_Fig1_HTML.jpg
