• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种在国际数据网络中进行大规模患者水平预测外部验证方法的可行性和评估:验证用于预测新诊断为心房颤动的女性患者中风的模型。

Feasibility and evaluation of a large-scale external validation approach for patient-level prediction in an international data network: validation of models predicting stroke in female patients newly diagnosed with atrial fibrillation.

机构信息

Janssen Research and Development, 1125 Trenton Harbourton Rd, Titusville, NJ, 08560, USA.

Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands.

出版信息

BMC Med Res Methodol. 2020 May 6;20(1):102. doi: 10.1186/s12874-020-00991-3.

DOI:10.1186/s12874-020-00991-3
PMID:32375693
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7201646/
Abstract

BACKGROUND

To demonstrate how the Observational Healthcare Data Science and Informatics (OHDSI) collaborative network and standardization can be utilized to scale-up external validation of patient-level prediction models by enabling validation across a large number of heterogeneous observational healthcare datasets.

METHODS

Five previously published prognostic models (ATRIA, CHADS, CHADSVASC, Q-Stroke and Framingham) that predict future risk of stroke in patients with atrial fibrillation were replicated using the OHDSI frameworks. A network study was run that enabled the five models to be externally validated across nine observational healthcare datasets spanning three countries and five independent sites.

RESULTS

The five existing models were able to be integrated into the OHDSI framework for patient-level prediction and they obtained mean c-statistics ranging between 0.57-0.63 across the 6 databases with sufficient data to predict stroke within 1 year of initial atrial fibrillation diagnosis for females with atrial fibrillation. This was comparable with existing validation studies. The validation network study was run across nine datasets within 60 days once the models were replicated. An R package for the study was published at https://github.com/OHDSI/StudyProtocolSandbox/tree/master/ExistingStrokeRiskExternalValidation.

CONCLUSION

This study demonstrates the ability to scale up external validation of patient-level prediction models using a collaboration of researchers and a data standardization that enable models to be readily shared across data sites. External validation is necessary to understand the transportability or reproducibility of a prediction model, but without collaborative approaches it can take three or more years for a model to be validated by one independent researcher. In this paper we show it is possible to both scale-up and speed-up external validation by showing how validation can be done across multiple databases in less than 2 months. We recommend that researchers developing new prediction models use the OHDSI network to externally validate their models.

摘要

背景

为了展示 Observational Healthcare Data Science and Informatics(OHDSI)协作网络和标准化如何被利用来扩大患者水平预测模型的外部验证规模,使验证能够跨越大量异构的观察性医疗保健数据集。

方法

使用 OHDSI 框架复制了五个先前发表的预测模型(ATRIA、CHADS、CHADSVASC、Q-Stroke 和 Framingham),这些模型预测心房颤动患者未来中风风险。进行了一项网络研究,使五个模型能够在跨越三个国家和五个独立站点的九个观察性医疗保健数据集上进行外部验证。

结果

五个现有模型能够集成到 OHDSI 框架中进行患者水平预测,并且在六个数据库中,它们在预测女性心房颤动患者 1 年内初始心房颤动诊断后中风风险方面获得了 0.57-0.63 的平均 c 统计量。这与现有的验证研究相当。一旦模型被复制,在 60 天内就可以在九个数据集上运行验证网络研究。该研究的 R 包已在 https://github.com/OHDSI/StudyProtocolSandbox/tree/master/ExistingStrokeRiskExternalValidation 上发布。

结论

本研究展示了使用研究人员合作和数据标准化来扩大患者水平预测模型的外部验证能力,使模型能够在数据站点之间轻松共享。外部验证对于理解预测模型的可转移性或可再现性是必要的,但如果没有合作方法,一个独立研究人员可能需要三年或更长时间才能验证一个模型。在本文中,我们通过展示如何在不到 2 个月的时间内在多个数据库中进行验证,展示了如何扩大和加速外部验证,证明了这一点。我们建议开发新预测模型的研究人员使用 OHDSI 网络来对其模型进行外部验证。

相似文献

1
Feasibility and evaluation of a large-scale external validation approach for patient-level prediction in an international data network: validation of models predicting stroke in female patients newly diagnosed with atrial fibrillation.一种在国际数据网络中进行大规模患者水平预测外部验证方法的可行性和评估:验证用于预测新诊断为心房颤动的女性患者中风的模型。
BMC Med Res Methodol. 2020 May 6;20(1):102. doi: 10.1186/s12874-020-00991-3.
2
Development and validation of a prognostic model predicting symptomatic hemorrhagic transformation in acute ischemic stroke at scale in the OHDSI network.在 OHDSI 网络中大规模开发和验证用于预测急性缺血性脑卒中症状性出血转化的预后模型。
PLoS One. 2020 Jan 7;15(1):e0226718. doi: 10.1371/journal.pone.0226718. eCollection 2020.
3
Implementation of the COVID-19 Vulnerability Index Across an International Network of Health Care Data Sets: Collaborative External Validation Study.在国际医疗保健数据集网络中实施COVID-19脆弱性指数:协作外部验证研究。
JMIR Med Inform. 2021 Apr 5;9(4):e21547. doi: 10.2196/21547.
4
A new risk scheme to predict ischemic stroke and other thromboembolism in atrial fibrillation: the ATRIA study stroke risk score.一种新的风险预测方案,用于预测心房颤动中的缺血性卒中及其他血栓栓塞事件:ATRIA 研究卒中风险评分。
J Am Heart Assoc. 2013 Jun 21;2(3):e000250. doi: 10.1161/JAHA.113.000250.
5
A standardized analytics pipeline for reliable and rapid development and validation of prediction models using observational health data.使用观察性健康数据进行可靠且快速的预测模型开发和验证的标准化分析管道。
Comput Methods Programs Biomed. 2021 Nov;211:106394. doi: 10.1016/j.cmpb.2021.106394. Epub 2021 Sep 6.
6
Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources.利用迭代成对外部验证来情境化预测模型性能:一个预测糖尿病患者 1 年心力衰竭风险的案例,涉及五个数据源。
Drug Saf. 2022 May;45(5):563-570. doi: 10.1007/s40264-022-01161-8. Epub 2022 May 17.
7
Validation and development of models using clinical, biochemical and ultrasound markers for predicting pre-eclampsia: an individual participant data meta-analysis.利用临床、生化和超声标志物预测子痫前期的模型的验证和建立:一项个体参与者数据荟萃分析。
Health Technol Assess. 2020 Dec;24(72):1-252. doi: 10.3310/hta24720.
8
Augmented intelligence decision tool for stroke prediction combines factors from CHA DS -VASc and the intermountain risk score for patients with atrial fibrillation.增强型人工智能决策工具用于预测中风,结合了 CHA2DS2-VASc 和房颤患者的山间风险评分的因素。
J Cardiovasc Electrophysiol. 2019 Sep;30(9):1452-1461. doi: 10.1111/jce.13999. Epub 2019 Jun 25.
9
Novel composite score to predict atrial Fibrillation in acute stroke patients: AF predicting score in acute stroke.预测急性卒中患者房颤的新型综合评分:急性卒中房颤预测评分
Int J Cardiol. 2016 Apr 15;209:184-9. doi: 10.1016/j.ijcard.2016.02.002. Epub 2016 Feb 3.
10
Towards global model generalizability: independent cross-site feature evaluation for patient-level risk prediction models using the OHDSI network.迈向全球模型通用性:使用 OHDSI 网络进行患者水平风险预测模型的独立跨站点特征评估。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1051-1061. doi: 10.1093/jamia/ocae028.

引用本文的文献

1
Advancing Real-World Evidence Through a Federated Health Data Network (EHDEN): Descriptive Study.通过联合健康数据网络(EHDEN)推进真实世界证据:描述性研究。
J Med Internet Res. 2025 Aug 7;27:e74119. doi: 10.2196/74119.
2
Implementation and Updating of Clinical Prediction Models: A Systematic Review.临床预测模型的实施与更新:一项系统综述
Mayo Clin Proc Digit Health. 2025 May 23;3(3):100228. doi: 10.1016/j.mcpdig.2025.100228. eCollection 2025 Sep.
3
Extensive benchmarking of a method that estimates external model performance from limited statistical characteristics.一种从有限统计特征估计外部模型性能的方法的广泛基准测试。
NPJ Digit Med. 2025 Jan 27;8(1):59. doi: 10.1038/s41746-024-01414-z.
4
Finding a constrained number of predictor phenotypes for multiple outcome prediction.为多结果预测寻找数量受限的预测表型。
BMJ Health Care Inform. 2025 Jan 16;32(1):e101227. doi: 10.1136/bmjhci-2024-101227.
5
Fair prediction of 2-year stroke risk in patients with atrial fibrillation.心房颤动患者2年中风风险的准确预测。
J Am Med Inform Assoc. 2024 Dec 1;31(12):2820-2828. doi: 10.1093/jamia/ocae170.
6
Inter hospital external validation of interpretable machine learning based triage score for the emergency department using common data model.基于常见数据模型的可解释机器学习分诊评分在急诊科的医院间外部验证。
Sci Rep. 2024 Mar 20;14(1):6666. doi: 10.1038/s41598-024-54364-7.
7
Mapping the Oncological Basis Dataset to the Standardized Vocabularies of a Common Data Model: A Feasibility Study.将肿瘤学基础数据集映射到通用数据模型的标准化词汇表:一项可行性研究。
Cancers (Basel). 2023 Aug 11;15(16):4059. doi: 10.3390/cancers15164059.
8
Machine Learning and Real-World Data to Predict Lung Cancer Risk in Routine Care.机器学习和真实世界数据在常规护理中预测肺癌风险。
Cancer Epidemiol Biomarkers Prev. 2023 Mar 6;32(3):337-343. doi: 10.1158/1055-9965.EPI-22-0873.
9
External validation of existing dementia prediction models on observational health data.基于观察性健康数据对现有痴呆症预测模型进行外部验证。
BMC Med Res Methodol. 2022 Dec 5;22(1):311. doi: 10.1186/s12874-022-01793-5.
10
Generic medical concept embedding and time decay for diverse patient outcome prediction tasks.用于多种患者预后预测任务的通用医学概念嵌入和时间衰减
iScience. 2022 Aug 4;25(9):104880. doi: 10.1016/j.isci.2022.104880. eCollection 2022 Sep 16.

本文引用的文献

1
Predictive analytics in health care: how can we know it works?医疗保健中的预测分析:我们如何知道它是否有效?
J Am Med Inform Assoc. 2019 Dec 1;26(12):1651-1654. doi: 10.1093/jamia/ocz130.
2
Design and implementation of a standardized framework to generate and evaluate patient-level prediction models using observational healthcare data.利用观察性医疗保健数据生成和评估患者水平预测模型的标准化框架的设计与实现。
J Am Med Inform Assoc. 2018 Aug 1;25(8):969-975. doi: 10.1093/jamia/ocy032.
3
Primary prevention of cardiovascular disease: A review of contemporary guidance and literature.心血管疾病的一级预防:当代指南与文献综述
JRSM Cardiovasc Dis. 2017 Jan 1;6:2048004016687211. doi: 10.1177/2048004016687211. eCollection 2017 Jan-Dec.
4
Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.预测未来——大数据、机器学习与临床医学。
N Engl J Med. 2016 Sep 29;375(13):1216-9. doi: 10.1056/NEJMp1606181.
5
Comparison of the ATRIA, CHADS2, and CHA2DS2-VASc stroke risk scores in predicting ischaemic stroke in a large Swedish cohort of patients with atrial fibrillation.比较 ATRIA、CHADS2 和 CHA2DS2-VASc 卒中风险评分在预测大型瑞典房颤患者缺血性卒中中的作用。
Eur Heart J. 2016 Nov 7;37(42):3203-3210. doi: 10.1093/eurheartj/ehw077. Epub 2016 Mar 3.
6
Comparative Performance of ATRIA, CHADS2, and CHA2DS2-VASc Risk Scores Predicting Stroke in Patients With Atrial Fibrillation: Results From a National Primary Care Database.ATRIA、CHADS2 和 CHA2DS2-VASc 风险评分在预测心房颤动患者中风中的比较表现:来自全国初级保健数据库的结果。
J Am Coll Cardiol. 2015 Oct 27;66(17):1851-9. doi: 10.1016/j.jacc.2015.08.033.
7
Comparisons of CHADS2 and CHA2DS2-VASc scores for stroke risk stratification in atrial fibrillation: Which scoring system should be used for Asians?用于心房颤动中风风险分层的CHADS2和CHA2DS2-VASc评分比较:亚洲人应使用哪种评分系统?
Heart Rhythm. 2016 Jan;13(1):46-53. doi: 10.1016/j.hrthm.2015.08.017. Epub 2015 Aug 12.
8
Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.观察性健康数据科学与信息学(OHDSI):观察性研究人员的机遇。
Stud Health Technol Inform. 2015;216:574-8.
9
The Stroke Riskometer(TM) App: validation of a data collection tool and stroke risk predictor.中风风险评估仪(TM)应用程序:一种数据收集工具和中风风险预测器的验证
Int J Stroke. 2015 Feb;10(2):231-44. doi: 10.1111/ijs.12411. Epub 2014 Dec 10.
10
External validation of new risk prediction models is infrequent and reveals worse prognostic discrimination.新的风险预测模型的外部验证很少,且显示出较差的预后判别能力。
J Clin Epidemiol. 2015 Jan;68(1):25-34. doi: 10.1016/j.jclinepi.2014.09.007. Epub 2014 Oct 23.