使用重复观测值的风险预测方法比较：在血液透析电子健康记录中的应用

A comparison of risk prediction methods using repeated observations: an application to electronic health records for hemodialysis.

作者信息

Goldstein Benjamin A, Pomann Gina Maria, Winkelmayer Wolfgang C, Pencina Michael J

机构信息

Biostatistics and Bioinformatics, Duke University, 2424 Erwin Road, Durham, 27705, NC, U.S.A.

Center for Predictive Medicine, Duke Clinical Research Institute, Durham, NC, 27705, U.S.A.

出版信息

Stat Med. 2017 Jul 30;36(17):2750-2763. doi: 10.1002/sim.7308. Epub 2017 May 2.

DOI:10.1002/sim.7308

PMID:28464332

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5494276/

Abstract

An increasingly important data source for the development of clinical risk prediction models is electronic health records (EHRs). One of their key advantages is that they contain data on many individuals collected over time. This allows one to incorporate more clinical information into a risk model. However, traditional methods for developing risk models are not well suited to these irregularly collected clinical covariates. In this paper, we compare a range of approaches for using longitudinal predictors in a clinical risk model. Using data from an EHR for patients undergoing hemodialysis, we incorporate five different clinical predictors into a risk model for patient mortality. We consider different approaches for treating the repeated measurements including use of summary statistics, machine learning methods, functional data analysis, and joint models. We follow up our empirical findings with a simulation study. Overall, our results suggest that simple approaches perform just as well, if not better, than more complex analytic approaches. These results have important implication for development of risk prediction models with EHRs. Copyright © 2017 John Wiley & Sons, Ltd.

摘要

电子健康记录（EHRs）是临床风险预测模型开发中一个日益重要的数据源。其关键优势之一在于它们包含了随时间收集的许多个体的数据。这使得人们能够将更多临床信息纳入风险模型。然而，传统的风险模型开发方法并不适合这些不规则收集的临床协变量。在本文中，我们比较了一系列在临床风险模型中使用纵向预测因子的方法。利用接受血液透析患者的电子健康记录数据，我们将五个不同的临床预测因子纳入患者死亡率风险模型。我们考虑了处理重复测量的不同方法，包括使用汇总统计、机器学习方法、功能数据分析和联合模型。我们通过模拟研究对实证结果进行了跟进。总体而言，我们的结果表明，简单方法即便不比更复杂的分析方法更好，至少也表现得一样好。这些结果对利用电子健康记录开发风险预测模型具有重要意义。版权所有© 2017约翰·威利父子有限公司。

相似文献

A comparison of risk prediction methods using repeated observations: an application to electronic health records for hemodialysis.

Stat Med. 2017 Jul 30;36(17):2750-2763. doi: 10.1002/sim.7308. Epub 2017 May 2.

Near-term prediction of sudden cardiac death in older hemodialysis patients using electronic health records.

Clin J Am Soc Nephrol. 2014 Jan;9(1):82-91. doi: 10.2215/CJN.03050313. Epub 2013 Oct 31.

Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review.

J Am Med Inform Assoc. 2017 Jan;24(1):198-208. doi: 10.1093/jamia/ocw042. Epub 2016 May 17.

A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data.

Stat Med. 2015 Sep 20;34(21):2941-57. doi: 10.1002/sim.6526. Epub 2015 May 18.

A semiparametric joint model for longitudinal and survival data with application to hemodialysis study.

Biometrics. 2009 Sep;65(3):737-45. doi: 10.1111/j.1541-0420.2008.01168.x. Epub 2009 Jan 23.

Clinical risk prediction with random forests for survival, longitudinal, and multivariate (RF-SLAM) data analysis.

BMC Med Res Methodol. 2019 Dec 31;20(1):1. doi: 10.1186/s12874-019-0863-0.

Performance of a Machine Learning Algorithm Using Electronic Health Record Data to Identify and Estimate Survival in a Longitudinal Cohort of Patients With Lung Cancer.

JAMA Netw Open. 2021 Jul 1;4(7):e2114723. doi: 10.1001/jamanetworkopen.2021.14723.

Detecting clinically meaningful biomarkers with repeated measurements: An illustration with electronic health records.

Biometrics. 2015 Jun;71(2):478-86. doi: 10.1111/biom.12283. Epub 2015 Feb 4.

Accurate Prediction of Coronary Heart Disease for Patients With Hypertension From Electronic Health Records With Big Data and Machine-Learning Methods: Model Development and Performance Evaluation.

JMIR Med Inform. 2020 Jul 6;8(7):e17257. doi: 10.2196/17257.

Classifying individuals based on a densely captured sequence of vital signs: An example using repeated blood pressure measurements during hemodialysis treatment.

J Biomed Inform. 2015 Oct;57:219-24. doi: 10.1016/j.jbi.2015.08.010. Epub 2015 Aug 13.

引用本文的文献

Designing an Implementable Clinical Prediction Model for Near-Term Mortality and Long-Term Survival in Patients on Maintenance Hemodialysis.

Am J Kidney Dis. 2024 Jul;84(1):73-82. doi: 10.1053/j.ajkd.2023.12.013. Epub 2024 Feb 21.

Developing Clinical Prediction Models Using Primary Care Electronic Health Record Data: The Impact of Data Preparation Choices on Model Performance.

Front Epidemiol. 2022 Jun 2;2:871630. doi: 10.3389/fepid.2022.871630. eCollection 2022.

Young Infants Clinical Signs Study 8-sign Algorithm for Identification of Sick Infants Adapted for Routine Home Visits: A Systematic Review and Critical Appraisal of its Measurement Properties.

Glob Pediatr Health. 2024 Jan 25;11:2333794X231219598. doi: 10.1177/2333794X231219598. eCollection 2024.

Improving cardiovascular risk prediction through machine learning modelling of irregularly repeated electronic health records.

Eur Heart J Digit Health. 2023 Oct 17;5(1):30-40. doi: 10.1093/ehjdh/ztad058. eCollection 2024 Jan.

Model for Predicting Complications of Hemodialysis Patients Using Data From the Internet of Medical Things and Electronic Medical Records.

IEEE J Transl Eng Health Med. 2023 Jan 5;11:375-383. doi: 10.1109/JTEHM.2023.3234207. eCollection 2023.

Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review.

J Nephrol. 2023 May;36(4):1101-1117. doi: 10.1007/s40620-023-01573-4. Epub 2023 Feb 14.

Multivariate longitudinal data for survival analysis of cardiovascular event prediction in young adults: insights from a comparative explainable study.

BMC Med Res Methodol. 2023 Jan 25;23(1):23. doi: 10.1186/s12874-023-01845-4.

Dynamic predictions from longitudinal CD4 count measures and time to death of HIV/AIDS patients using a Bayesian joint model.

Sci Afr. 2023 Mar;19:e01519. doi: 10.1016/j.sciaf.2022.e01519. Epub 2023 Jan 2.

Incremental value of risk factor variability for cardiovascular risk prediction in individuals with type 2 diabetes: results from UK primary care electronic health records.

Int J Epidemiol. 2022 Dec 13;51(6):1813-1823. doi: 10.1093/ije/dyac140.

Translating Data Analytics Into Improved Spine Surgery Outcomes: A Roadmap for Biomedical Informatics Research in 2021.

Global Spine J. 2022 Jun;12(5):952-963. doi: 10.1177/21925682211008424. Epub 2021 May 11.

本文引用的文献

A LAG FUNCTIONAL LINEAR MODEL FOR PREDICTION OF MAGNETIZATION TRANSFER RATIO IN MULTIPLE SCLEROSIS LESIONS.

Ann Appl Stat. 2016 Dec;10(4):2325-2348. doi: 10.1214/16-aoas981. Epub 2017 Jan 5.

REGULARIZED BRAIN READING WITH SHRINKAGE AND SMOOTHING.

Ann Appl Stat. 2015 Dec;9(4):1997-2022. doi: 10.1214/15-aoas837. Epub 2016 Jan 28.

Controlling for Informed Presence Bias Due to the Number of Health Encounters in an Electronic Health Record.

Am J Epidemiol. 2016 Dec 1;184(11):847-855. doi: 10.1093/aje/kww112. Epub 2016 Nov 16.

The use of repeated blood pressure measures for cardiovascular risk prediction: a comparison of statistical models in the ARIC study.

Stat Med. 2017 Dec 10;36(28):4514-4528. doi: 10.1002/sim.7144. Epub 2016 Oct 11.

Predicting mortality over different time horizons: which data elements are needed?

J Am Med Inform Assoc. 2017 Jan;24(1):176-181. doi: 10.1093/jamia/ocw057. Epub 2016 Jun 29.

Opportunities and challenges in developing risk prediction models with electronic health records data: a systematic review.

J Am Med Inform Assoc. 2017 Jan;24(1):198-208. doi: 10.1093/jamia/ocw042. Epub 2016 May 17.

Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.

J Stat Softw. 2011 Mar;39(5):1-13. doi: 10.18637/jss.v039.i05.

US Renal Data System 2015 Annual Data Report: Epidemiology of Kidney Disease in the United States.

Am J Kidney Dis. 2016 Mar;67(3 Suppl 1):Svii, S1-305. doi: 10.1053/j.ajkd.2015.12.014.

Interaction Models for Functional Regression.

Comput Stat Data Anal. 2016 Feb 1;94:317-329. doi: 10.1016/j.csda.2015.08.020.

Cox Regression Models with Functional Covariates for Survival Data.

Stat Modelling. 2015 Jun 1;15(3):256-278. doi: 10.1177/1471082X14565526. Epub 2015 Jan 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用重复观测值的风险预测方法比较：在血液透析电子健康记录中的应用

A comparison of risk prediction methods using repeated observations: an application to electronic health records for hemodialysis.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献