Bull Lucy M, Lunt Mark, Martin Glen P, Hyrich Kimme, Sergeant Jamie C
Centre for Epidemiology Versus Arthritis, Centre for Musculoskeletal Research, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK.
Centre for Biostatistics, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK.
Diagn Progn Res. 2020 Jul 9;4:9. doi: 10.1186/s41512-020-00078-z. eCollection 2020.
Clinical prediction models (CPMs) predict the risk of health outcomes for individual patients. The majority of existing CPMs only harness cross-sectional patient information. Incorporating repeated measurements, such as those stored in electronic health records, into CPMs may provide an opportunity to enhance their performance. However, the number and complexity of methodological approaches available could make it difficult for researchers to explore this opportunity. Our objective was to review the literature and summarise existing approaches for harnessing repeated measurements of predictor variables in CPMs, primarily to make this field more accessible for applied researchers.
MEDLINE, Embase and Web of Science were searched for articles reporting the development of a multivariable CPM for individual-level prediction of future binary or time-to-event outcomes and modelling repeated measurements of at least one predictor. Information was extracted on the following: the methodology used, its specific aim, reported advantages and limitations, and software available to apply the method.
The search revealed 217 relevant articles. Seven methodological frameworks were identified: time-dependent covariate modelling, generalised estimating equations, landmark analysis, two-stage modelling, joint-modelling, trajectory classification and machine learning. Each of these frameworks satisfies at least one of three aims: to better represent the predictor-outcome relationship over time, to infer a covariate value at a pre-specified time and to account for the effect of covariate change.
The applicability of identified methods depends on the motivation for including longitudinal information and the method's compatibility with the clinical context and available patient data, for both model development and risk estimation in practice.
临床预测模型(CPMs)可预测个体患者出现健康结局的风险。现有的大多数CPMs仅利用横断面患者信息。将重复测量数据(如存储在电子健康记录中的数据)纳入CPMs可能会提供提高其性能的机会。然而,可用方法的数量和复杂性可能使研究人员难以探索这一机会。我们的目标是回顾文献并总结在CPMs中利用预测变量重复测量的现有方法,主要是为了让应用研究人员更容易接触到这一领域。
检索MEDLINE、Embase和科学网,查找报告开发用于个体水平预测未来二元结局或事件发生时间结局的多变量CPM以及对至少一个预测变量进行重复测量建模的文章。提取以下信息:所使用的方法、其具体目标、报告的优点和局限性以及可用于应用该方法的软件。
检索发现217篇相关文章。确定了七个方法框架:时间依存协变量建模、广义估计方程、标志性分析、两阶段建模、联合建模、轨迹分类和机器学习。这些框架中的每一个都至少满足三个目标之一:更好地表示预测变量与结局随时间的关系、推断预先指定时间的协变量值以及考虑协变量变化的影响。
所确定方法的适用性取决于纳入纵向信息的动机以及该方法与临床背景和可用患者数据的兼容性,这对于实践中的模型开发和风险估计均适用。