Booth Sarah, Riley Richard D, Ensor Joie, Lambert Paul C, Rutherford Mark J
Biostatistics Research Group, Department of Health Sciences, University of Leicester, Leicester, UK.
Centre for Prognosis Research, Research Institute for Primary Care and Health Sciences, Keele University, Keele, UK.
Int J Epidemiol. 2020 Aug 1;49(4):1316-1325. doi: 10.1093/ije/dyaa030.
Prognostic models are typically developed in studies covering long time periods. However, if more recent years have seen improvements in survival, then using the full dataset may lead to out-of-date survival predictions. Period analysis addresses this by developing the model in a subset of the data from a recent time window, but results in a reduction of sample size.
We propose a new approach, called temporal recalibration, to combine the advantages of period analysis and full cohort analysis. This approach develops a model in the entire dataset and then recalibrates the baseline survival using a period analysis sample. The approaches are demonstrated utilizing a prognostic model in colon cancer built using both Cox proportional hazards and flexible parametric survival models with data from 1996-2005 from the Surveillance, Epidemiology, and End Results (SEER) Program database. Comparison of model predictions with observed survival estimates were made for new patients subsequently diagnosed in 2006 and followed-up until 2015.
Period analysis and temporal recalibration provided more up-to-date survival predictions that more closely matched observed survival in subsequent data than the standard full cohort models. In addition, temporal recalibration provided more precise estimates of predictor effects.
Prognostic models are typically developed using a full cohort analysis that can result in out-of-date long-term survival estimates when survival has improved in recent years. Temporal recalibration is a simple method to address this, which can be used when developing and updating prognostic models to ensure survival predictions are more closely calibrated with the observed survival of individuals diagnosed subsequently.
预后模型通常是在涵盖较长时间段的研究中开发的。然而,如果近年来生存率有所提高,那么使用完整数据集可能会导致过时的生存预测。时期分析通过在最近时间窗口的数据子集中开发模型来解决这个问题,但会导致样本量减少。
我们提出了一种新方法,称为时间重新校准,以结合时期分析和全队列分析的优点。这种方法在整个数据集中开发一个模型,然后使用时期分析样本重新校准基线生存率。使用来自监测、流行病学和最终结果(SEER)计划数据库的1996 - 2005年数据,利用Cox比例风险模型和灵活参数生存模型构建结肠癌预后模型来演示这些方法。对2006年随后诊断并随访至2015年的新患者,将模型预测与观察到的生存估计进行比较。
与标准的全队列模型相比,时期分析和时间重新校准提供了更符合最新情况的生存预测,与后续数据中观察到的生存情况更紧密匹配。此外,时间重新校准提供了更精确的预测因素效应估计。
预后模型通常使用全队列分析来开发,当近年来生存率有所提高时,这可能会导致过时的长期生存估计。时间重新校准是解决这个问题的一种简单方法,在开发和更新预后模型时可以使用,以确保生存预测与随后诊断个体的观察到的生存情况更紧密地校准。