Grand Mia Klinten, Putter Hein
Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, P.O. Box 9600, 2300 RC, Leiden, the Netherlands.
Stat Med. 2016 Mar 30;35(7):1178-92. doi: 10.1002/sim.6771. Epub 2015 Oct 26.
In multi-state models, the expected length of stay (ELOS) in a state is not a straightforward object to relate to covariates, and the traditional approach has instead been to construct regression models for the transition intensities and calculate ELOS from these. The disadvantage of this approach is that the effect of covariates on the intensities is not easily translated into the effect on ELOS, and it typically relies on the Markov assumption. We propose to use pseudo-observations to construct regression models for ELOS, thereby allowing a direct interpretation of covariate effects while at the same time avoiding the Markov assumption. For this approach, all we need is a non-parametric consistent estimator for ELOS. For every subject (and for every state of interest), a pseudo-observation is constructed, and they are then used as outcome variables in the regression model. We furthermore show how to construct longitudinal (pseudo-) data when combining the concept of pseudo-observations with landmarking. In doing so, covariates are allowed to be time-varying, and we can investigate potential time-varying effects of the covariates. The models can be fitted using generalized estimating equations, and dependence between observations on the same subject is handled by applying the sandwich estimator. The method is illustrated using data from the US Health and Retirement Study where the impact of socio-economic factors on ELOS in health and disability is explored. Finally, we investigate the performance of our approach under different degrees of left-truncation, non-Markovianity, and right-censoring by means of simulation.
在多状态模型中,某一状态下的预期住院时间(ELOS)并非一个能直接与协变量相关联的简单对象,传统方法是构建关于转移强度的回归模型,并据此计算ELOS。这种方法的缺点在于,协变量对强度的影响不易转化为对ELOS的影响,而且通常依赖于马尔可夫假设。我们建议使用伪观测值来构建ELOS的回归模型,从而在避免马尔可夫假设的同时,直接解释协变量的效应。对于这种方法,我们所需要的只是一个ELOS的非参数一致估计量。对于每个研究对象(以及每个感兴趣的状态),构建一个伪观测值,然后将它们用作回归模型中的结果变量。我们还展示了如何在将伪观测值的概念与地标法相结合时构建纵向(伪)数据。这样做时,允许协变量随时间变化,并且我们可以研究协变量潜在的时变效应。这些模型可以使用广义估计方程进行拟合,并且通过应用三明治估计量来处理同一研究对象观测值之间的相关性。我们使用美国健康与退休研究的数据说明了该方法,探讨了社会经济因素对健康和残疾状态下ELOS的影响。最后,我们通过模拟研究了我们的方法在不同程度的左截断、非马尔可夫性和右删失情况下的性能。