Heagerty P J, Zeger S L
Department of Biostatistics, University of Washington, Seattle 98195, USA.
Biometrics. 2000 Sep;56(3):719-32. doi: 10.1111/j.0006-341x.2000.00719.x.
We develop semiparametric estimation methods for a pair of regressions that characterize the first and second moments of clustered discrete survival times. In the first regression, we represent discrete survival times through univariate continuation indicators whose expectations are modeled using a generalized linear model. In the second regression, we model the marginal pairwise association of survival times using the Clayton-Oakes cross-product ratio (Clayton, 1978, Biometrika 65, 141-151; Oakes, 1989, Journal of the American Statistical Association 84, 487-493). These models have recently been proposed by Shih (1998, Biometrics 54, 1115-1128). We relate the discrete survival models to multivariate multinomial models presented in Heagerty and Zeger (1996, Journal of the American Statistical Society 91, 1024-1036) and derive a paired estimating equations procedure that is computationally feasible for moderate and large clusters. We extend the work of Guo and Lin (1994, Biometrics 50, 632-639) and Shih (1998) to allow covariance weighted estimating equations and investigate the impact of weighting in terms of asymptotic relative efficiency. We demonstrate that the multinomial structure must be acknowledged when adopting weighted estimating equations and show that a naive use of GEE methods can lead to inconsistent parameter estimates. Finally, we illustrate the proposed methodology by analyzing psychological testing data previously summarized by TenHave and Uttal (1994, Applied Statistics 43, 371-384) and Guo and Lin (1994).
我们针对一对回归模型开发了半参数估计方法,这些回归模型刻画了聚类离散生存时间的一阶矩和二阶矩。在第一个回归模型中,我们通过单变量延续指标来表示离散生存时间,其期望使用广义线性模型进行建模。在第二个回归模型中,我们使用克莱顿 - 奥克斯交叉乘积比(克莱顿,1978年,《生物统计学》65卷,141 - 151页;奥克斯,1989年,《美国统计协会杂志》84卷,487 - 493页)对生存时间的边际成对关联进行建模。这些模型最近由施(1998年,《生物统计学》54卷,1115 - 1128页)提出。我们将离散生存模型与希格蒂和泽格(1996年,《美国统计学会杂志》91卷,1024 - 1036页)中提出的多元多项模型相关联,并推导了一种成对估计方程程序,该程序对于中等规模和大规模聚类在计算上是可行的。我们扩展了郭和林(1994年,《生物统计学》50卷,632 - 639页)以及施(1998年)的工作,以允许协方差加权估计方程,并从渐近相对效率的角度研究加权的影响。我们证明,在采用加权估计方程时必须承认多项结构,并表明简单使用广义估计方程方法可能导致参数估计不一致。最后,我们通过分析先前由滕哈夫和乌塔尔(1994年,《应用统计学》43卷,371 - 384页)以及郭和林(1994年)总结的心理测试数据来说明所提出的方法。