Kooperberg C, Clarkson D B
Department of Statistics, University of Washington, Seattle 98195-4322, USA.
Biometrics. 1997 Dec;53(4):1485-94.
In a recent paper, Kooperberg, Stone, and Truong (1995a) introduced hazard regression (HARE), in which linear splines and their tensor products are used to estimate the conditional log-hazard function based on possibly censored, positive response data and one or more covariates. Model selection is carried out in an adaptive fashion using maximum likelihood estimation of the unknown coefficients, Rao and Wald statistics to carry out stepwise addition and deletion of basis functions, and the Bayesian Information Criterion (BIC) to select the final model. In the present paper, the HARE methodology is extended to accommodate interval-censored data, time-dependent covariates, and cubic splines. The presence of interval-censored data means that the log-likelihood function may no longer be concave, presenting additional numerical challenges. The extended methodology is applied to a data set containing both interval-censoring and time-dependent covariates. The new software will be available in a future release of S-Plus.
在最近的一篇论文中,库珀伯格、斯通和特鲁昂(1995a)引入了风险回归(HARE),其中线性样条及其张量积用于基于可能被删失的正响应数据以及一个或多个协变量来估计条件对数风险函数。模型选择以自适应方式进行,使用未知系数的最大似然估计、用于逐步添加和删除基函数的 Rao 和 Wald 统计量,以及用于选择最终模型的贝叶斯信息准则(BIC)。在本文中,HARE 方法被扩展以适应区间删失数据、随时间变化的协变量和三次样条。区间删失数据的存在意味着对数似然函数可能不再是凹函数,这带来了额外的数值挑战。扩展后的方法应用于一个包含区间删失和随时间变化协变量的数据集。新软件将在 S-Plus 的未来版本中提供。