O'Quigley John, Xu Ronghui, Stare Janez
Institut Curie, Paris.
Stat Med. 2005 Feb 15;24(3):479-89. doi: 10.1002/sim.1946.
A coefficient of explained randomness, analogous to explained variation but for non-linear models, was presented by Kent. The construct hinges upon the notion of Kullback-Leibler information gain. Kent and O'Quigley developed these ideas, obtaining simple, multiple and partial coefficients for the situation of proportional hazards regression. Their approach was based upon the idea of transforming a general proportional hazards model to a specific one of Weibull form. Xu and O'Quigley developed a more direct approach, more in harmony with the semi-parametric nature of the proportional hazards model thereby simplifying inference and allowing, for instance, the use of time dependent covariates. A potential drawback to the coefficient of Xu and O'Quigley is its interpretation as explained randomness in the covariate given time. An investigator might feel that the interpretation of the Kent and O'Quigley coefficient, as a proportion of explained randomness of time given the covariate, is preferable. One purpose of this note is to indicate that, under an independent censoring assumption, the two population coefficients coincide. Thus the simpler inferential setting for Xu and O'Quigley can also be applied to the coefficient of Kent and O'Quigley. Our second purpose is to point out that a sample-based coefficient in common use in the SAS statistical package can be interpreted as an estimate of explained randomness when there is no censoring. When there is censoring the SAS coefficient would not seem satisfactory in that its population counterpart depends on an independent censoring mechanism. However there is a quick fix and we argue in favour of its use.
肯特提出了一个解释随机性系数,类似于解释变异,但用于非线性模型。该结构基于库尔贝克-莱布勒信息增益的概念。肯特和奥奎格利发展了这些想法,得到了比例风险回归情况下的简单、多重和偏系数。他们的方法基于将一般比例风险模型转换为威布尔形式的特定模型的想法。徐和奥奎格利开发了一种更直接的方法,更符合比例风险模型的半参数性质,从而简化了推断,并允许例如使用时间相依协变量。徐和奥奎格利系数的一个潜在缺点是其在给定时间的协变量中作为解释随机性的解释。研究者可能会觉得,肯特和奥奎格利系数作为给定协变量时时间解释随机性的比例的解释更可取。本笔记的一个目的是指出,在独立删失假设下,这两个人口系数是一致的。因此,徐和奥奎格利更简单的推断设置也可以应用于肯特和奥奎格利系数。我们的第二个目的是指出,在没有删失的情况下,SAS统计软件包中常用的基于样本的系数可以解释为解释随机性的估计。当存在删失时,SAS系数似乎不太令人满意,因为其总体对应物依赖于独立删失机制。然而,有一个快速解决办法,我们主张使用它。