Flandre Philippe, Deutsch Reena, O'Quigley John
Sorbonne Universités, UPMC Univ Paris 06, INSERM, Institut Pierre Louis d'épidémiologie et de Santé Publique (IPLESP UMR-S 1136), Paris, F75013, France.
Department of Psychiatry, University of California at San Diego, La Jolla, 92093, CA, U.S.A.
Stat Med. 2017 Sep 10;36(20):3171-3180. doi: 10.1002/sim.7342. Epub 2017 Jun 7.
One aspect of an analysis of survival data based on the proportional hazards model that has been receiving increasing attention is that of the predictive ability or explained variation of the model. A number of contending measures have been suggested, including one measure, R (β), which has been proposed given its several desirable properties, including its capacity to accommodate time-dependent covariates, a major feature of the model and one that gives rise to great generality. A thorough study of the properties of available measures, including the aforementioned measure, has been carried out recently. In that work, the authors used bootstrap techniques, particularly complex in the setting of censored data, in order to obtain estimates of precision. The motivation of this work is to provide analytical expressions of precision, in particular confidence interval estimates for R (β). We use Taylor series approximations with and without local linearizing transforms. We also consider a very simple expression based on the Fisher's transformation. This latter approach has two great advantages. It is very easy and quick to calculate, and secondly, it can be obtained for any of the methods given in the recent review. A large simulation study is carried out to investigate the properties of the different methods. Finally, three well-known datasets in breast cancer, lymphoma and lung cancer research are given as illustrations. Copyright © 2017 John Wiley & Sons, Ltd.
基于比例风险模型的生存数据分析中,一个受到越来越多关注的方面是模型的预测能力或解释变异。已经提出了许多相互竞争的度量方法,其中包括一种度量方法R(β),鉴于它具有若干理想特性而被提出,这些特性包括它能够处理随时间变化的协变量,这是该模型的一个主要特征,也是产生很大通用性的一个特征。最近对现有度量方法的特性,包括上述度量方法,进行了全面研究。在该研究中,作者使用了自抽样技术,在删失数据的情况下这种技术尤其复杂,以便获得精度估计。这项工作的动机是提供精度的解析表达式,特别是R(β)的置信区间估计。我们使用带和不带局部线性化变换的泰勒级数近似。我们还考虑基于费希尔变换的一个非常简单的表达式。后一种方法有两个很大的优点。它计算起来非常容易和快捷,其次,对于最近综述中给出的任何方法都可以得到这种表达式。进行了一项大型模拟研究来考察不同方法的特性。最后,给出了乳腺癌、淋巴瘤和肺癌研究中的三个著名数据集作为例证。版权所有© 2017约翰威立父子有限公司。