White Ian R, Rapsomaniki Eleni
MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, Cambridge Biomedical Campus, Cambridge CB2 0SR, UK.
Farr Institute for Health Informatics Research, Department of Epidemiology and Public Health, University College London Medical School, 222 Euston Road, London WC1E 6BT, UK.
Biom J. 2015 Jul;57(4):592-613. doi: 10.1002/bimj.201400061. Epub 2014 Dec 20.
Discrimination statistics describe the ability of a survival model to assign higher risks to individuals who experience earlier events: examples are Harrell's C-index and Royston and Sauerbrei's D, which we call the D-index. Prognostic covariates whose distributions are controlled by the study design (e.g. age and sex) influence discrimination and can make it difficult to compare model discrimination between studies. Although covariate adjustment is a standard procedure for quantifying disease-risk factor associations, there are no covariate adjustment methods for discrimination statistics in censored survival data.
To develop extensions of the C-index and D-index that describe the prognostic ability of a model adjusted for one or more covariate(s).
We define a covariate-adjusted C-index and D-index for censored survival data, propose several estimators, and investigate their performance in simulation studies and in data from a large individual participant data meta-analysis, the Emerging Risk Factors Collaboration.
The proposed methods perform well in simulations. In the Emerging Risk Factors Collaboration data, the age-adjusted C-index and D-index were substantially smaller than unadjusted values. The study-specific standard deviation of baseline age was strongly associated with the unadjusted C-index and D-index but not significantly associated with the age-adjusted indices.
The proposed estimators improve meta-analysis comparisons, are easy to implement and give a more meaningful clinical interpretation.
区分统计描述了生存模型将更高风险分配给经历较早事件个体的能力:例如哈雷尔C指数以及罗伊斯顿和绍尔布雷的D指数(我们称之为D指数)。其分布受研究设计控制的预后协变量(如年龄和性别)会影响区分度,并且可能使不同研究之间的模型区分度难以比较。尽管协变量调整是量化疾病风险因素关联的标准程序,但对于删失生存数据中的区分统计量,尚无协变量调整方法。
开发C指数和D指数的扩展,以描述针对一个或多个协变量调整后的模型的预后能力。
我们为删失生存数据定义了协变量调整后的C指数和D指数,提出了几种估计方法,并在模拟研究以及来自大型个体参与者数据荟萃分析(新兴风险因素协作组)的数据中研究了它们的性能。
所提出的方法在模拟中表现良好。在新兴风险因素协作组的数据中,年龄调整后的C指数和D指数显著小于未调整的值。基线年龄的研究特定标准差与未调整的C指数和D指数密切相关,但与年龄调整后的指数无显著关联。
所提出的估计方法改善了荟萃分析比较,易于实施,并给出了更有意义的临床解释。