Bansal Aasthaa, Pepe Margaret Sullivan
Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.
Lifetime Data Anal. 2013 Apr;19(2):170-201. doi: 10.1007/s10985-012-9237-1. Epub 2013 Jan 29.
When an existing risk prediction model is not sufficiently predictive, additional variables are sought for inclusion in the model. This paper addresses study designs to evaluate the improvement in prediction performance that is gained by adding a new predictor to a risk prediction model. We consider studies that measure the new predictor in a case-control subset of the study cohort, a practice that is common in biomarker research. We ask if matching controls to cases in regards to baseline predictors improves efficiency. A variety of measures of prediction performance are studied. We find through simulation studies that matching improves the efficiency with which most measures are estimated, but can reduce efficiency for some. Efficiency gains are less when more controls per case are included in the study. A method that models the distribution of the new predictor in controls appears to improve estimation efficiency considerably.
当现有的风险预测模型预测能力不足时,会寻求纳入其他变量到模型中。本文探讨了研究设计,以评估通过向风险预测模型添加新预测变量所获得的预测性能提升。我们考虑在研究队列的病例对照子集中测量新预测变量的研究,这种做法在生物标志物研究中很常见。我们探讨了在基线预测变量方面将对照与病例进行匹配是否能提高效率。研究了多种预测性能的衡量指标。我们通过模拟研究发现,匹配提高了大多数指标估计的效率,但对某些指标可能会降低效率。当研究中每个病例纳入更多对照时,效率提升较小。一种对对照中新预测变量的分布进行建模的方法似乎能显著提高估计效率。