Sauerbrei W, Schumacher M
Institute of Medical Biometry and Informatics, University of Freiburg, Germany.
Stat Med. 1992 Dec;11(16):2093-109. doi: 10.1002/sim.4780111607.
A common problem in the statistical analysis of clinical studies is the selection of those variables in the framework of a regression model which might influence the outcome variable. Stepwise methods have been available for a long time, but as with many other possible strategies, there is a lot of criticism of their use. Investigations of the stability of a selected model are often called for, but usually are not carried out in a systematic way. Since analytical approaches are extremely difficult, data-dependent methods might be an useful alternative. Based on a bootstrap resampling procedure, Chen and George investigated the stability of a stepwise selection procedure in the framework of the Cox proportional hazard regression model. We extend their proposal and develop a bootstrap-model selection procedure, combining the bootstrap method with existing selection techniques such as stepwise methods. We illustrate the proposed strategy in the process of model building by using data from two cancer clinical trials featuring two different situations commonly arising in clinical research. In a brain tumour study the adjustment for covariates in an overall treatment comparison is of primary interest calling for the selection of even 'mild' effects. In a prostate cancer study we concentrate on the analysis of treatment-covariate interactions demanding that only 'strong' effects should be selected. Both variants of the strategy will be demonstrated analysing the clinical trials with a Cox model, but they can be applied in other types of regression with obvious and straightforward modifications.
临床研究统计分析中的一个常见问题是在回归模型框架内选择那些可能影响结果变量的变量。逐步方法已经存在很长时间了,但与许多其他可能的策略一样,对其使用存在很多批评。通常需要对所选模型的稳定性进行研究,但通常并未以系统的方式进行。由于分析方法极其困难,依赖数据的方法可能是一种有用的替代方法。基于自助重抽样程序,陈和乔治在Cox比例风险回归模型框架内研究了逐步选择程序的稳定性。我们扩展了他们的提议,并开发了一种自助模型选择程序,将自助方法与逐步方法等现有选择技术相结合。我们通过使用来自两项癌症临床试验的数据来说明所提出的策略在模型构建过程中的应用,这两项试验呈现了临床研究中常见的两种不同情况。在一项脑肿瘤研究中,在总体治疗比较中对协变量进行调整是主要关注点,这就需要选择即使是“轻微”的效应。在一项前列腺癌研究中,我们专注于治疗 - 协变量相互作用的分析,要求只选择“强”效应。该策略的两种变体都将通过使用Cox模型分析临床试验来展示,但经过明显且直接的修改后,它们也可应用于其他类型的回归分析。