Division of HIV/AIDS Prevention, National Center for HIV, Viral Hepatitis, STD and TB Prevention, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA.
Stat Med. 2010 Jul 10;29(15):1647-59. doi: 10.1002/sim.3943.
Some research studies in the medical literature use multiple stepwise variable selection (SVS) algorithms to build multivariable models. The purpose of this study is to determine whether the use of multiple SVS algorithms in tandem (stepwise agreement) is a valid variable selection procedure. Computer simulations were developed to address stepwise agreement. Three popular SVS algorithms were tested (backward elimination, forward selection, and stepwise) on three statistical methods (linear, logistic, and Cox proportional hazards regression). Other simulation parameters explored were the sample size, number of predictors considered, degree of correlation between pairs of predictors, p-value-based entrance and exit criteria, predictor type (normally distributed or binary), and differences between stepwise agreement between any two or all three algorithms. Among stepwise methods, the rate of agreement, agreement on a model including only those predictors truly associated with the outcome, and agreement on a model containing the predictors truly associated with the outcome were measured. These rates were dependent on all simulation parameters. Mostly, the SVS algorithms agreed on a final model, but rarely on a model with only the true predictors. Sample size and candidate predictor pool size are the most influential simulation conditions. To conclude, stepwise agreement is often a poor strategy that gives misleading results and researchers should avoid using multiple SVS algorithms to build multivariable models. More research on the relationship between sample size and variable selection is needed.
一些医学文献中的研究使用多种逐步变量选择(SVS)算法来构建多变量模型。本研究旨在确定在串联(逐步一致)中使用多种 SVS 算法是否是一种有效的变量选择程序。开发了计算机模拟来解决逐步一致的问题。在三种统计方法(线性、逻辑和 Cox 比例风险回归)上测试了三种流行的 SVS 算法(逐步向后消除、逐步向前选择和逐步)。探索的其他模拟参数包括样本量、考虑的预测器数量、预测器之间相关性的程度、基于 p 值的进入和退出标准、预测器类型(正态分布或二进制)以及任何两个或所有三个算法之间的逐步一致性差异。在逐步方法中,衡量了一致率、仅包括与结果真正相关的预测器的模型的一致性以及包含与结果真正相关的预测器的模型的一致性。这些比率取决于所有模拟参数。大多数情况下,SVS 算法在最终模型上达成一致,但很少在仅包含真正预测器的模型上达成一致。样本量和候选预测器池大小是最具影响力的模拟条件。总之,逐步一致往往是一种糟糕的策略,会产生误导性的结果,研究人员应该避免使用多种 SVS 算法来构建多变量模型。需要更多关于样本量和变量选择之间关系的研究。