University of Wisconsin-Madison, Madison, USA.
Psychometrika. 2021 Mar;86(1):215-238. doi: 10.1007/s11336-021-09754-5. Epub 2021 Mar 15.
Issues of model selection have dominated the theoretical and applied statistical literature for decades. Model selection methods such as ridge regression, the lasso, and the elastic net have replaced ad hoc methods such as stepwise regression as a means of model selection. In the end, however, these methods lead to a single final model that is often taken to be the model considered ahead of time, thus ignoring the uncertainty inherent in the search for a final model. One method that has enjoyed a long history of theoretical developments and substantive applications, and that accounts directly for uncertainty in model selection, is Bayesian model averaging (BMA). BMA addresses the problem of model selection by not selecting a final model, but rather by averaging over a space of possible models that could have generated the data. The purpose of this paper is to provide a detailed and up-to-date review of BMA with a focus on its foundations in Bayesian decision theory and Bayesian predictive modeling. We consider the selection of parameter and model priors as well as methods for evaluating predictions based on BMA. We also consider important assumptions regarding BMA and extensions of model averaging methods to address these assumptions, particularly the method of Bayesian stacking. Simple empirical examples are provided and directions for future research relevant to psychometrics are discussed.
几十年来,模型选择问题一直主导着理论和应用统计学文献。岭回归、套索和弹性网络等模型选择方法已经取代了逐步回归等特定方法,成为模型选择的一种手段。然而,最终这些方法会导致一个单一的最终模型,这个模型通常被视为事先考虑的模型,从而忽略了在寻找最终模型时固有的不确定性。一种方法在理论发展和实质性应用方面有着悠久的历史,并且直接考虑了模型选择中的不确定性,这就是贝叶斯模型平均(BMA)。BMA 通过不选择最终模型,而是对可能生成数据的可能模型空间进行平均,来解决模型选择问题。本文的目的是提供一个详细和最新的 BMA 综述,重点是其在贝叶斯决策理论和贝叶斯预测建模中的基础。我们考虑了参数和模型先验的选择以及基于 BMA 的预测评估方法。我们还考虑了 BMA 的重要假设以及扩展模型平均方法以解决这些假设的方法,特别是贝叶斯堆叠方法。提供了简单的实证示例,并讨论了与心理测量学相关的未来研究方向。