Methodology of Educational Sciences Research Group, Faculty of Psychology and Educational Sciences, KU Leuven, Andreas Vesaliusstraat 2, Box 3762, 3000, Leuven, Belgium.
Behav Res Methods. 2013 Mar;45(1):1-15. doi: 10.3758/s13428-012-0238-5.
When analyzing data, researchers are often confronted with a model selection problem (e.g., determining the number of components/factors in principal components analysis [PCA]/factor analysis or identifying the most important predictors in a regression analysis). To tackle such a problem, researchers may apply some objective procedure, like parallel analysis in PCA/factor analysis or stepwise selection methods in regression analysis. A drawback of these procedures is that they can only be applied to the model selection problem at hand. An interesting alternative is the CHull model selection procedure, which was originally developed for multiway analysis (e.g., multimode partitioning). However, the key idea behind the CHull procedure--identifying a model that optimally balances model goodness of fit/misfit and model complexity--is quite generic. Therefore, the procedure may also be used when applying many other analysis techniques. The aim of this article is twofold. First, we demonstrate the wide applicability of the CHull method by showing how it can be used to solve various model selection problems in the context of PCA, reduced K-means, best-subset regression, and partial least squares regression. Moreover, a comparison of CHull with standard model selection methods for these problems is performed. Second, we present the CHULL software, which may be downloaded from http://ppw.kuleuven.be/okp/software/CHULL/, to assist the user in applying the CHull procedure.
在分析数据时,研究人员经常面临模型选择问题(例如,确定主成分分析 [PCA]/因子分析中的组件/因子数量或确定回归分析中最重要的预测因子)。为了解决此类问题,研究人员可能会应用一些客观的程序,如 PCA/因子分析中的平行分析或回归分析中的逐步选择方法。这些程序的一个缺点是它们只能应用于当前的模型选择问题。一个有趣的替代方案是 CHull 模型选择程序,它最初是为多向分析(例如,多模式分区)开发的。然而,CHull 程序背后的关键思想——确定一个在模型拟合/不拟合和模型复杂度之间最佳平衡的模型——是非常通用的。因此,当应用许多其他分析技术时,也可以使用该程序。本文的目的有两个。首先,我们通过展示如何在 PCA、简化 K-均值、最佳子集回归和偏最小二乘回归的上下文中使用 CHull 方法来解决各种模型选择问题,展示了 CHull 方法的广泛适用性。此外,还对这些问题的 CHull 与标准模型选择方法进行了比较。其次,我们介绍了 CHULL 软件,该软件可从 http://ppw.kuleuven.be/okp/software/CHULL/ 下载,以帮助用户应用 CHull 程序。