Department of Psychology, National University of Singapore.
Department of Psychology, Sogang University.
Multivariate Behav Res. 2021 May-Jun;56(3):426-446. doi: 10.1080/00273171.2019.1694477. Epub 2019 Nov 28.
Extended Redundancy Analysis is a statistical tool for exploring the directional relationships of multiple sets of exogenous variables on a set of endogenous variables. This approach posits that the endogenous and exogenous variables are related via latent components, each of which is extracted from a set of exogenous variables, that account for the maximum variation of the endogenous variables. However, it is often difficult to distinguish between the true variables that form the latent components and the false variables that do not, especially when the association between the true variables and the exogenous set is weak. To overcome this limitation, we propose a Sparse Extended Redundancy Analysis via the Exclusive LASSO that performs variable selection while maintaining model specification. We validate the performance of the proposed approach in a simulation study. Finally, the empirical utility of this approach is demonstrated through two examples-one on a study of youth academic achievement and the other on a text analysis of newspaper data.
扩展冗余分析是一种统计工具,用于探索多组外生变量对一组内生变量的方向关系。这种方法假设内生变量和外生变量是通过潜在成分相关的,每个成分都是从一组外生变量中提取出来的,能够解释内生变量的最大变化。然而,通常很难区分形成潜在成分的真实变量和不形成潜在成分的虚假变量,尤其是当真实变量与外生变量之间的关联较弱时。为了克服这一限制,我们提出了一种通过独占 LASSO 进行稀疏扩展冗余分析的方法,该方法在保持模型规范的同时进行变量选择。我们在模拟研究中验证了所提出方法的性能。最后,通过两个实例(一个是关于青年学术成就的研究,另一个是关于报纸数据的文本分析)展示了这种方法的实际应用。