Andriamiarana Vivato V, Kilian Pascal, Brandt Holger, Kelava Augustin
Methods Center, Eberhard Karls University of Tübingen, Haußerstr. 11, 72076, Tübingen, Germany.
Behav Res Methods. 2025 Jan 22;57(2):71. doi: 10.3758/s13428-024-02589-9.
Due to the increased availability of intensive longitudinal data, researchers have been able to specify increasingly complex dynamic latent variable models. However, these models present challenges related to overfitting, hierarchical features, non-linearity, and sample size requirements. There are further limitations to be addressed regarding the finite sample performance of priors, including bias, accuracy, and type I error inflation. Bayesian estimation provides the flexibility to treat these issues simultaneously through the use of regularizing priors. In this paper, we aim to compare several Bayesian regularizing priors (ridge, Bayesian Lasso, adaptive spike-and-slab Lasso, and regularized horseshoe). To achieve this, we introduce a multilevel dynamic latent variable model. We then conduct two simulation studies and a prior sensitivity analysis using empirical data. The results show that the ridge prior is able to provide sparse estimation while avoiding overshrinkage of relevant signals, in comparison to other Bayesian regularization priors. In addition, we find that the Lasso and heavy-tailed regularizing priors do not perform well compared to light-tailed priors for the logistic model. In the context of multilevel dynamic latent variable modeling, it is often attractive to diversify the choice of priors. However, we instead suggest prioritizing the choice of ridge priors without extreme shrinkage, which we show can handle the trade-off between informativeness and generality, compared to other priors with high concentration around zero and/or heavy tails.
由于密集纵向数据的可得性增加,研究人员能够指定日益复杂的动态潜变量模型。然而,这些模型存在与过度拟合、层次特征、非线性和样本量要求相关的挑战。关于先验的有限样本性能,还存在需要解决的进一步限制,包括偏差、准确性和I型错误膨胀。贝叶斯估计提供了通过使用正则化先验同时处理这些问题的灵活性。在本文中,我们旨在比较几种贝叶斯正则化先验(岭、贝叶斯套索、自适应尖峰和平板套索以及正则化马蹄形)。为了实现这一目标,我们引入了一个多层次动态潜变量模型。然后,我们使用实证数据进行了两项模拟研究和一次先验敏感性分析。结果表明,与其他贝叶斯正则化先验相比,岭先验能够提供稀疏估计,同时避免相关信号的过度收缩。此外,我们发现对于逻辑模型,套索和重尾正则化先验与轻尾先验相比表现不佳。在多层次动态潜变量建模的背景下,多样化先验的选择通常很有吸引力。然而,我们建议优先选择没有极端收缩的岭先验,与其他在零附近具有高集中度和/或重尾的先验相比,我们表明它可以处理信息性和通用性之间的权衡。