Department of Statistics, Inha University, Incheon, South Korea.
Department of Mathematical Sciences, University of Cincinnati, Cincinnati, Ohio.
Biometrics. 2021 Jun;77(2):391-400. doi: 10.1111/biom.13290. Epub 2020 May 16.
We consider Bayesian logistic regression models with group-structured covariates. In high-dimensional settings, it is often assumed that only a small portion of groups are significant, and thus, consistent group selection is of significant importance. While consistent frequentist group selection methods have been proposed, theoretical properties of Bayesian group selection methods for logistic regression models have not been investigated yet. In this paper, we consider a hierarchical group spike and slab prior for logistic regression models in high-dimensional settings. Under mild conditions, we establish strong group selection consistency of the induced posterior, which is the first theoretical result in the Bayesian literature. Through simulation studies, we demonstrate that the proposed method outperforms existing state-of-the-art methods in various settings. We further apply our method to a magnetic resonance imaging data set for predicting Parkinson's disease and show its benefits over other contenders.
我们考虑具有分组结构协变量的贝叶斯逻辑回归模型。在高维设置中,通常假设只有一小部分组是显著的,因此一致的分组选择具有重要意义。虽然已经提出了一致的频率派分组选择方法,但尚未研究贝叶斯逻辑回归模型分组选择方法的理论性质。在本文中,我们考虑了高维环境下逻辑回归模型的分层组尖峰和板条先验。在温和的条件下,我们建立了诱导后验的强分组选择一致性,这是贝叶斯文献中的第一个理论结果。通过模拟研究,我们证明了该方法在各种环境下都优于现有的最先进方法。我们进一步将我们的方法应用于磁共振成像数据集,以预测帕金森病,并展示其优于其他竞争者的优势。