Kundu Suprateek, Mallick Bani K, Baladandayuthapan Veera
Department of Biostatistics & Bioinformatics, Emory University, 1518 Clifton Road, Atlanta, Georgia 30322, U.S.A.
Department of Statistics, Texas A&M University, 3143 TAMU, College Station, Texas 77843-3143, U.S.A.
Bayesian Anal. 2019 Jun;14(2):449-476. doi: 10.1214/17-ba1086. Epub 2018 Jul 11.
There has been an intense development in the Bayesian graphical model literature over the past decade; however, most of the existing methods are restricted to moderate dimensions. We propose a novel graphical model selection approach for large dimensional settings where the dimension increases with the sample size, by decoupling model fitting and covariance selection. First, a full model based on a complete graph is fit under a novel class of mixtures of inverse-Wishart priors, which induce shrinkage on the precision matrix under an equivalence with Cholesky-based regularization, while enabling conjugate updates. Subsequently, a post-fitting model selection step uses penalized joint credible regions to perform model selection. This allows our methods to be computationally feasible for large dimensional settings using a combination of straightforward Gibbs samplers and efficient post-fitting inferences. Theoretical guarantees in terms of selection consistency are also established. Simulations show that the proposed approach compares favorably with competing methods, both in terms of accuracy metrics and computation times. We apply this approach to a cancer genomics data example.
在过去十年中,贝叶斯图形模型文献有了深入发展;然而,现有的大多数方法都局限于中等维度。我们提出了一种新颖的图形模型选择方法,用于高维情形,即维度随样本量增加的情况,通过将模型拟合与协方差选择解耦来实现。首先,基于一个完全图的全模型在一类新颖的逆 Wishart 先验混合下进行拟合,这种先验在与基于 Cholesky 的正则化等价的情况下,会在精度矩阵上产生收缩,同时允许共轭更新。随后,一个拟合后模型选择步骤使用惩罚联合可信区域来进行模型选择。这使得我们的方法通过结合简单的 Gibbs 采样器和高效的拟合后推断,在高维情形下计算上可行。还建立了关于选择一致性的理论保证。模拟表明所提出的方法在准确性指标和计算时间方面都优于竞争方法。我们将此方法应用于一个癌症基因组学数据示例。