Kim Sinae, Dahl David B, Vannucci Marina
Department of Biostatistics, University of Michigan, Ann Arbor, MI,
Bayesian Anal. 2009;4(4):707-732. doi: 10.1214/09-BA426.
We propose a Bayesian method for multiple hypothesis testing in random effects models that uses Dirichlet process (DP) priors for a nonparametric treatment of the random effects distribution. We consider a general model formulation which accommodates a variety of multiple treatment conditions. A key feature of our method is the use of a product of spiked distributions, i.e., mixtures of a point-mass and continuous distributions, as the centering distribution for the DP prior. Adopting these spiked centering priors readily accommodates sharp null hypotheses and allows for the estimation of the posterior probabilities of such hypotheses. Dirichlet process mixture models naturally borrow information across objects through model-based clustering while inference on single hypotheses averages over clustering uncertainty. We demonstrate via a simulation study that our method yields increased sensitivity in multiple hypothesis testing and produces a lower proportion of false discoveries than other competitive methods. While our modeling framework is general, here we present an application in the context of gene expression from microarray experiments. In our application, the modeling framework allows simultaneous inference on the parameters governing differential expression and inference on the clustering of genes. We use experimental data on the transcriptional response to oxidative stress in mouse heart muscle and compare the results from our procedure with existing nonparametric Bayesian methods that provide only a ranking of the genes by their evidence for differential expression.
我们提出了一种用于随机效应模型中多重假设检验的贝叶斯方法,该方法使用狄利克雷过程(DP)先验对随机效应分布进行非参数处理。我们考虑一种通用的模型公式,它适用于各种多重处理条件。我们方法的一个关键特征是使用尖峰分布的乘积,即点质量分布和连续分布的混合,作为DP先验的中心分布。采用这些尖峰中心先验很容易适应尖锐的原假设,并允许估计此类假设的后验概率。狄利克雷过程混合模型通过基于模型的聚类自然地在对象之间借用信息,而对单个假设的推断则对聚类不确定性进行平均。我们通过模拟研究表明,与其他竞争方法相比,我们的方法在多重假设检验中具有更高的灵敏度,并且产生的错误发现比例更低。虽然我们的建模框架是通用的,但在此我们展示了在微阵列实验的基因表达背景下的一个应用。在我们的应用中,建模框架允许对控制差异表达的参数进行同时推断以及对基因聚类进行推断。我们使用小鼠心肌氧化应激转录反应的实验数据,并将我们程序的结果与现有的非参数贝叶斯方法进行比较,后者仅根据基因差异表达的证据对基因进行排名。