Stephan Klaas Enno, Penny Will D, Daunizeau Jean, Moran Rosalyn J, Friston Karl J
Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London, UK.
Neuroimage. 2009 Jul 15;46(4):1004-17. doi: 10.1016/j.neuroimage.2009.03.025. Epub 2009 Mar 20.
Bayesian model selection (BMS) is a powerful method for determining the most likely among a set of competing hypotheses about the mechanisms that generated observed data. BMS has recently found widespread application in neuroimaging, particularly in the context of dynamic causal modelling (DCM). However, so far, combining BMS results from several subjects has relied on simple (fixed effects) metrics, e.g. the group Bayes factor (GBF), that do not account for group heterogeneity or outliers. In this paper, we compare the GBF with two random effects methods for BMS at the between-subject or group level. These methods provide inference on model-space using a classical and Bayesian perspective respectively. First, a classical (frequentist) approach uses the log model evidence as a subject-specific summary statistic. This enables one to use analysis of variance to test for differences in log-evidences over models, relative to inter-subject differences. We then consider the same problem in Bayesian terms and describe a novel hierarchical model, which is optimised to furnish a probability density on the models themselves. This new variational Bayes method rests on treating the model as a random variable and estimating the parameters of a Dirichlet distribution which describes the probabilities for all models considered. These probabilities then define a multinomial distribution over model space, allowing one to compute how likely it is that a specific model generated the data of a randomly chosen subject as well as the exceedance probability of one model being more likely than any other model. Using empirical and synthetic data, we show that optimising a conditional density of the model probabilities, given the log-evidences for each model over subjects, is more informative and appropriate than both the GBF and frequentist tests of the log-evidences. In particular, we found that the hierarchical Bayesian approach is considerably more robust than either of the other approaches in the presence of outliers. We expect that this new random effects method will prove useful for a wide range of group studies, not only in the context of DCM, but also for other modelling endeavours, e.g. comparing different source reconstruction methods for EEG/MEG or selecting among competing computational models of learning and decision-making.
贝叶斯模型选择(BMS)是一种强大的方法,用于在一组关于生成观测数据机制的相互竞争的假设中确定最有可能的假设。BMS最近在神经成像中得到了广泛应用,特别是在动态因果模型(DCM)的背景下。然而,到目前为止,合并多个受试者的BMS结果依赖于简单的(固定效应)指标,例如组贝叶斯因子(GBF),这些指标没有考虑组内异质性或异常值。在本文中,我们在受试者间或组水平上比较了GBF与两种用于BMS的随机效应方法。这些方法分别从经典和贝叶斯的角度对模型空间进行推断。首先,一种经典的(频率主义)方法使用对数模型证据作为受试者特定的汇总统计量。这使得人们能够使用方差分析来检验模型间对数证据的差异,相对于受试者间的差异。然后,我们从贝叶斯的角度考虑同样的问题,并描述了一种新颖分层模型,该模型经过优化以提供模型本身的概率密度。这种新的变分贝叶斯方法基于将模型视为随机变量,并估计描述所考虑的所有模型概率的狄利克雷分布的参数。这些概率随后在模型空间上定义了一个多项分布,使人们能够计算特定模型生成随机选择受试者数据的可能性,以及一个模型比任何其他模型更有可能的超越概率。使用实证数据和合成数据,我们表明,在给定每个模型针对受试者的对数证据的情况下,优化模型概率的条件密度比GBF和对数证据的频率主义检验更具信息性和适用性。特别是,我们发现分层贝叶斯方法在存在异常值的情况下比其他任何一种方法都要稳健得多。我们预计这种新的随机效应方法将被证明对广泛的组研究有用,不仅在DCM的背景下,而且对于其他建模工作,例如比较脑电图/脑磁图的不同源重建方法或在学习和决策的竞争计算模型中进行选择。