Suckling John, Ohlssen David, Andrew Christopher, Johnson Glyn, Williams Steven C R, Graves Martin, Chen Chi-Hua, Spiegelhalter David, Bullmore Ed
Brain Mapping Unit, Department of Psychiatry, University of Cambridge, Cambridge, United Kingdom.
Hum Brain Mapp. 2008 Oct;29(10):1111-22. doi: 10.1002/hbm.20451.
This article firstly presents a theoretical analysis of the statistical power of a parallel-group, repeated-measures (two-session) and two-centre design suitable for a placebo-controlled pharmacological MRI study. For arbitrary effect size, power is determined by the pooled between-session error, the pooled measurement error, the ratio of centre measurement errors, the total number of subjects and the proportion of subjects studied at the centre with greatest measurement error. Secondly, an experiment is described to obtain empirical estimates of variance components in task-related and resting state functional magnetic resonance imaging. Twelve healthy volunteers were scanned at two centres during performance of blocked and event-related versions of an affect processing task (each repeated twice per session) and rest. In activated regions, variance components were estimated: between-subject (23% of total), between-centre (2%), between-paradigm (4%), within-session occasion (paradigm repeat; 2%) and residual (measurement) error (69%). The between-centre ratio of measurement errors was 0.8. A similar analysis for the Hurst exponent estimated in resting data showed negligible contributions of between-subject and between-centre variability; measurement error accounted for 99% of total variance. Substituting these estimates in the theoretical expression for power, incorporation of two centres in the design necessitates a modest (10%) increase in the total number of subjects compared with a single-centre study. Furthermore, considerable improvements in power can be attained by repetition of the task within each scanning session. Thus, theoretical models of power and empirical data indicate that between-centre variability can be small enough to encourage multicentre designs without major compensatory increases in sample size.
本文首先对适用于安慰剂对照药理磁共振成像研究的平行组、重复测量(两阶段)和双中心设计的统计功效进行了理论分析。对于任意效应大小,功效由合并的组间误差、合并的测量误差、中心测量误差之比、受试者总数以及在测量误差最大的中心所研究的受试者比例决定。其次,描述了一项实验,以获取任务相关和静息态功能磁共振成像中方差成分的经验估计值。12名健康志愿者在两个中心接受扫描,期间进行了情感加工任务的组块式和事件相关版本(每个版本在每个阶段重复两次)以及静息状态。在激活区域,估计了方差成分:受试者间(占总方差的23%)、中心间(2%)、范式间(4%)、阶段内时机(范式重复;2%)和残差(测量)误差(69%)。中心间测量误差之比为0.8。对静息数据中估计的赫斯特指数进行的类似分析表明,受试者间和中心间变异性的贡献可忽略不计;测量误差占总方差的99%。将这些估计值代入功效的理论表达式中,与单中心研究相比,设计中纳入两个中心需要受试者总数适度增加(10%)。此外,通过在每个扫描阶段内重复任务,可以显著提高功效。因此,功效的理论模型和经验数据表明,中心间变异性可以小到足以鼓励多中心设计,而无需大幅增加样本量作为补偿。