Roeder Kathryn, Devlin B, Wasserman Larry
Department of Statistics, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Genet Epidemiol. 2007 Nov;31(7):741-7. doi: 10.1002/gepi.20237.
The potential of genome-wide association analysis can only be realized when they have power to detect signals despite the detrimental effect of multiple testing on power. We develop a weighted multiple testing procedure that facilitates the input of prior information in the form of groupings of tests. For each group a weight is estimated from the observed test statistics within the group. Differentially weighting groups improves the power to detect signals in likely groupings. The advantage of the grouped-weighting concept, over fixed weights based on prior information, is that it often leads to an increase in power even if many of the groupings are not correlated with the signal. Being data dependent, the procedure is remarkably robust to poor choices in groupings. Power is typically improved if one (or more) of the groups clusters multiple tests with signals, yet little power is lost when the groupings are totally random. If there is no apparent signal in a group, relative to a group that appears to have several tests with signals, the former group will be down-weighted relative to the latter. If no groups show apparent signals, then the weights will be approximately equal. The only restriction on the procedure is that the number of groups be small, relative to the total number of tests performed.
只有当全基因组关联分析有能力检测信号时,其潜力才能得以实现,尽管多重检验对检验效能有不利影响。我们开发了一种加权多重检验程序,该程序便于以检验分组的形式输入先验信息。对于每个组,根据组内观察到的检验统计量估计一个权重。对不同组进行差异化加权可提高在可能的分组中检测信号的效能。与基于先验信息的固定权重相比,分组加权概念的优势在于,即使许多分组与信号不相关,它通常也会导致效能提高。由于该程序依赖数据,因此对分组中的错误选择具有很强的鲁棒性。如果其中一个(或多个)组将多个有信号的检验聚集在一起,通常会提高效能,而当分组完全随机时,效能损失很小。如果一个组中没有明显信号,相对于一个似乎有多个有信号检验的组,前一组相对于后一组将被赋予较低权重。如果没有组显示明显信号,那么权重将大致相等。该程序的唯一限制是,相对于所执行检验的总数,组的数量要少。