Parkhomenko Elena, Tritchler David, Lemire Mathieu, Hu Pingzhao, Beyene Joseph
Biostatistics Methodology Unit, Research Institute, Hospital for Sick Children, 555 University Avenue, Toronto, Ontario M5G 1X8, Canada.
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S40. doi: 10.1186/1753-6561-3-s7-s40.
In high-dimensional studies such as genome-wide association studies, the correction for multiple testing in order to control total type I error results in decreased power to detect modest effects. We present a new analytical approach based on the higher criticism statistic that allows identification of the presence of modest effects. We apply our method to the genome-wide study of rheumatoid arthritis provided in the Genetic Analysis Workshop 16 Problem 1 data set. There is evidence for unknown bias in this study that could be explained by the presence of undetected modest effects. We compared the asymptotic and empirical thresholds for the higher criticism statistic. Using the asymptotic threshold we detected the presence of modest effects genome-wide. We also detected modest effects using 90th percentile of the empirical null distribution as a threshold; however, there is no such evidence when the 95th and 99th percentiles were used. While the higher criticism method suggests that there is some evidence for modest effects, interpreting individual single-nucleotide polymorphisms with significant higher criticism statistics is of undermined value. The goal of higher criticism is to alert the researcher that genetic effects remain to be discovered and to promote the use of more targeted and powerful studies to detect the remaining effects.
在诸如全基因组关联研究等高维研究中,为了控制总的I型错误而进行的多重检验校正会导致检测中等效应的效能降低。我们提出了一种基于更高批评统计量的新分析方法,该方法能够识别中等效应的存在。我们将我们的方法应用于遗传分析研讨会16问题1数据集中提供的类风湿性关节炎全基因组研究。该研究中有证据表明存在未知偏差,这可能是由未检测到的中等效应的存在所解释的。我们比较了更高批评统计量的渐近阈值和经验阈值。使用渐近阈值,我们在全基因组范围内检测到了中等效应的存在。我们还使用经验零分布的第90百分位数作为阈值检测到了中等效应;然而,当使用第95和第99百分位数时,没有此类证据。虽然更高批评方法表明有一些证据支持中等效应,但用具有显著更高批评统计量来解释单个单核苷酸多态性的价值有限。更高批评的目标是提醒研究人员仍有待发现遗传效应,并促进使用更具针对性和更强大的研究来检测剩余效应。