Department of Statistics, The George Washington University , Washington, DC , USA.
Front Public Health. 2013 Dec 9;1:63. doi: 10.3389/fpubh.2013.00063. eCollection 2013.
We review and compare multiple hypothesis testing procedures used in clinical trials and those in genomic studies. Clinical trials often employ global tests, which draw an overall conclusion for all the hypotheses, such as SUM test, Two-Step test, Approximate Likelihood Ratio test (ALRT), Intersection-Union Test (IUT), and MAX test. The SUM and Two-Step tests are most powerful under homogeneous treatment effects, while the ALRT and MAX test are robust in cases with non-homogeneous treatment effects. Furthermore, the ALRT is robust to unequal sample sizes in testing different hypotheses. In genomic studies, stepwise procedures are used to draw marker-specific conclusions and control family wise error rate (FWER) or false discovery rate (FDR). FDR refers to the percent of false positives among all significant results and is preferred over FWER in screening high-dimensional genomic markers due to its interpretability. In cases where correlations between test statistics cannot be ignored, Westfall-Young resampling method generates the joint distribution of P-values under the null and maintains their correlation structure. Finally, the GWAS data from a clinical trial searching for SNPs associated with nephropathy among Type 1 diabetic patients are used to illustrate various procedures.
我们回顾和比较了临床试验和基因组研究中使用的多种假设检验程序。临床试验通常采用全局检验,对所有假设进行总体结论,如 SUM 检验、两步检验、近似似然比检验(ALRT)、交集并集检验(IUT)和 MAX 检验。SUM 和两步检验在同质处理效应下最有效,而 ALRT 和 MAX 检验在处理效应非同质的情况下具有稳健性。此外,ALRT 在检验不同假设时对不等样本量具有稳健性。在基因组研究中,逐步程序用于得出标记特异性结论,并控制家族错误率(FWER)或假发现率(FDR)。FDR 是指所有显著结果中假阳性的百分比,由于其可解释性,在筛选高维基因组标记时优于 FWER。在不能忽略检验统计量之间相关性的情况下,Westfall-Young 重抽样方法生成零假设下 P 值的联合分布,并保持它们的相关结构。最后,使用来自临床试验的 GWAS 数据,该试验旨在寻找与 1 型糖尿病患者肾病相关的 SNPs,来说明各种程序。