Peña Edsel A, Habiger Joshua D, Wu Wensong
University of South Carolina, Columbia, Oklahoma State University and University of South Carolina, Columbia.
Ann Stat. 2011 Feb;39(1):556-583. doi: 10.1214/10-aos844.
Improved procedures, in terms of smaller missed discovery rates (MDR), for performing multiple hypotheses testing with weak and strong control of the family-wise error rate (FWER) or the false discovery rate (FDR) are developed and studied. The improvement over existing procedures such as the Šidák procedure for FWER control and the Benjamini-Hochberg (BH) procedure for FDR control is achieved by exploiting possible differences in the powers of the individual tests. Results signal the need to take into account the powers of the individual tests and to have multiple hypotheses decision functions which are not limited to simply using the individual -values, as is the case, for example, with the Šidák, Bonferroni, or BH procedures. They also enhance understanding of the role of the powers of individual tests, or more precisely the receiver operating characteristic (ROC) functions of decision processes, in the search for better multiple hypotheses testing procedures. A decision-theoretic framework is utilized, and through auxiliary randomizers the procedures could be used with discrete or mixed-type data or with rank-based nonparametric tests. This is in contrast to existing -value based procedures whose theoretical validity is contingent on each of these -value statistics being stochastically equal to or greater than a standard uniform variable under the null hypothesis. Proposed procedures are relevant in the analysis of high-dimensional "large , small " data sets arising in the natural, physical, medical, economic and social sciences, whose generation and creation is accelerated by advances in high-throughput technology, notably, but not limited to, microarray technology.
开发并研究了在控制族系错误率(FWER)或错误发现率(FDR)方面具有更强控制能力且漏检率(MDR)更小的改进程序,用于进行多重假设检验。通过利用各个检验功效的可能差异,相对于现有程序(如用于控制FWER的Šidák程序和用于控制FDR的Benjamini-Hochberg(BH)程序)实现了改进。结果表明,需要考虑各个检验的功效,并拥有不限于简单使用各个p值的多重假设决策函数,例如Šidák、Bonferroni或BH程序就是如此。它们还增进了对各个检验功效(或更准确地说是决策过程的接收者操作特征(ROC)函数)在寻找更好的多重假设检验程序中的作用的理解。利用了决策理论框架,并且通过辅助随机化器,这些程序可用于离散或混合型数据或基于秩的非参数检验。这与现有的基于p值的程序形成对比,后者的理论有效性取决于在原假设下每个p值统计量随机等于或大于标准均匀变量。所提出的程序在分析自然科学、物理科学、医学、经济学和社会科学中出现的高维“大p,小n”数据集时具有相关性,这些数据集的生成和创建因高通量技术(特别是但不限于微阵列技术)的进步而加速。