Cai T Tony, Liu Weidong
Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104 (
Department of Mathematics, Institute of Natural Sciences and MOE-LSC, Shanghai Jiao Tong University, Shanghai, China (
J Am Stat Assoc. 2016;111(513):229-240. doi: 10.1080/01621459.2014.999157. Epub 2016 May 5.
Multiple testing of correlations arises in many applications including gene coexpression network analysis and brain connectivity analysis. In this paper, we consider large scale simultaneous testing for correlations in both the one-sample and two-sample settings. New multiple testing procedures are proposed and a bootstrap method is introduced for estimating the proportion of the nulls falsely rejected among all the true nulls. The properties of the proposed procedures are investigated both theoretically and numerically. It is shown that the procedures asymptotically control the overall false discovery rate and false discovery proportion at the nominal level. Simulation results show that the methods perform well numerically in terms of both the size and power of the test and it significantly outperforms two alternative methods. The two-sample procedure is also illustrated by an analysis of a prostate cancer dataset for the detection of changes in coexpression patterns between gene expression levels.
相关性的多重检验出现在许多应用中,包括基因共表达网络分析和脑连接性分析。在本文中,我们考虑在单样本和两样本设置下对相关性进行大规模同时检验。提出了新的多重检验程序,并引入了一种自助法来估计在所有真零假设中被错误拒绝的零假设的比例。从理论和数值两方面研究了所提出程序的性质。结果表明,这些程序在名义水平上渐近地控制了总体错误发现率和错误发现比例。模拟结果表明,这些方法在检验的大小和功效方面在数值上表现良好,并且显著优于两种替代方法。通过对一个前列腺癌数据集进行分析,以检测基因表达水平之间共表达模式的变化,对两样本程序进行了说明。