Fast approximation of small p-values in permutation tests by partitioning the permutations.

Author information

Segal Brian D, Braun Thomas, Elliott Michael R, Jiang Hui

Affiliations

Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, Michigan 48109-2029, U.S.A.

Publication information

Biometrics. 2018 Mar;74(1):196-206. doi: 10.1111/biom.12731. Epub 2017 May 18.

Abstract

Researchers in genetics and other life sciences commonly use permutation tests to evaluate differences between groups. Permutation tests have desirable properties, including exactness if data are exchangeable, and are applicable even when the distribution of the test statistic is analytically intractable. However, permutation tests can be computationally intensive. We propose both an asymptotic approximation and a resampling algorithm for quickly estimating small permutation p-values (e.g., <10^-6) for the difference and ratio of means in two-sample tests. Our methods are based on the distribution of test statistics within and across partitions of the permutations, which we define. In this article, we present our methods and demonstrate their use through simulations and an application to cancer genomic data. Through simulations, we find that our resampling algorithm is more computationally efficient than another leading alternative, particularly for extremely small p-values (e.g., <10^-30). Through application to cancer genomic data, we find that our methods can successfully identify up- and down-regulated genes. While we focus on the difference and ratio of means, we speculate that our approaches may work in other settings.
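
For context, the computational cost mentioned in the abstract can be seen in a plain Monte Carlo permutation test. The sketch below is not the authors' partition-based method; it is the standard baseline, written under the assumption of a two-sided test on the difference of means. The function name `permutation_p_value` and its parameters are hypothetical and chosen only for illustration.

```python
import random


def permutation_p_value(x, y, n_perm=10_000, seed=0):
    """Monte Carlo two-sided permutation p-value for a difference in means.

    Illustrative baseline only; this is NOT the partition-based
    approximation described in the abstract.
    """
    rng = random.Random(seed)
    pooled = list(x) + list(y)
    n_x = len(x)
    n_y = len(y)
    observed = abs(sum(x) / n_x - sum(y) / n_y)

    hits = 0
    for _ in range(n_perm):
        # Randomly reassign group labels by shuffling the pooled sample.
        rng.shuffle(pooled)
        perm_x, perm_y = pooled[:n_x], pooled[n_x:]
        stat = abs(sum(perm_x) / n_x - sum(perm_y) / n_y)
        if stat >= observed:
            hits += 1

    # Add-one correction: the estimate can never fall below 1 / (n_perm + 1).
    return (hits + 1) / (n_perm + 1)
```

Because the estimate is bounded below by 1 / (n_perm + 1), resolving a p-value near 10^-6 this way needs on the order of a million or more permutations per test, which is the burden that motivates faster approximations.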
