Southworth Lucinda K, Kim Stuart K, Owen Art B
Biomedical Informatics, Stanford University, Stanford, California 94305, USA.
J Comput Biol. 2009 Apr;16(4):625-38. doi: 10.1089/cmb.2008.0144.
This paper takes a close look at balanced permutations, a recently developed sample reuse method with applications in bioinformatics. It turns out that balanced permutation reference distributions do not have the correct null behavior, which can be traced to their lack of a group structure. We find that they can give p-values that are too permissive to varying degrees. In particular the observed test statistic can be larger than that of all B balanced permutations of a data set with a probability much higher than 1/(B + 1), even under the null hypothesis.
本文深入研究了平衡排列,这是一种最近开发的样本重用方法,在生物信息学中有应用。事实证明,平衡排列参考分布没有正确的零假设行为,这可以追溯到它们缺乏群结构。我们发现它们可能会给出不同程度上过于宽松的p值。特别是,即使在零假设下,观察到的检验统计量大于数据集的所有B个平衡排列的检验统计量的概率也远高于1/(B + 1)。