Thas Olivier, Clement Lieven, Rayner John C W, Carvalho Beatriz, Van Criekinge Wim
Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Coupure Links 653, B-9000 Gent, Belgium.
Biometrics. 2012 Jun;68(2):446-54. doi: 10.1111/j.1541-0420.2012.01750.x. Epub 2012 Apr 16.
We present an adaptive percentile modified Wilcoxon rank sum test for the two-sample problem. The test is basically a Wilcoxon rank sum test applied on a fraction of the sample observations, and the fraction is adaptively determined by the sample observations. Most of the theory is developed under a location-shift model, but we demonstrate that the test is also meaningful for testing against more general alternatives. The test may be particularly useful for the analysis of massive datasets in which quasi-automatic hypothesis testing is required. We investigate the power characteristics of the new test in a simulation study, and we apply the test to a microarray experiment on colorectal cancer. These empirical studies demonstrate that the new test has good overall power and that it succeeds better in finding differentially expressed genes as compared to other popular tests. We conclude that the new nonparametric test is widely applicable and that its power is comparable to the power of the Baumgartner-Weiß-Schindler test.
我们提出了一种用于两样本问题的自适应百分位数修正威尔科克森秩和检验。该检验本质上是对一部分样本观测值应用威尔科克森秩和检验,且这一比例由样本观测值自适应确定。大部分理论是在位置偏移模型下发展起来的,但我们证明该检验对于针对更一般备择假设的检验也有意义。该检验对于需要准自动假设检验的海量数据集的分析可能特别有用。我们在一项模拟研究中考察了新检验的功效特征,并将该检验应用于一项关于结直肠癌的微阵列实验。这些实证研究表明,新检验具有良好的总体功效,并且与其他常用检验相比,在发现差异表达基因方面表现更优。我们得出结论,新的非参数检验具有广泛的适用性,其功效与鲍姆加特纳 - 魏斯 - 辛德勒检验的功效相当。