Pounds Stan, Rai Shesh N
Department of Biostatistics, St. Jude Children's Research Hospital, 332 N. Lauderdale St., Memphis, TN, 38105, USA.
Comput Stat Data Anal. 2009 Mar 15;53(5):1604-1612. doi: 10.1016/j.csda.2008.05.010.
The concept of assumption adequacy averaging is introduced as a technique to develop more robust methods that incorporate assessments of assumption adequacy into the analysis. The concept is illustrated by using it to develop a method that averages results from the t-test and nonparametric rank-sum test with weights obtained from using the Shapiro-Wilk test to test the assumption of normality. Through this averaging process, the proposed method is able to rely more heavily on the statistical test that the data suggests is superior for each individual gene. Subsequently, this method developed by assumption adequacy averaging outperforms its two component methods (the t-test and rank-sum test) in a series of traditional and bootstrap-based simulation studies. The proposed method showed greater concordance in gene selection across two studies of gene expression in acute myeloid leukemia than did the t-test or rank-sum test. An R routine to implement the method is available upon request.
引入了假设充分性平均的概念,作为一种开发更稳健方法的技术,该方法将假设充分性评估纳入分析中。通过使用夏皮罗-威尔克检验来检验正态性假设所获得的权重,对t检验和非参数秩和检验的结果进行平均,以此来说明这一概念。通过这种平均过程,所提出的方法能够更依赖于数据表明对每个个体基因更优的统计检验。随后,在一系列传统的和基于自助法的模拟研究中,通过假设充分性平均开发的这种方法优于其两个组成方法(t检验和秩和检验)。在两项急性髓系白血病基因表达研究中,所提出的方法在基因选择上比t检验或秩和检验表现出更高的一致性。如有需要,可提供实现该方法的R程序。