微阵列数据分析中组方差不相等。
Unequal group variances in microarray data analyses.
作者信息
Demissie Meaza, Mascialino Barbara, Calza Stefano, Pawitan Yudi
机构信息
Department of Statistics, University of Orebro, Sweden.
出版信息
Bioinformatics. 2008 May 1;24(9):1168-74. doi: 10.1093/bioinformatics/btn100. Epub 2008 Mar 14.
MOTIVATION
In searching for differentially expressed (DE) genes in microarray data, we often observe a fraction of the genes to have unequal variability between groups. This is not an issue in large samples, where a valid test exists that uses individual variances separately. The problem arises in the small-sample setting, where the approximately valid Welch test lacks sensitivity, while the more sensitive moderated t-test assumes equal variance.
METHODS
We introduce a moderated Welch test (MWT) that allows unequal variance between groups. It is based on (i) weighting of pooled and unpooled standard errors and (ii) improved estimation of the gene-level variance that exploits the information from across the genes.
RESULTS
When a non-trivial proportion of genes has unequal variability, false discovery rate (FDR) estimates based on the standard t and moderated t-tests are often too optimistic, while the standard Welch test has low sensitivity. The MWT is shown to (i) perform better than the standard t, the standard Welch and the moderated t-tests when the variances are unequal between groups and (ii) perform similarly to the moderated t, and better than the standard t and Welch tests when the group variances are equal. These results mean that MWT is more reliable than other existing tests over wider range of data conditions.
AVAILABILITY
R package to perform MWT is available at http://www.meb.ki.se/~yudpaw
动机
在寻找微阵列数据中差异表达(DE)的基因时,我们经常观察到一部分基因在组间具有不等的变异性。在大样本中这不是问题,因为存在一种有效的检验方法,可分别使用个体方差。问题出现在小样本情况下,近似有效的韦尔奇检验缺乏敏感性,而更敏感的适度t检验假定方差相等。
方法
我们引入了一种允许组间方差不等的适度韦尔奇检验(MWT)。它基于(i)合并和未合并标准误的加权,以及(ii)利用来自所有基因的信息对基因水平方差进行改进估计。
结果
当相当比例的基因具有不等变异性时,基于标准t检验和适度t检验的错误发现率(FDR)估计往往过于乐观,而标准韦尔奇检验的敏感性较低。结果表明,(i)当组间方差不等时,MWT的性能优于标准t检验、标准韦尔奇检验和适度t检验;(ii)当组方差相等时,MWT的性能与适度t检验相似,且优于标准t检验和韦尔奇检验。这些结果意味着在更广泛的数据条件范围内,MWT比其他现有检验更可靠。
可用性
执行MWT的R包可在http://www.meb.ki.se/~yudpaw获取