MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, UK.
BMC Genomics. 2013 Mar 11;14:161. doi: 10.1186/1471-2164-14-161.
Statistical analysis of genome-wide microarrays can result in many thousands of identical statistical tests being performed as each probe is tested for an association with a phenotype of interest. If there were no association between any of the probes and the phenotype, the distribution of P values obtained from statistical tests would resemble a Uniform distribution. If a selection of probes were significantly associated with the phenotype we would expect to observe P values for these probes of less than the designated significance level, alpha, resulting in more P values of less than alpha than expected by chance.
In data from a whole genome methylation promoter array we unexpectedly observed P value distributions where there were fewer P values less than alpha than would be expected by chance. Our data suggest that a possible reason for this is a violation of the statistical assumptions required for these tests arising from heteroskedasticity. A simple but statistically sound remedy (a heteroskedasticity-consistent covariance matrix estimator to calculate standard errors of regression coefficients that are robust to heteroskedasticity) rectified this violation and resulted in meaningful P value distributions.
The statistical analysis of 'omics data requires careful handling, especially in the choice of statistical test. To obtain meaningful results it is essential that the assumptions behind these tests are carefully examined and any violations rectified where possible, or a more appropriate statistical test chosen.
对全基因组微阵列进行统计分析可能会导致对每个探针进行与感兴趣的表型关联的测试,从而进行数千次相同的统计测试。如果没有任何探针与表型之间存在关联,则从统计测试中获得的 P 值分布将类似于均匀分布。如果选择的一些探针与表型显著相关,我们预计会观察到这些探针的 P 值小于指定的显著性水平α,从而导致小于α的 P 值比预期的机会更多。
在全基因组甲基化启动子阵列的数据中,我们意外地观察到 P 值分布,其中小于α的 P 值比预期的机会要少。我们的数据表明,这种情况的一个可能原因是由于异方差性,这些测试所需的统计假设受到违反。一种简单但统计学上合理的补救方法(一种异方差一致协方差矩阵估计器,用于计算对异方差稳健的回归系数的标准误差)纠正了这种违反情况,并导致了有意义的 P 值分布。
“组学”数据的统计分析需要谨慎处理,尤其是在选择统计测试时。为了获得有意义的结果,必须仔细检查这些测试背后的假设,并尽可能纠正任何违反情况,或者选择更合适的统计测试。