Montazeri Zahra, Yanofsky Corey M, Bickel David R
Ottawa Institute of Systems Biology.
Stat Appl Genet Mol Biol. 2010;9:Article23. doi: 10.2202/1544-6115.1504. Epub 2010 Jun 8.
Research on analyzing microarray data has focused on the problem of identifying differentially expressed genes to the neglect of the problem of how to integrate evidence that a gene is differentially expressed with information on the extent of its differential expression. Consequently, researchers currently prioritize genes for further study either on the basis of volcano plots or, more commonly, according to simple estimates of the fold change after filtering the genes with an arbitrary statistical significance threshold. While the subjective and informal nature of the former practice precludes quantification of its reliability, the latter practice is equivalent to using a hard-threshold estimator of the expression ratio that is not known to perform well in terms of mean-squared error, the sum of estimator variance and squared estimator bias. On the basis of two distinct simulation studies and data from different microarray studies, we systematically compared the performance of several estimators representing both current practice and shrinkage. We find that the threshold-based estimators usually perform worse than the maximum-likelihood estimator (MLE) and they often perform far worse as quantified by estimated mean-squared risk. By contrast, the shrinkage estimators tend to perform as well as or better than the MLE and never much worse than the MLE, as expected from what is known about shrinkage. However, a Bayesian measure of performance based on the prior information that few genes are differentially expressed indicates that hard-threshold estimators perform about as well as the local false discovery rate (FDR), the best of the shrinkage estimators studied. Based on the ability of the latter to leverage information across genes, we conclude that the use of the local-FDR estimator of the fold change instead of informal or threshold-based combinations of statistical tests and non-shrinkage estimators can be expected to substantially improve the reliability of gene prioritization at very little risk of doing so less reliably. Since the proposed replacement of post-selection estimates with shrunken estimates applies as well to other types of high-dimensional data, it could also improve the analysis of SNP data from genome-wide association studies.
对微阵列数据的分析研究主要集中在识别差异表达基因的问题上,而忽略了如何将基因差异表达的证据与差异表达程度的信息相结合的问题。因此,研究人员目前基于火山图或更常见地,根据在以任意统计显著性阈值筛选基因后对倍数变化的简单估计来对基因进行进一步研究的优先级排序。虽然前一种做法的主观和非正式性质排除了对其可靠性的量化,但后一种做法等同于使用表达比率的硬阈值估计器,而在均方误差(估计器方差与估计器偏差平方之和)方面,这种估计器的表现并不出色。基于两项不同的模拟研究以及来自不同微阵列研究的数据,我们系统地比较了代表当前做法和收缩估计的几种估计器的性能。我们发现,基于阈值的估计器通常比最大似然估计器(MLE)表现更差,并且根据估计的均方风险量化,它们往往表现得更差得多。相比之下,收缩估计器的表现往往与MLE一样好或更好,并且正如关于收缩的已知情况所预期的那样,永远不会比MLE差太多。然而,基于很少有基因差异表达的先验信息的贝叶斯性能度量表明,硬阈值估计器的表现与局部错误发现率(FDR)相当,FDR是所研究的收缩估计器中最好的。基于后者能够跨基因利用信息的能力,我们得出结论,使用倍数变化的局部FDR估计器,而不是统计检验和非收缩估计器的非正式或基于阈值组合,有望在几乎不降低可靠性风险的情况下,大幅提高基因优先级排序的可靠性。由于提议用收缩估计替换选择后的估计同样适用于其他类型的高维数据,它也可以改善全基因组关联研究中SNP数据的分析。