Hong Fangxin, Breitling Rainer
Department of Biostatistics, Division of Information Sciences, City of Hope National Medical Center, Beckman Research Institute, 1500 Duarte Rd, Duarte, CA 91010, USA.
Bioinformatics. 2008 Feb 1;24(3):374-82. doi: 10.1093/bioinformatics/btm620. Epub 2008 Jan 18.
The proliferation of public data repositories creates a need for meta-analysis methods to efficiently evaluate, integrate and validate related datasets produced by independent groups. A t-based approach has been proposed to integrate effect size from multiple studies by modeling both intra- and between-study variation. Recently, a non-parametric 'rank product' method, which is derived based on biological reasoning of fold-change criteria, has been applied to directly combine multiple datasets into one meta study. Fisher's Inverse chi(2) method, which only depends on P-values from individual analyses of each dataset, has been used in a couple of medical studies. While these methods address the question from different angles, it is not clear how they compare with each other.
We comparatively evaluate the three methods; t-based hierarchical modeling, rank products and Fisher's Inverse chi(2) test with P-values from either the t-based or the rank product method. A simulation study shows that the rank product method, in general, has higher sensitivity and selectivity than the t-based method in both individual and meta-analysis, especially in the setting of small sample size and/or large between-study variation. Not surprisingly, Fisher's chi(2) method highly depends on the method used in the individual analysis. Application to real datasets demonstrates that meta-analysis achieves more reliable identification than an individual analysis, and rank products are more robust in gene ranking, which leads to a much higher reproducibility among independent studies. Though t-based meta-analysis greatly improves over the individual analysis, it suffers from a potentially large amount of false positives when P-values serve as threshold. We conclude that careful meta-analysis is a powerful tool for integrating multiple array studies.
公共数据存储库的激增使得需要元分析方法来有效评估、整合和验证由独立研究小组产生的相关数据集。已经提出了一种基于t检验的方法,通过对研究内和研究间的变异进行建模来整合来自多项研究的效应量。最近,一种基于倍数变化标准的生物学推理推导出来的非参数“秩乘积”方法已被应用于直接将多个数据集合并为一项元研究。费舍尔逆卡方方法仅依赖于每个数据集单独分析得到的P值,已在一些医学研究中使用。虽然这些方法从不同角度解决了问题,但它们之间如何相互比较尚不清楚。
我们对三种方法进行了比较评估;基于t检验的层次模型、秩乘积法以及使用基于t检验或秩乘积法得到的P值的费舍尔逆卡方检验。一项模拟研究表明,一般来说,秩乘积法在个体分析和元分析中都比基于t检验的方法具有更高的灵敏度和选择性,尤其是在小样本量和/或研究间变异较大的情况下。不出所料,费舍尔卡方方法高度依赖于个体分析中使用的方法。对实际数据集的应用表明,元分析比个体分析能实现更可靠的识别,并且秩乘积法在基因排序中更稳健,这导致独立研究之间具有更高的可重复性。尽管基于t检验的元分析比个体分析有很大改进,但当以P值作为阈值时,它会受到大量潜在假阳性的影响。我们得出结论,仔细的元分析是整合多项阵列研究的有力工具。