Department of Statistics, Pennsylvania State University, University Park, Pennsylvania 16802, USA.
BMC Bioinformatics. 2014 Jun 28;15:226. doi: 10.1186/1471-2105-15-226.
Meta-analysis has become increasingly popular in recent years, especially in genomic data analysis, due to the fast growth of available data and studies that target the same questions. Many methods have been developed, including classical ones such as Fisher's combined probability test and Stouffer's Z-test. However, not all meta-analyses have the same goal in mind. Some aim at combining information to find signals in at least one of the studies, while others hope to find more consistent signals across the studies. While many classical meta-analysis methods are developed with the former goal in mind, the latter goal has much more practicality for genomic data analysis.
In this paper, we propose a class of meta-analysis methods based on summaries of weighted ordered p-values (WOP) that aim at detecting significance in a majority of studies. We consider weighted versions of classical procedures such as Fisher's method and Stouffer's method where the weight for each p-value is based on its order among the studies. In particular, we consider weights based on the binomial distribution, where the median of the p-values are weighted highest and the outlying p-values are down-weighted. We investigate the properties of our methods and demonstrate their strengths through simulations studies, comparing to existing procedures. In addition, we illustrate application of the proposed methodology by several meta-analysis of gene expression data.
Our proposed weighted ordered p-value (WOP) methods displayed better performance compared to existing methods for testing the hypothesis that there is signal in the majority of studies. They also appeared to be much more robust in applications compared to the rth ordered p-value (rOP) method (Song and Tseng, Ann. Appl. Stat. 2014, 8(2):777-800). With the flexibility of incorporating different p-value combination methods and different weighting schemes, the weighted ordered p-values (WOP) methods have great potential in detecting consistent signal in meta-analysis with heterogeneity.
近年来,元分析在基因组数据分析中变得越来越流行,这主要是由于可用数据和针对相同问题的研究的快速增长。已经开发了许多方法,包括经典方法,如 Fisher 联合概率检验和 Stouffer 的 Z 检验。然而,并非所有的元分析都有相同的目标。有些旨在合并信息,以在至少一项研究中找到信号,而另一些则希望在研究之间找到更一致的信号。虽然许多经典的元分析方法都是基于前者的目标开发的,但后者的目标对于基因组数据分析更具有实际意义。
在本文中,我们提出了一类基于加权有序 P 值(WOP)汇总的元分析方法,旨在检测大多数研究中的显著性。我们考虑了经典方法的加权版本,如 Fisher 方法和 Stouffer 方法,其中每个 P 值的权重基于其在研究中的顺序。特别是,我们考虑了基于二项分布的权重,其中 P 值的中位数加权最高,异常 P 值的权重降低。我们研究了我们方法的性质,并通过模拟研究与现有程序进行了比较,展示了它们的优势。此外,我们通过几个基因表达数据的元分析说明了所提出的方法的应用。
与现有的检验大多数研究中有信号的假设的方法相比,我们提出的加权有序 P 值(WOP)方法显示出更好的性能。与 r 阶 P 值(rOP)方法(Song 和 Tseng,Ann.Appl.Stat.2014,8(2):777-800)相比,它们在应用中也显得更加稳健。通过灵活地结合不同的 P 值组合方法和不同的加权方案,加权有序 P 值(WOP)方法在具有异质性的元分析中检测一致信号方面具有很大的潜力。