Environmental Health Science and Research Bureau, Environmental and Radiation Health Sciences Directorate, Health Canada, Ottawa, Ontario, Canada.
BMC Bioinformatics. 2010 Jan 28;11:60. doi: 10.1186/1471-2105-11-60.
Microarray experiments examine the change in transcript levels of tens of thousands of genes simultaneously. To derive meaningful data, biologists investigate the response of genes within specific pathways. Pathways are comprised of genes that interact to carry out a particular biological function. Existing methods for analyzing pathways focus on detecting changes in the mean or over-representation of the number of differentially expressed genes relative to the total of genes within the pathway. The issue of how to incorporate the influence of correlation among the genes is not generally addressed.
In this paper, we propose a non-parametric rank test for analyzing pathways that takes into account the correlation among the genes and compared two existing methods, Global and Gene Set Enrichment Analysis (GSEA), using two publicly available data sets. A simulation study was conducted to demonstrate the advantage of the rank test method.
The data indicate the advantages of the rank test. The method can distinguish significant changes in pathways due to either correlations or changes in the mean or both. From the simulation study the rank test out performed Global and GSEA. The greatest gain in performance was for the sample size case which makes the application of the rank test ideal for microarray experiments.
微阵列实验同时检测数以万计的基因的转录水平变化。为了得出有意义的数据,生物学家研究特定通路内基因的反应。通路由相互作用以执行特定生物学功能的基因组成。现有的分析途径的方法侧重于检测相对于通路内总基因数的差异表达基因的平均值或过表达的变化。通常不解决如何结合基因之间相关性的影响的问题。
在本文中,我们提出了一种非参数秩检验方法来分析通路,该方法考虑了基因之间的相关性,并使用两个公开可用的数据集比较了两种现有方法,即全局和基因集富集分析(GSEA)。进行了一项模拟研究以证明秩检验方法的优势。
数据表明了秩检验方法的优势。该方法可以区分由于相关性或平均值变化或两者兼而有之的通路中的显著变化。从模拟研究中,秩检验优于全局和 GSEA。性能的最大提高是在样本量情况下,这使得秩检验非常适合微阵列实验。