Department of Mathematical Statistics, Chalmers University of Technology/University of Gothenburg, Gothenburg, Sweden.
BMC Bioinformatics. 2013 Feb 27;14:70. doi: 10.1186/1471-2105-14-70.
Analysis of gene expression from different species is a powerful way to identify evolutionarily conserved transcriptional responses. However, due to evolutionary events such as gene duplication, there is no one-to-one correspondence between genes from different species which makes comparison of their expression profiles complex.
In this paper we describe a new method for cross-species meta-analysis of gene expression. The method takes the homology structure between compared species into account and can therefore compare expression data from genes with any number of orthologs and paralogs. A simulation study shows that the proposed method results in a substantial increase in statistical power compared to previously suggested procedures. As a proof of concept, we analyzed microarray data from heat stress experiments performed in eight species and identified several well-known evolutionarily conserved transcriptional responses. The method was also applied to gene expression profiles from five studies of estrogen exposed fish and both known and potentially novel responses were identified.
The method described in this paper will further increase the potential and reliability of meta-analysis of gene expression profiles from evolutionarily distant species. The method has been implemented in R and is freely available at http://bioinformatics.math.chalmers.se/Xspecies/.
分析不同物种的基因表达是识别进化保守转录反应的有力方法。然而,由于基因复制等进化事件,不同物种的基因之间没有一一对应的关系,这使得它们的表达谱比较变得复杂。
在本文中,我们描述了一种用于跨物种基因表达元分析的新方法。该方法考虑了比较物种之间的同源结构,因此可以比较具有任意数量的直系同源物和旁系同源物的基因的表达数据。模拟研究表明,与先前提出的方法相比,所提出的方法可显著提高统计功效。作为概念验证,我们分析了在八个物种中进行的热应激实验的微阵列数据,并鉴定了几个众所周知的进化保守转录反应。该方法还应用于雌激素暴露鱼类的五个研究的基因表达谱,鉴定了已知和潜在的新反应。
本文描述的方法将进一步提高元分析来自进化上遥远物种的基因表达谱的潜力和可靠性。该方法已在 R 中实现,并可在 http://bioinformatics.math.chalmers.se/Xspecies/ 免费获得。