Lu Yong, Huggins Peter, Bar-Joseph Ziv
School of Computer Science and Lane Center for Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
Bioinformatics. 2009 Jun 15;25(12):1476-83. doi: 10.1093/bioinformatics/btp247. Epub 2009 Apr 8.
Many biological systems operate in a similar manner across a large number of species or conditions. Cross-species analysis of sequence and interaction data is often applied to determine the function of new genes. In contrast to these static measurements, microarrays measure the dynamic, condition-specific response of complex biological systems. The recent exponential growth in microarray expression datasets allows researchers to combine expression experiments from multiple species to identify genes that are not only conserved in sequence but also operated in a similar way in the different species studied.
In this review we discuss the computational and technical challenges associated with these studies, the approaches that have been developed to address these challenges and the advantages of cross-species analysis of microarray data. We show how successful application of these methods lead to insights that cannot be obtained when analyzing data from a single species. We also highlight current open problems and discuss possible ways to address them.
许多生物系统在大量物种或条件下以相似的方式运行。序列和相互作用数据的跨物种分析经常用于确定新基因的功能。与这些静态测量不同,微阵列测量复杂生物系统的动态、特定条件下的反应。最近微阵列表达数据集呈指数增长,这使得研究人员能够将来自多个物种的表达实验结合起来,以识别不仅在序列上保守,而且在所研究的不同物种中以相似方式运作的基因。
在本综述中,我们讨论了与这些研究相关的计算和技术挑战、为应对这些挑战而开发的方法以及微阵列数据跨物种分析的优势。我们展示了这些方法的成功应用如何带来分析单一物种数据时无法获得的见解。我们还强调了当前存在的开放性问题,并讨论了解决这些问题的可能方法。