Zhang Yu, Song Giltae, Vinar Tomás, Green Eric D, Siepel Adam, Miller Webb
Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, PA 16802, USA.
J Comput Biol. 2009 Aug;16(8):1051-70. doi: 10.1089/cmb.2009.0040.
Clusters of genes that evolved from single progenitors via repeated segmental duplications present significant challenges to the generation of a truly complete human genome sequence. Such clusters can confound both accurate sequence assembly and downstream computational analysis, yet they are a hotbed of functional innovation, which makes them of great interest. We have developed an algorithm for reconstructing the evolutionary history of gene clusters using only human genomic sequence data, which allows the tempo of large-scale evolutionary events in human gene clusters to be estimated. We further propose an extension of the method that simultaneously reconstructs the evolutionary histories of orthologous gene clusters in multiple primates, which will facilitate primate comparative sequencing studies that aim to reconstruct the evolutionary history of these clusters more fully.