Ma Jian, Zhang Louxin, Suh Bernard B, Raney Brian J, Burhans Richard C, Kent W James, Blanchette Mathieu, Haussler David, Miller Webb
Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania 16802, USA.
Genome Res. 2006 Dec;16(12):1557-65. doi: 10.1101/gr.5383506. Epub 2006 Sep 18.
This article analyzes mammalian genome rearrangements at higher resolution than has been published to date. We identify 3171 intervals, covering approximately 92% of the human genome, within which we find no rearrangements larger than 50 kilobases (kb) in the lineages leading to human, mouse, rat, and dog from their most recent common ancestor. Combining intervals that are adjacent in all contemporary species produces 1338 segments that may contain large insertions or deletions but that are free of chromosome fissions or fusions as well as inversions or translocations >50 kb in length. We describe a new method for predicting the ancestral order and orientation of those intervals from their observed adjacencies in modern species. We combine the results from this method with data from chromosome painting experiments to produce a map of an early mammalian genome that accounts for 96.8% of the available human genome sequence data. The precision is further increased by mapping inversions as small as 31 bp. Analysis of the predicted evolutionary breakpoints in the human lineage confirms certain published observations but disagrees with others. Although only a few mammalian genomes are currently sequenced to high precision, our theoretical analyses and computer simulations indicate that our results are reasonably accurate and that they will become highly accurate in the foreseeable future. Our methods were developed as part of a project to reconstruct the genome sequence of the last ancestor of human, dogs, and most other placental mammals.
本文以比迄今已发表的研究更高的分辨率分析了哺乳动物基因组重排。我们识别出3171个区间,覆盖了约92%的人类基因组,在从人类、小鼠、大鼠和狗的最近共同祖先到现代的谱系中,我们在这些区间内未发现大于50千碱基(kb)的重排。将所有现代物种中相邻的区间合并,产生了1338个片段,这些片段可能包含大的插入或缺失,但没有染色体裂变或融合,也没有长度大于50 kb的倒位或易位。我们描述了一种新方法,可根据现代物种中观察到的邻接关系预测这些区间的祖先顺序和方向。我们将该方法的结果与染色体涂染实验的数据相结合,生成了一张早期哺乳动物基因组图谱,该图谱涵盖了96.8%的可用人类基因组序列数据。通过绘制小至31 bp的倒位,精度进一步提高。对人类谱系中预测的进化断点的分析证实了一些已发表的观察结果,但与其他结果不一致。尽管目前只有少数哺乳动物基因组被高精度测序,但我们的理论分析和计算机模拟表明,我们的结果相当准确,并且在可预见的未来将变得高度准确。我们的方法是作为一个项目的一部分而开发的,该项目旨在重建人类、狗和大多数其他胎盘哺乳动物的最后共同祖先的基因组序列。