Newman Tera L, Tuzun Eray, Morrison V Anne, Hayden Karen E, Ventura Mario, McGrath Sean D, Rocchi Mariano, Eichler Evan E
Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
Genome Res. 2005 Oct;15(10):1344-56. doi: 10.1101/gr.4338005. Epub 2005 Sep 16.
Structural changes (deletions, insertions, and inversions) between human and chimpanzee genomes have likely had a significant impact on lineage-specific evolution because of their potential for dramatic and irreversible mutation. The low-quality nature of the current chimpanzee genome assembly precludes the reliable identification of many of these differences. To circumvent this, we applied a method to optimally map chimpanzee fosmid paired-end sequences against the human genome to systematically identify sites of structural variation > or = 12 kb between the two species. Our analysis yielded a total of 651 putative sites of chimpanzee deletion (n = 293), insertions (n = 184), and rearrangements consistent with local inversions between the two genomes (n = 174). We validated a subset (19/23) of insertion and deletions using PCR and Southern blot assays, confirming the accuracy of our method. The events are distributed throughout the genome on all chromosomes but are highly correlated with sites of segmental duplication in human and chimpanzee. These structural variants encompass at least 24 Mb of DNA and overlap with > 245 genes. Seventeen of these genes contain exons missing in the chimpanzee genomic sequence and also show a significant reduction in gene expression in chimpanzee. Compared with the pioneering work of Yunis, Prakash, Dutrillaux, and Lejeune, this analysis expands the number of potential rearrangements between chimpanzees and humans 50-fold. Furthermore, this work prioritizes regions for further finishing in the chimpanzee genome and provides a resource for interrogating functional differences between humans and chimpanzees.
人类和黑猩猩基因组之间的结构变化(缺失、插入和倒位)可能因其具有产生剧烈且不可逆转突变的潜力,而对特定谱系的进化产生了重大影响。当前黑猩猩基因组组装的低质量特性使得难以可靠地识别其中许多差异。为了规避这一问题,我们应用了一种方法,将黑猩猩黏粒双末端序列与人类基因组进行最佳比对,以系统地识别这两个物种之间长度≥12 kb的结构变异位点。我们的分析总共产生了651个推定的黑猩猩缺失位点(n = 293)、插入位点(n = 184)以及与两个基因组之间局部倒位一致的重排位点(n = 174)。我们使用PCR和Southern印迹分析验证了一部分插入和缺失位点(19/23),证实了我们方法的准确性。这些事件分布在所有染色体的整个基因组中,但与人类和黑猩猩的片段重复位点高度相关。这些结构变异至少涵盖了24 Mb的DNA,并且与超过245个基因重叠。其中17个基因包含在黑猩猩基因组序列中缺失的外显子,并且在黑猩猩中的基因表达也显著降低。与尤尼斯、普拉卡什、迪特里洛和勒热纳的开创性工作相比,该分析将黑猩猩和人类之间潜在重排的数量扩大了50倍。此外,这项工作确定了黑猩猩基因组中需要进一步完善的区域,并为探究人类和黑猩猩之间的功能差异提供了资源。