Konkel Miriam K, Walker Jerilyn A, Hotard Ashley B, Ranck Megan C, Fontenot Catherine C, Storer Jessica, Stewart Chip, Marth Gabor T, Batzer Mark A
Department of Biological Sciences, Louisiana State University.
Department of Biological Sciences, Louisiana State University Department of Molecular, Cellular and Developmental Biology, The Ohio State University.
Genome Biol Evol. 2015 Aug 29;7(9):2608-22. doi: 10.1093/gbe/evv167.
The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages.
千人基因组计划的目标是对人类基因组结构变异(SV)进行表征,包括拷贝数变异的形式,如缺失、重复和插入。移动元件插入,特别是Alu元件,是人类基因组SV的主要贡献者。在该项目的试点阶段,我们通过实验验证了645个(611个基因间和34个外显子靶向)多态性“年轻”Alu插入事件,这些事件在人类参考基因组中不存在。在这里,我们报告了343个(322个独特)近期Alu插入事件的高分辨率测序结果,以及它们各自的靶位点重复、精确的基因组断点坐标、亚家族分类、差异百分比和估计的富含A的尾巴长度。所有测序的Alu位点均来自AluY谱系,没有证据表明涉及较老的Alu家族(如AluJ和AluS)的逆转录转座活性。AluYa5是目前人类谱系中最活跃的Alu亚家族,其次是AluYb8,以及许多其他亚家族,包括我们命名为AluYb7a3、AluYb8b1和AluYa4a1的三个新鉴定的亚家族。本报告提供了来自个体人类基因组的322个独特Alu变体的结构细节,总共增加了约100 kb的基因组变异。许多Alu亚家族目前在人类群体中活跃,包括令人惊讶的AluY逆转录转座水平。人类Alu亚家族表现出持续进化,潜在驱动因素催生了新的Alu谱系。