Department of Biological Sciences, Brock University, St Catharines, Ontario L2S 3A1, Canada.
Mob DNA. 2013 Nov 12;4(1):25. doi: 10.1186/1759-8753-4-25.
Alu elements are the most abundant mobile elements in the human genome, with over 1 million copies constituting more than 10% of the genome. The majority of these Alu elements were inserted into the primate genome 35 to 60 million years ago, but certain Alu subfamilies are relatively young and are suspected to be still evolving. We attempted to trace the source/master copies of all human-specific members of the Alu Yb lineage using a computational approach: clustering similar Yb elements and constructing an evolutionary relationship among the members of each cluster.
We discovered that one copy of Yb8 at 10p14 is the source of several active Yb8 copies, which retrotransposed to generate 712 copies, or 54% of all human-specific Yb8 elements. We detected eight other Yb8 elements that had each generated ten or more copies, potentially acting as 'stealth drivers'. One Yb8 element at 14q32.31 seemed to act as the source copy for all Yb9 elements tested, having produced 13 active Yb9 elements and subsequently generated a total of 131 full-length copies. We identified and characterized three new subclasses of Yb elements: Yb8a1, Yb10 and Yb11. Their copy numbers in the reference genome are 75, 8 and 16, respectively. We analysed personal genome data from the 1000 Genomes Project and detected an additional 6 Yb8a1, 3 Yb10 and 15 Yb11 copies outside the reference genome. Our analysis indicates that the Yb8a1 subfamily has a similar age to Yb9 (1.93 million years and 2.15 million years, respectively), while Yb10 and Yb11 evolved only 1.4 and 0.71 million years ago, respectively, suggesting a linear evolutionary path from Yb8a1 to Yb10 and then to Yb11. Our preliminary data indicate that members of Yb10 and Yb11 are mostly polymorphic, consistent with their young age.
Our findings suggest that the Yb lineage is still evolving, with new subfamilies being formed. Because of their very young age and high rate of polymorphism, insertions from these young subfamilies are very useful genetic markers for studying human population genetics and migration patterns, as well as trends in mobile element insertion in the human genome.
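As a rough check on the copy numbers reported above, the following sketch tallies the reference and non-reference counts for the three new subfamilies and back-calculates the approximate number of human-specific Yb8 elements implied by "712 copies or 54%". The function and variable names are illustrative assumptions, not part of the study's pipeline. The non-reference fraction is only a crude proxy and is not the polymorphism rate measured by the authors. Because 54% is a rounded figure, the implied total is approximate.

```python
# Counts taken from the abstract:
# (copies in the reference genome, additional copies found in 1000 Genomes Project data)
COUNTS = {
    "Yb8a1": (75, 6),
    "Yb10": (8, 3),
    "Yb11": (16, 15),
}


def totals(counts):
    """Total known copies per subfamily (reference plus non-reference)."""
    return {name: ref + extra for name, (ref, extra) in counts.items()}


def non_reference_fraction(counts):
    """Share of known copies that lie outside the reference genome.

    Illustrative proxy only; this is not the authors' polymorphism measure.
    """
    return {name: extra / (ref + extra) for name, (ref, extra) in counts.items()}


def implied_total(part, fraction):
    """Back-calculate a total from a part and the fraction it represents."""
    return part / fraction


if __name__ == "__main__":
    for name, total in totals(COUNTS).items():
        frac = non_reference_fraction(COUNTS)[name]
        print(f"{name}: {total} copies, {frac:.1%} outside the reference")
    # 712 copies are stated to be 54% of all human-specific Yb8 elements.
    print(f"Implied human-specific Yb8 total: about {implied_total(712, 0.54):.0f}")
```

Under these figures, the youngest subfamily (Yb11) has the largest share of copies outside the reference genome, which fits the abstract's statement that Yb10 and Yb11 members are mostly polymorphic.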