Centre for Genetics and Genomics, School of Biology, University of Nottingham, University Park, Nottingham, UK.
J Mol Evol. 2011 Dec;73(5-6):287-96. doi: 10.1007/s00239-011-9475-y. Epub 2012 Jan 6.
Almost half the human genome consists of mobile DNA elements, and their analysis is a vital part of understanding the human genome as a whole. Many of these elements are ancient and have persisted in the genome for tens or hundreds of millions of years, providing a window into the evolution of modern mammals. The Golem family have been used as model transposons to highlight computational analyses which can be used to investigate these elements, particularly the use of molecular dating with large transposon families. Whole-genome searches found Golem sequences in 20 mammalian species. Golem A and B subsequences were only found in primates and squirrel. Interestingly, the full-length Golem, found as a few copies in many mammalian genomes, was found abundantly in horse. A phylogenetic profile suggested that Golem originated after the eutherian-metatherian divergence and that the A and B subfamilies originated at a much later date. Molecular dating based on sequence diversity suggests an early age, of 175 Mya, for the origin of the family and that the A and B lineages originated much earlier than expected from their current taxonomic distribution and have subsequently been lost in some lineages. Using publically available data, it is possible to investigate the evolutionary history of transposon families. Determining in which organisms a transposon can be found is often used to date the origin and expansion of the families. However, in this analysis, molecular dating, commonly used for determining the age of gene sequences, has been used, reducing the likelihood of errors from deleted lineages.
人类基因组的近一半由移动 DNA 元件组成,分析这些元件是理解整个人类基因组的重要组成部分。这些元件中有许多是古老的,已经在基因组中存在了数千万年甚至数亿年,为现代哺乳动物的进化提供了一个窗口。Golem 家族被用作模型转座子,以突出可以用于研究这些元件的计算分析,特别是使用大规模转座子家族进行分子定年。全基因组搜索在 20 种哺乳动物物种中发现了 Golem 序列。Golem A 和 B 亚序列仅在灵长类动物和松鼠中发现。有趣的是,在许多哺乳动物基因组中发现的少量拷贝的全长 Golem 在马中大量存在。系统发育谱表明 Golem 起源于真兽类-有袋类分化之后,A 和 B 亚家族起源于更晚的时期。基于序列多样性的分子定年表明,该家族的起源可追溯到 1.75 亿年前,A 和 B 谱系的起源比从其当前分类分布所预期的要早得多,并且此后在一些谱系中丢失。利用公共可用数据,可以研究转座子家族的进化历史。确定转座子可以在哪些生物体中发现,通常用于对家族的起源和扩张进行定年。然而,在这项分析中,常用于确定基因序列年龄的分子定年已被用于减少因缺失谱系而导致的错误的可能性。