Gentles Andrew J, Wakefield Matthew J, Kohany Oleksiy, Gu Wanjun, Batzer Mark A, Pollock David D, Jurka Jerzy
Department of Radiology, School of Medicine, Stanford University, Stanford, California 94305, USA.
Genome Res. 2007 Jul;17(7):992-1004. doi: 10.1101/gr.6070707. Epub 2007 May 10.
The genome of the gray short-tailed opossum Monodelphis domestica is notable for its large size ( approximately 3.6 Gb). We characterized nearly 500 families of interspersed repeats from the Monodelphis. They cover approximately 52% of the genome, higher than in any other amniotic lineage studied to date, and may account for the unusually large genome size. In comparison to other mammals, Monodelphis is significantly rich in non-LTR retrotransposons from the LINE-1, CR1, and RTE families, with >29% of the genome sequence comprised of copies of these elements. Monodelphis has at least four families of RTE, and we report support for horizontal transfer of this non-LTR retrotransposon. In addition to short interspersed elements (SINEs) mobilized by L1, we found several families of SINEs that appear to use RTE elements for mobilization. In contrast to L1-mobilized SINEs, the RTE-mobilized SINEs in Monodelphis appear to shift from G+C-rich to G+C-low regions with time. Endogenous retroviruses have colonized approximately 10% of the opossum genome. We found that their density is enhanced in centromeric and/or telomeric regions of most Monodelphis chromosomes. We identified 83 new families of ancient repeats that are highly conserved across amniotic lineages, including 14 LINE-derived repeats; and a novel SINE element, MER131, that may have been exapted as a highly conserved functional noncoding RNA, and whose emergence dates back to approximately 300 million years ago. Many of these conserved repeats are also present in human, and are highly over-represented in predicted cis-regulatory modules. Seventy-six of the 83 families are present in chicken in addition to mammals.
灰短尾负鼠(Monodelphis domestica)的基因组因其庞大的规模(约3.6Gb)而引人注目。我们对来自Monodelphis的近500个散布重复序列家族进行了特征分析。它们覆盖了约52%的基因组,高于迄今为止研究的任何其他羊膜动物谱系,这可能是该基因组异常庞大的原因。与其他哺乳动物相比,Monodelphis在LINE-1、CR1和RTE家族的非LTR反转录转座子方面显著丰富,基因组序列中超过29%由这些元件的拷贝组成。Monodelphis至少有四个RTE家族,我们报告了对这种非LTR反转录转座子水平转移的支持。除了由L1动员的短散布元件(SINEs)外,我们还发现了几个似乎利用RTE元件进行动员的SINEs家族。与L1动员的SINEs不同,Monodelphis中由RTE动员的SINEs似乎随着时间的推移从富含G+C的区域转移到G+C含量低的区域。内源性逆转录病毒占据了负鼠基因组的约10%。我们发现它们在大多数Monodelphis染色体的着丝粒和/或端粒区域密度增加。我们鉴定了83个在羊膜动物谱系中高度保守的古老重复序列新家族,包括14个LINE衍生重复序列;以及一个新的SINE元件MER131,它可能已被改造为一种高度保守的功能性非编码RNA,其出现可追溯到约3亿年前。这些保守重复序列中的许多也存在于人类中,并且在预测的顺式调控模块中高度富集。除哺乳动物外,83个家族中的76个也存在于鸡中。