Triant Deborah A, Pearson William R
Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, United States.
Front Genet. 2022 Nov 22;13:984513. doi: 10.3389/fgene.2022.984513. eCollection 2022.
The integration of mitochondrial genome fragments into the nuclear genome is well documented, and the transfer of these mitochondrial nuclear pseudogenes (numts) is thought to be an ongoing evolutionary process. With the increasing number of eukaryotic genomes available, genome-wide distributions of numts are often surveyed. However, inconsistencies in genome quality can reduce the accuracy of numt estimates, and methods used for identification can be complicated by the diverse sizes and ages of numts. Numts have been previously characterized in rodent genomes and it was postulated that they might be more prevalent in a group of voles with rapidly evolving karyotypes. Here, we examine 37 rodent genomes, and an additional 26 vertebrate genomes, while also considering numt detection methods. We identify numts using DNA:DNA and protein:translated-DNA similarity searches and compare numt distributions among rodent and vertebrate taxa to assess whether some groups are more susceptible to transfer. A combination of protein sequence comparisons (protein:translated-DNA) and BLASTN genomic DNA searches detects 50% more numts than genomic DNA:DNA searches alone. In addition, higher-quality RefSeq genomes produce lower estimates of numts than GenBank genomes, suggesting that lower-quality genome assemblies can overestimate numt abundance. Phylogenetic analysis shows that mitochondrial transfers are not associated with karyotypic diversity among rodents. Surprisingly, we did not find a strong correlation between numt counts and genome size. Estimates using DNA:DNA analyses can underestimate the amount of mitochondrial DNA that is transferred to the nucleus.
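One way to picture how combining two search types can raise a numt count is to pool the hit coordinates from both searches and merge overlapping intervals, so that each nuclear region is counted once. The sketch below is illustrative only and is not the authors' pipeline: the function names, the `(contig, start, end)` hit format, and the coordinates are all hypothetical, and real analyses would also filter hits by score or E-value.

```python
from collections import defaultdict


def merge_hits(hits):
    """Merge overlapping (contig, start, end) hits into non-redundant regions.

    Assumption: each hit is a tuple of contig name and 1-based inclusive
    coordinates; this format is hypothetical, not taken from the paper.
    """
    by_contig = defaultdict(list)
    for contig, start, end in hits:
        # Normalize so start <= end, since reverse-strand hits may be reported flipped.
        by_contig[contig].append((min(start, end), max(start, end)))

    merged = {}
    for contig, spans in by_contig.items():
        spans.sort()
        regions = [list(spans[0])]
        for start, end in spans[1:]:
            if start <= regions[-1][1]:
                # Overlaps the current region: extend it.
                regions[-1][1] = max(regions[-1][1], end)
            else:
                regions.append([start, end])
        merged[contig] = [tuple(r) for r in regions]
    return merged


def count_regions(merged):
    """Total number of merged regions across all contigs."""
    return sum(len(regions) for regions in merged.values())


# Made-up coordinates for illustration; not data from the study.
dna_hits = [("chr1", 100, 400), ("chr1", 350, 600), ("chr2", 50, 200)]
protein_hits = [("chr1", 550, 900), ("chr2", 1000, 1300), ("chr3", 10, 310)]

dna_only = count_regions(merge_hits(dna_hits))
combined = count_regions(merge_hits(dna_hits + protein_hits))
print(dna_only, combined)
```

In this toy input the protein-based hits both extend a region already found by the DNA search and add regions the DNA search missed, which is the kind of effect the abstract reports for older, more diverged numts.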