Pereira Sérgio L, Baker Allan J
Centre for Biodiversity and Conservation Biology - Royal Ontario Museum, Toronto, ON, M5S 2C6 Canada.
BMC Evol Biol. 2004 Jun 25;4:17. doi: 10.1186/1471-2148-4-17.
Mitochondrial DNA has been detected in the nuclear genomes of eukaryotes as pseudogenes, known as Numts. Human and plant genomes harbor large numbers of Numts, some of which are highly similar to mitochondrial fragments and thus may have been inadvertently included in population genetic and phylogenetic studies that use mitochondrial DNA. Birds have smaller genomes than mammals, and the genome-wide frequency and distribution of Numts in birds are still unknown. The release of a preliminary version of the chicken (Gallus gallus) genome by the Genome Sequencing Center at Washington University, St. Louis, provided an opportunity to search this first avian genome for Numts and to compare their frequency and characteristics with those in humans and plants.
We detected at least 13 Numts in the chicken nuclear genome. Identities between Numts and mitochondrial sequences varied from 58.6 to 88.8%. Fragments ranged from 131 to 1,733 nucleotides, collectively representing only 0.00078% of the nuclear genome. Because so few Numts were detected in the chicken nuclear genome, they do not represent all regions of the mitochondrial genome and are not spread across all chromosomes. Nuclear integrations in chicken seem to occur via a DNA intermediate and in regions of low gene density, especially in macrochromosomes.
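The two summary statistics reported above (percent identity between a Numt and its mitochondrial counterpart, and the share of the nuclear genome occupied by Numts) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names are hypothetical, identity is assumed here to be counted only over alignment columns without gaps (other tools may define it differently), and the input values in the usage note are made up for illustration rather than taken from the study.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences.

    Assumption: columns where either sequence has a gap ('-') are skipped,
    so identity is computed over ungapped columns only.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def genome_percentage(fragment_lengths, genome_size_bp: int) -> float:
    """Percentage of a genome covered by fragments of the given lengths (bp)."""
    if genome_size_bp <= 0:
        raise ValueError("genome size must be positive")
    return 100.0 * sum(fragment_lengths) / genome_size_bp
```

For example, `percent_identity("ACGTAC", "ACGTTC")` compares six ungapped columns with five matches, and `genome_percentage([131, 1733], 1_000_000)` reports the share of a toy 1 Mb genome covered by two fragments of the shortest and longest lengths mentioned above.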
The number of Numts in chicken is low compared with that in human and plant genomes, and is within the range found for most sequenced eukaryotic genomes. For chicken, PCR amplifications of fragments of about 1.5 kilobases are highly likely to represent true mitochondrial amplification. Sequencing of these fragments should expose the unusual features typical of pseudogenes, unless the nuclear integration is very recent and has not yet accumulated mutations. Metabolic selection for compact genomes, with less repetitive DNA and fewer of the gene-poor regions where Numts occur, may explain their low incidence in birds.