Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, D04103 Leipzig, Germany.
Am J Hum Genet. 2010 Aug 13;87(2):237-49. doi: 10.1016/j.ajhg.2010.07.014.
Heteroplasmy, the existence of multiple mtDNA types within an individual, has been previously detected by using mostly indirect methods and focusing largely on just the hypervariable segments of the control region. Next-generation sequencing technologies should enable studies of heteroplasmy across the entire mtDNA genome at much higher resolution, because many independent reads are generated for each position. However, the higher error rate associated with these technologies must be taken into consideration to avoid false detection of heteroplasmy. We used simulations and phiX174 sequence data to design criteria for accurate detection of heteroplasmy with the Illumina Genome Analyzer platform, and we used artificial mixtures and replicate data to test and refine the criteria. We then applied these criteria to mtDNA sequence reads for 131 individuals from five Eurasian populations that had been generated via a parallel tagged approach. We identified 37 heteroplasmies at 10% frequency or higher at 34 sites in 32 individuals. The mutational spectrum does not differ between heteroplasmic mutations and polymorphisms in the same individuals, but the relative mutation rate at heteroplasmic mutations is significantly higher than that estimated for all mutable sites in the human mtDNA genome. Moreover, there is also a significant excess of nonsynonymous mutations observed among heteroplasmies, compared to polymorphism data from the same individuals. Both mutation-drift and negative selection influence the fate of heteroplasmies to determine the polymorphism spectrum in humans. With appropriate criteria for avoiding false positives due to sequencing errors, next-generation technologies can provide novel insights into genome-wide aspects of mtDNA heteroplasmy.
异质性,即个体中存在多种 mtDNA 类型,以前主要通过间接方法检测,并主要集中在控制区的高变片段上。下一代测序技术应该能够以更高的分辨率研究整个 mtDNA 基因组中的异质性,因为每个位置都会生成许多独立的读取。然而,必须考虑到这些技术相关的更高错误率,以避免假阳性检测到异质性。我们使用模拟和 phiX174 序列数据来设计使用 Illumina 基因组分析仪平台准确检测异质性的标准,并使用人工混合物和重复数据来测试和完善这些标准。然后,我们将这些标准应用于通过平行标记方法生成的来自五个欧亚人群的 131 个人的 mtDNA 序列读取。我们在 32 个人中的 34 个位点以 10%或更高的频率鉴定了 37 个异质性。在相同个体中,异质性突变和多态性的突变谱没有差异,但异质性突变的相对突变率明显高于人类 mtDNA 基因组中所有可突变位点的估计值。此外,与相同个体的多态性数据相比,异质性中观察到的非同义突变也明显过多。突变-漂变和负选择都会影响异质性的命运,从而决定人类 mtDNA 多态性的频谱。通过适当的标准来避免由于测序错误导致的假阳性,可以为 mtDNA 异质性的全基因组方面提供新的见解。