Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany.
Am J Hum Genet. 2013 Feb 7;92(2):279-84. doi: 10.1016/j.ajhg.2012.12.004. Epub 2013 Jan 10.
An A-to-G transition at position 16247 in the human mtDNA genome denotes haplogroup B4a1a1a and its sublineages. Informally known as the "Polynesian motif," this haplogroup has been widely used as a marker in Oceania of genetic affiliation with the Austronesian expansion. The 16247G allele has arisen only once in the human mtDNA phylogeny, about 7,000 thousand years ago, and is nearly fixed in Remote Oceania. We analyzed 536 complete mtDNA genome sequences from the Solomon Islands from haplogroup B4a1a1 and associated subhaplogroups and found multiple independent back mutations from 16247G to 16247A. We also find elevated levels of heteroplasmy at this position in samples with the 16247G allele, suggesting the ongoing occurrence of somatic back-mutations and/or transmission of heteroplasmy. Moreover, the G allele is predicted to introduce a novel stem-loop structure in the DNA sequence that may be structurally unfavorable, thereby accounting for the remarkable number of back-mutations observed at the 16247G allele in this short evolutionary time span. More generally, haplogroup-calling scripts result in inaccurate haplogroup calls involving the back-mutation and need to be supplemented with other types of analyses; this may be true for other mtDNA lineages because no other lineage has been investigated to the same extent (over 500 complete mtDNA sequences).
人类 mtDNA 基因组中位置 16247 的 A 到 G 转换表示单倍群 B4a1a1a 及其亚支。这个单倍群非正式地被称为“波利尼西亚 motif”,已被广泛用作大洋洲与南岛语族扩张有关的遗传关联的标记。16247G 等位基因在人类 mtDNA 系统发育中仅出现过一次,大约在 7000 千年前,并且在偏远的大洋洲几乎是固定的。我们分析了来自所罗门群岛的 536 个完整的 mtDNA 基因组序列,这些序列属于 B4a1a1 单倍群和相关的亚单倍群,发现了从 16247G 到 16247A 的多个独立的回复突变。我们还在具有 16247G 等位基因的样本中发现了该位置的异质性水平升高,这表明体细胞回复突变和/或异质性的传递正在发生。此外,G 等位基因预计会在 DNA 序列中引入一个新的茎环结构,这可能在结构上是不利的,从而解释了在如此短的进化时间内观察到的 16247G 等位基因的大量回复突变。更一般地说,单倍群调用脚本会导致涉及回复突变的不准确的单倍群调用,需要辅以其他类型的分析;这可能对其他 mtDNA 谱系也是如此,因为没有对其他谱系进行过同样程度的研究(超过 500 个完整的 mtDNA 序列)。