Cox Murray P, Mendez Fernando L, Karafet Tatiana M, Pilkington Maya Metni, Kingan Sarah B, Destro-Bisol Giovanni, Strassmann Beverly I, Hammer Michael F
Department of Ecology and Evolutionary Biology, University of Arizona, Tucson 85721, USA.
Genetics. 2008 Jan;178(1):427-37. doi: 10.1534/genetics.107.080432.
A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of approximately 2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios.
X染色体RRM2P4区域内一段2.4千碱基对的片段,此前在41名全球分布的人类样本中进行过测序,显示出其最近共同祖先的时间古老(例如,约200万年的最近共同祖先时间),且一个基部进化枝完全由亚洲序列组成。这种模式被解释为反映了古代人类(最有可能是亚洲直立人)向解剖学上现代人类基因组渐渗杂交的历史。在此,我们通过对131名非洲人和122名非非洲人重新测序2.4千碱基对的RRM2P4区域,并在90名个体的子集中扩展包含RRM2P4假基因的16.5千碱基对窗口的序列长度,来验证这一假设。我们发现,古老的最近共同祖先时间以及基部进化枝之一中非非洲代表性的偏差基本上仅限于中央2.4千碱基对区域。我们定义了一个新的汇总统计量,称为最小进化枝比例(pmc),它量化了二元基因树的两个基部进化枝中来自指定地理区域的个体比例,然后采用溯祖模拟来评估在人类进化历史的两种替代观点下观察到的中央RRM2P4系统发育的可能性:近期非洲替代(RAR)和古代混合(AA)。在RAR模型下,基于分子钟的最近共同祖先时间估计值233万年是一个统计异常值;然而,与该估计值相关的巨大方差使得难以区分这里测试的人类起源模型的预测。pmc汇总统计量在染色体样本量更大时具有更高的功效,其产生的值在RAR模型下极不可能出现,而在一系列古代混合情景下更符合预期。