Racimo Fernando, Marnetto Davide, Huerta-Sánchez Emilia
Department of Integrative Biology, University of California Berkeley, Berkeley, CA.
Department of Molecular Biotechnology and Health Sciences, University of Torino, Turin, Italy.
Mol Biol Evol. 2017 Feb 1;34(2):296-317. doi: 10.1093/molbev/msw216.
Comparisons of DNA from archaic and modern humans show that these groups interbred, and in some cases received an evolutionary advantage from doing so. This process-adaptive introgression-may lead to a faster rate of adaptation than is predicted from models with mutation and selection alone. Within the last couple of years, a series of studies have identified regions of the genome that are likely examples of adaptive introgression. In many cases, once a region was ascertained as being introgressed, commonly used statistics based on both haplotype as well as allele frequency information were employed to test for positive selection. Introgression by itself, however, changes both the haplotype structure and the distribution of allele frequencies, thus confounding traditional tests for detecting positive selection. Therefore, patterns generated by introgression alone may lead to false inferences of positive selection. Here we explore models involving both introgression and positive selection to investigate the behavior of various statistics under adaptive introgression. In particular, we find that the number and allelic frequencies of sites that are uniquely shared between archaic humans and specific present-day populations are particularly useful for detecting adaptive introgression. We then examine the 1000 Genomes dataset to characterize the landscape of uniquely shared archaic alleles in human populations. Finally, we identify regions that were likely subject to adaptive introgression and discuss some of the most promising candidate genes located in these regions.
对古代人类和现代人类的DNA比较表明,这些群体曾进行过杂交,而且在某些情况下,杂交还带来了进化优势。这种过程——适应性基因渗入——可能会导致比仅由突变和选择构成的模型所预测的更快的适应速度。在过去几年里,一系列研究已经确定了基因组中可能是适应性基因渗入实例的区域。在许多情况下,一旦确定某个区域存在基因渗入,就会使用基于单倍型以及等位基因频率信息的常用统计方法来检测正选择。然而,基因渗入本身会改变单倍型结构和等位基因频率分布,从而混淆检测正选择的传统测试。因此,仅由基因渗入产生的模式可能会导致对正选择的错误推断。在这里,我们探索涉及基因渗入和正选择的模型,以研究适应性基因渗入情况下各种统计方法的表现。特别是,我们发现古代人类与特定现代人群体之间独特共享的位点数量和等位基因频率对于检测适应性基因渗入特别有用。然后,我们检查了千人基因组数据集,以描绘人类群体中独特共享的古代等位基因的情况。最后,我们确定了可能经历适应性基因渗入的区域,并讨论了位于这些区域的一些最有前景的候选基因。