Racimo Fernando
Department of Integrative Biology, University of California, Berkeley, California 94720
Genetics. 2016 Feb;202(2):733-50. doi: 10.1534/genetics.115.178095. Epub 2015 Nov 23.
A powerful way to detect selection in a population is by modeling local allele frequency changes in a particular region of the genome under scenarios of selection and neutrality and finding which model is most compatible with the data. A previous method based on a cross-population composite likelihood ratio (XP-CLR) uses an outgroup population to detect departures from neutrality that could be compatible with hard or soft sweeps, at linked sites near a beneficial allele. However, this method is most sensitive to recent selection and may miss selective events that happened a long time ago. To overcome this, we developed an extension of XP-CLR that jointly models the behavior of a selected allele in a three-population tree. Our method - called "3-population composite likelihood ratio" (3P-CLR) - outperforms XP-CLR when testing for selection that occurred before two populations split from each other and can distinguish between those events and events that occurred specifically in each of the populations after the split. We applied our new test to population genomic data from the 1000 Genomes Project, to search for selective sweeps that occurred before the split of Yoruba and Eurasians, but after their split from Neanderthals, and that could have led to the spread of modern-human-specific phenotypes. We also searched for sweep events that occurred in East Asians, Europeans, and the ancestors of both populations, after their split from Yoruba. In both cases, we are able to confirm a number of regions identified by previous methods and find several new candidates for selection in recent and ancient times. For some of these, we also find suggestive functional mutations that may have driven the selective events.
在群体中检测选择的一种有效方法是,在选择和中性的情况下,对基因组特定区域的局部等位基因频率变化进行建模,并找出最符合数据的模型。一种基于跨群体复合似然比(XP-CLR)的先前方法,利用一个外群群体来检测与硬选择或软选择兼容的、有益等位基因附近连锁位点的中性偏离。然而,该方法对近期选择最为敏感,可能会错过很久以前发生的选择事件。为了克服这一问题,我们开发了XP-CLR的扩展方法,该方法在一个三群体树中联合建模选定等位基因的行为。我们的方法——称为“三群体复合似然比”(3P-CLR)——在测试两个群体彼此分裂之前发生的选择时,优于XP-CLR,并且可以区分这些事件与分裂后每个群体中专门发生的事件。我们将新测试应用于千人基因组计划的群体基因组数据,以寻找在约鲁巴人和欧亚人分裂之前、但在他们与尼安德特人分裂之后发生的、可能导致现代人类特异性表型传播的选择扫荡。我们还搜索了东亚人、欧洲人以及这两个人群的祖先在与约鲁巴人分裂之后发生的扫荡事件。在这两种情况下,我们都能够确认一些先前方法所确定的区域,并找到一些近期和古代选择的新候选区域。对于其中一些区域,我们还发现了可能驱动选择事件的功能性突变线索。