Department of Biological Sciences, University of Calgary, Calgary, Alberta, Canada.
Department of Microbiology & Immunology, University of Minnesota, Minneapolis, Minnesota, United States of America.
PLoS Genet. 2018 Oct 8;14(10):e1007717. doi: 10.1371/journal.pgen.1007717. eCollection 2018 Oct.
Convergent adaptation occurs at the genome scale when independently evolving lineages use the same genes to respond to similar selection pressures. These patterns of genetic repeatability provide insights into the factors that facilitate or constrain the diversity of genetic responses that contribute to adaptive evolution. A first step in studying such factors is to quantify the observed amount of repeatability relative to expectations under a null hypothesis. Here, we formulate a novel index to quantify the constraints driving the observed amount of repeated adaptation in pairwise contrasts based on the hypergeometric distribution, and then generalize this for simultaneous analysis of multiple lineages. This index is explicitly based on the probability of observing a given amount of repeatability by chance under a given null hypothesis and is readily compared among different species and types of trait. We also formulate an index to quantify the effective proportion of genes in the genome that have the potential to contribute to adaptation. As an example of how these indices can be used to draw inferences, we assess the amount of repeatability observed in existing datasets on adaptation to stress in yeast and climate in conifers. This approach provides a method to test a wide range of hypotheses about how different kinds of factors can facilitate or constrain the diversity of genetic responses observed during adaptive evolution.
当独立进化的谱系使用相同的基因来应对相似的选择压力时,就会在基因组范围内发生趋同适应。这些遗传可重复性模式为研究促进或限制有助于适应性进化的遗传反应多样性的因素提供了线索。研究这些因素的第一步是量化相对于零假设下预期的观察到的可重复性程度。在这里,我们基于超几何分布,为基于成对比较的观察到的重复适应的驱动因素制定了一个新的指标,然后将其推广到对多个谱系的同时分析。该指标明确基于给定零假设下观察到给定程度可重复性的概率,并且可以在不同物种和性状类型之间进行比较。我们还制定了一个指标来量化基因组中具有潜在适应能力的基因的有效比例。作为如何使用这些指标进行推断的示例,我们评估了在酵母应激适应和针叶树气候适应的现有数据集中观察到的可重复性程度。这种方法提供了一种方法来测试关于不同类型的因素如何促进或限制适应性进化过程中观察到的遗传反应多样性的广泛假设。