WSL Swiss Federal Research Institute Zürcherstrasse 111, CH-8903, Birmensdorf, Switzerland ; Department of Environmental Systems Science, ETH Zürich Universitätsstrasse 16, CH-8092, Zürich, Switzerland.
WSL Swiss Federal Research Institute Zürcherstrasse 111, CH-8903, Birmensdorf, Switzerland.
Ecol Evol. 2014 Nov;4(22):4296-306. doi: 10.1002/ece3.1300. Epub 2014 Oct 24.
Outlier detection and environmental association analysis are common methods to search for loci or genomic regions exhibiting signals of adaptation to environmental factors. However, a validation of outlier loci and corresponding allele distribution models through functional molecular biology or transplant/common garden experiments is rarely carried out. Here, we employ another method for validation, namely testing outlier loci in specifically designed, independent data sets. Previously, an outlier locus associated with three different habitat types had been detected in Arabis alpina. For the independent validation data set, we sampled 30 populations occurring in these three habitat types across five biogeographic regions of the Swiss Alps. The allele distribution model found in the original study could not be validated in the independent test data set: The outlier locus was no longer indicative of habitat-mediated selection. We propose several potential causes of this failure of validation, of which unaccounted genetic structure and technical issues in the original data set used to detect the outlier locus were most probable. Thus, our study shows that validating outlier loci and allele distribution models in independent data sets is a helpful tool in ecological genomics which, in the case of positive validation, adds confidence to outlier loci and their association with environmental factors or, in the case of failure of validation, helps to explain inconsistencies.
异常值检测和环境关联分析是寻找表现出对环境因素适应信号的基因座或基因组区域的常用方法。然而,很少通过功能分子生物学或移植/共同花园实验来验证异常值基因座和相应的等位基因分布模型。在这里,我们采用另一种验证方法,即在专门设计的独立数据集上测试异常值基因座。此前,在拟南芥中已经检测到一个与三种不同生境类型相关的异常值基因座。对于独立验证数据集,我们在瑞士阿尔卑斯山五个生物地理区域的这三种生境类型中共采样了 30 个种群。在原始研究中发现的等位基因分布模型在独立测试数据集上无法得到验证:异常值基因座不再指示生境介导的选择。我们提出了这种验证失败的几个潜在原因,其中最有可能的是原始数据集在检测异常值基因座时未考虑遗传结构和技术问题。因此,我们的研究表明,在独立数据集中验证异常值基因座和等位基因分布模型是生态基因组学中的一个有用工具,在验证成功的情况下,它增加了对异常值基因座及其与环境因素的关联的置信度,而在验证失败的情况下,它有助于解释不一致性。