Storfer Andrew, Patton Austin, Fraik Alexandra K
School of Biological Sciences, Washington State University, Pullman, WA, United States.
Front Genet. 2018 Mar 13;9:68. doi: 10.3389/fgene.2018.00068. eCollection 2018.
As next-generation sequencing data become increasingly available for non-model organisms, a shift has occurred in the focus of studies of the geographic distribution of genetic variation. Whereas landscape genetics studies primarily focus on testing the effects of landscape variables on gene flow and genetic population structure, landscape genomics studies focus on detecting candidate genes under selection that indicate possible local adaptation. Navigating the transition between landscape genomics and landscape genetics can be challenging. The number of molecular markers analyzed has shifted from what used to be a few dozen loci to thousands of loci and even full genomes. Although genome scale data can be separated into sets of neutral loci for analyses of gene flow and population structure and putative loci under selection for inference of local adaptation, there are inherent differences in the questions that are addressed in the two study frameworks. We discuss these differences and their implications for study design, marker choice and downstream analysis methods. Similar to the rapid proliferation of analysis methods in the early development of landscape genetics, new analytical methods for detection of selection in landscape genomics studies are burgeoning. We focus on genome scan methods for detection of selection, and in particular, outlier differentiation methods and genetic-environment association tests because they are the most widely used. Use of genome scan methods requires an understanding of the potential mismatches between the biology of a species and assumptions inherent in analytical methods used, which can lead to high false positive rates of detected loci under selection. Key to choosing appropriate genome scan methods is an understanding of the underlying demographic structure of study populations, and such data can be obtained using neutral loci from the generated genome-wide data or prior knowledge of a species' phylogeographic history. To this end, we summarize recent simulation studies that test the power and accuracy of genome scan methods under a variety of demographic scenarios and sampling designs. We conclude with a discussion of additional considerations for future method development, and a summary of methods that show promise for landscape genomics studies but are not yet widely used.
随着非模式生物的下一代测序数据越来越容易获得,遗传变异地理分布研究的重点发生了转变。景观遗传学研究主要侧重于测试景观变量对基因流和遗传种群结构的影响,而景观基因组学研究则侧重于检测受选择的候选基因,这些基因表明可能存在局部适应性。在景观基因组学和景观遗传学之间进行转换可能具有挑战性。分析的分子标记数量已从过去的几十个位点转变为数千个位点甚至全基因组。虽然基因组规模的数据可以分为中性位点集,用于分析基因流和种群结构,以及用于推断局部适应性的假定选择位点,但这两个研究框架所解决的问题存在内在差异。我们讨论了这些差异及其对研究设计、标记选择和下游分析方法的影响。与景观遗传学早期发展中分析方法的迅速增加类似,景观基因组学研究中用于检测选择的新分析方法正在迅速涌现。我们重点关注用于检测选择的基因组扫描方法,特别是离群值分化方法和遗传-环境关联测试,因为它们是使用最广泛的。使用基因组扫描方法需要了解物种生物学与所用分析方法固有假设之间可能存在的不匹配,这可能导致检测到的选择位点出现高假阳性率。选择合适的基因组扫描方法的关键是了解研究种群的潜在种群结构,并且可以使用从生成的全基因组数据中获取的中性位点或物种系统发育地理历史的先验知识来获得此类数据。为此,我们总结了最近的模拟研究,这些研究在各种种群结构情景和抽样设计下测试了基因组扫描方法的效能和准确性。我们最后讨论了未来方法开发的其他考虑因素,并总结了对景观基因组学研究有前景但尚未广泛使用的方法。