US Department of Agriculture, Institute of Forest Genetics, Pacific Southwest Research Station, Davis, California 95616, USA.
Genetics. 2010 Jul;185(3):969-82. doi: 10.1534/genetics.110.115543. Epub 2010 May 3.
Natural populations of forest trees exhibit striking phenotypic adaptations to diverse environmental gradients, thereby making them appealing subjects for the study of genes underlying ecologically relevant phenotypes. Here, we use a genome-wide data set of single nucleotide polymorphisms genotyped across 3059 functional genes to study patterns of population structure and identify loci associated with aridity across the natural range of loblolly pine (Pinus taeda L.). Overall patterns of population structure, as inferred using principal components and Bayesian cluster analyses, were consistent with three genetic clusters likely resulting from expansions out of Pleistocene refugia located in Mexico and Florida. A novel application of association analysis, which removes the confounding effects of shared ancestry on correlations between genetic and environmental variation, identified five loci correlated with aridity. These loci were primarily involved with abiotic stress response to temperature and drought. A unique set of 24 loci was identified as F(ST) outliers on the basis of the genetic clusters identified previously and after accounting for expansions out of Pleistocene refugia. These loci were involved with a diversity of physiological processes. Identification of nonoverlapping sets of loci highlights the fundamental differences implicit in the use of either method and suggests a pluralistic, yet complementary, approach to the identification of genes underlying ecologically relevant phenotypes.
森林树木的自然种群表现出对不同环境梯度的惊人表型适应,因此成为研究与生态相关表型相关基因的理想对象。在这里,我们使用跨越 3059 个功能基因的单核苷酸多态性全基因组数据集来研究群体结构模式,并确定与火炬松(Pinus taeda L.)自然分布范围内干旱相关的位点。使用主成分和贝叶斯聚类分析推断的总体群体结构模式与三个遗传聚类一致,这些聚类可能是由位于墨西哥和佛罗里达州的更新世避难所扩张而来的。关联分析的一种新应用,该方法消除了共同祖先对遗传和环境变异之间相关性的混杂影响,鉴定出与干旱相关的 5 个位点。这些位点主要涉及对温度和干旱的非生物胁迫反应。基于先前确定的遗传聚类和考虑到更新世避难所的扩张,有一组独特的 24 个位点被确定为 F(ST)异常值。这些位点涉及多种生理过程。非重叠位点集的鉴定突出了使用这两种方法所隐含的基本差异,并表明了一种多元化但互补的方法来识别与生态相关表型相关的基因。