Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany.
Theor Appl Genet. 2010 Jun;121(1):157-68. doi: 10.1007/s00122-010-1299-4. Epub 2010 Mar 3.
Plant genetic diversity has been mainly investigated with neutral markers, but large-scale DNA sequencing projects now enable the identification and analysis of different classes of genetic polymorphisms, such as non-synonymous single nucleotide polymorphisms (nsSNPs) in protein coding sequences. Using the SIFT and MAPP programs to predict whether nsSNPs are tolerated (i.e., effectively neutral) or deleterious for protein function, genome-wide nsSNP data from Arabidopsis thaliana and rice were analyzed. In both species, about 20% of polymorphic sites with nsSNPs were classified as deleterious; they segregate at lower allele frequencies than tolerated nsSNPs due to purifying selection. Furthermore, A. thaliana accessions from marginal populations show a higher relative proportion of deleterious nsSNPs, which likely reflects differential selection or demographic effects in subpopulations. To evaluate the sensitivity of predictions, genes from model and crop plants with known functional effects of nsSNPs were inferred with the algorithms. The programs predicted about 70% of nsSNPs correctly as tolerated or deleterious, i.e., as having a functional effect. Forward-in-time simulations of bottleneck and domestication models indicated a high power to detect demographic effects on nsSNP frequencies in sufficiently large datasets. The results indicate that nsSNPs are useful markers for analyzing genetic diversity in plant genetic resources and breeding populations to infer natural/artificial selection and genetic drift.
植物遗传多样性主要通过中性标记进行研究,但大规模 DNA 测序项目现在使不同类别的遗传多态性的识别和分析成为可能,如蛋白质编码序列中的非同义单核苷酸多态性(nsSNP)。使用 SIFT 和 MAPP 程序来预测 nsSNP 是否耐受(即有效中性)或对蛋白质功能具有破坏性,对拟南芥和水稻的全基因组 nsSNP 数据进行了分析。在这两个物种中,约 20%的多态性位点带有 nsSNP 被归类为有害;由于纯化选择,它们的等位基因频率低于耐受的 nsSNP 分离。此外,来自边缘种群的拟南芥品系显示出更高比例的有害 nsSNP,这可能反映了亚群中的差异选择或人口统计学效应。为了评估预测的敏感性,使用算法从具有已知 nsSNP 功能效应的模型和作物植物中推断出基因。这些程序正确预测了约 70%的 nsSNP 为耐受或有害,即具有功能效应。瓶颈和驯化模型的正向模拟表明,在足够大的数据集上检测 nsSNP 频率的人口统计学效应的能力很高。结果表明,nsSNP 是分析植物遗传资源和育种群体遗传多样性的有用标记,可推断自然/人工选择和遗传漂变。