Department of Chemical Engineering, Pennsylvania State University, University Park, PA, USA.
Plant J. 2020 Jul;103(2):512-531. doi: 10.1111/tpj.14746. Epub 2020 Apr 18.
Genetic sources of phenotypic variation have been a focus of plant studies aimed at improving agricultural yield and understanding adaptive processes. Genome-wide association studies identify the genetic background behind a trait by examining associations between phenotypes and single-nucleotide polymorphisms (SNPs). Although such studies are common, biological interpretation of the results remains a challenge; especially due to the confounding nature of population structure and the systematic biases thus introduced. Here, we propose a complementary analysis (SNPeffect) that offers putative genotype-to-phenotype mechanistic interpretations by integrating biochemical knowledge encoded in metabolic models. SNPeffect is used to explain differential growth rate and metabolite accumulation in A. thaliana and P. trichocarpa accessions as the outcome of SNPs in enzyme-coding genes. To this end, we also constructed a genome-scale metabolic model for Populus trichocarpa, the first for a perennial woody tree. As expected, our results indicate that growth is a complex polygenic trait governed by carbon and energy partitioning. The predicted set of functional SNPs in both species are associated with experimentally characterized growth-determining genes and also suggest putative ones. Functional SNPs were found in pathways such as amino acid metabolism, nucleotide biosynthesis, and cellulose and lignin biosynthesis, in line with breeding strategies that target pathways governing carbon and energy partition.
遗传变异的表型来源一直是植物研究的重点,目的是提高农业产量和理解适应过程。全基因组关联研究通过检查表型和单核苷酸多态性(SNP)之间的关联来确定性状背后的遗传背景。尽管此类研究很常见,但对结果的生物学解释仍然是一个挑战;特别是由于群体结构的混杂性质及其引入的系统偏差。在这里,我们提出了一种补充分析(SNPeffect),通过整合代谢模型中编码的生化知识,提供潜在的基因型到表型的机制解释。SNPeffect 用于解释拟南芥和毛白杨品种中由于酶编码基因中的 SNP 而导致的生长速率和代谢物积累的差异。为此,我们还构建了杨树的基因组规模代谢模型,这是第一个用于多年生木本植物的模型。正如预期的那样,我们的结果表明,生长是一个由碳和能量分配控制的复杂多基因性状。在这两个物种中,预测的功能 SNP 与经过实验表征的决定生长的基因相关,也暗示了一些潜在的基因。在氨基酸代谢、核苷酸生物合成以及纤维素和木质素生物合成等途径中发现了功能 SNP,这与针对控制碳和能量分配的途径的育种策略一致。