Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853
CSIC, Institute for Sustainable Agriculture, Córdoba, Spain.
G3 (Bethesda). 2019 Sep 4;9(9):2963-2975. doi: 10.1534/g3.119.400228.
Oat ( L.) has a high concentration of oils, comprised primarily of healthful unsaturated oleic and linoleic fatty acids. To accelerate oat plant breeding efforts, we sought to identify loci associated with variation in fatty acid composition, defined as the types and quantities of fatty acids. We genotyped a panel of 500 oat cultivars with genotyping-by-sequencing and measured the concentrations of ten fatty acids in these oat cultivars grown in two environments. Measurements of individual fatty acids were highly correlated across samples, consistent with fatty acids participating in shared biosynthetic pathways. We leveraged these phenotypic correlations in two multivariate genome-wide association study (GWAS) approaches. In the first analysis, we fitted a multivariate linear mixed model for all ten fatty acids simultaneously while accounting for population structure and relatedness among cultivars. In the second, we performed a univariate association test for each principal component (PC) derived from a singular value decomposition of the phenotypic data matrix. To aid interpretation of results from the multivariate analyses, we also conducted univariate association tests for each trait. The multivariate mixed model approach yielded 148 genome-wide significant single-nucleotide polymorphisms (SNPs) at a 10% false-discovery rate, compared to 129 and 73 significant SNPs in the PC and univariate analyses, respectively. Thus, explicit modeling of the correlation structure between fatty acids in a multivariate framework enabled identification of loci associated with variation in seed fatty acid concentration that were not detected in the univariate analyses. Ultimately, a detailed characterization of the loci underlying fatty acid variation can be used to enhance the nutritional profile of oats through breeding.
燕麦(L.)含有高浓度的油,主要由健康的不饱和油酸和亚油酸组成。为了加速燕麦植物的选育工作,我们试图鉴定与脂肪酸组成变化相关的基因座,脂肪酸组成变化定义为脂肪酸的类型和数量。我们使用测序的基因分型方法对 500 个燕麦品种进行了基因分型,并测量了这些在两个环境中生长的燕麦品种中十种脂肪酸的浓度。个体脂肪酸的测量值在样本之间高度相关,这与脂肪酸参与共享生物合成途径一致。我们利用这两个多维全基因组关联研究(GWAS)方法中的表型相关性。在第一个分析中,我们同时拟合了一个包含十种脂肪酸的多元线性混合模型,同时考虑了群体结构和品种间的相关性。在第二个分析中,我们对来自表型数据矩阵奇异值分解的每个主成分(PC)进行了单变量关联测试。为了帮助解释多元分析的结果,我们还对每个性状进行了单变量关联测试。多元混合模型方法在 10%的假发现率下产生了 148 个全基因组显著的单核苷酸多态性(SNP),而在 PC 和单变量分析中,分别有 129 个和 73 个显著 SNP。因此,在多元框架中明确建模脂肪酸之间的相关性结构,能够鉴定出与种子脂肪酸浓度变化相关的基因座,而这些基因座在单变量分析中未被检测到。最终,对脂肪酸变化的基因座进行详细描述可以通过选育来提高燕麦的营养特性。