Lin Yu-Lan, Wu Dong-Hong, Wu Cheng-Chieh, Huang Yung-Fen
Department of Agronomy, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd, Da'an Dist., Taipei, 10617, Taiwan.
Taiwan Agricultural Research Institute, Council of Agriculture, Executive Yuan, No. 189, Zhongzheng Rd, Wufeng Dist, Taichung City, 41362, Taiwan.
Bot Stud. 2021 Jan 12;62(1):2. doi: 10.1186/s40529-020-00309-y.
Weedy rice, a conspecific weedy counterpart of the cultivated rice (Oryza sativa L.), has been problematic in rice-production area worldwide. Although we started to know about the origin of some weedy traits for some rice-growing regions, an overall assessment of weedy trait-related loci was not yet available. On the other hand, the advances in sequencing technologies, together with community efforts, have made publicly available a large amount of genomic data. Given the availability of public data and the need of "weedy" allele mining for a better management of weedy rice, the objective of the present study was to explore the genetic architecture of weedy traits based on publicly available data, mainly from the 3000 Rice Genome Project (3K-RGP).
Based on the results of population structure analysis, we have selected 1378 individuals from four sub-populations (aus, indica, temperate japonica, tropical japonica) without admixed genomic composition for genome-wide association analysis (GWAS). Five traits were investigated: awn color, seed shattering, seed threshability, seed coat color, and seedling height. GWAS was conducted for each sub-population × trait combination and we have identified 66 population-specific trait-associated SNPs. Eleven significant SNPs fell into an annotated gene and four other SNPs were close to a putative candidate gene (± 25 kb). SNPs located in or close to Rc were particularly predictive of the occurrence of seed coat color and our results showed that different sub-populations required different SNPs for a better seed coat color prediction. We compared the data of 3K-RGP to a publicly available weedy rice dataset. The profile of allele frequency, phenotype-genotype segregation of target SNP, as well as GWAS results for the presence and absence of awns diverged between the two sets of data.
The genotype of trait-associated SNPs identified in this study, especially those located in or close to Rc, can be developed to diagnostic SNPs to trace the origin of weedy trait occurred in the field. The difference of results from the two publicly available datasets used in this study emphasized the importance of laboratory experiments to confirm the allele mining results based on publicly available data.
杂草稻是栽培稻(Oryza sativa L.)的同物种杂草对应物,在全球水稻种植区一直是个问题。尽管我们已经开始了解一些水稻种植区域某些杂草性状的起源,但尚未对与杂草性状相关的基因座进行全面评估。另一方面,测序技术的进步以及各方的努力,使得大量基因组数据得以公开。鉴于公开数据的可用性以及为更好地管理杂草稻而进行“杂草”等位基因挖掘的需求,本研究的目的是基于主要来自3000份水稻基因组计划(3K-RGP)的公开数据,探索杂草性状的遗传结构。
基于群体结构分析的结果,我们从四个亚群体(籼稻、粳稻、温带粳稻、热带粳稻)中选择了1378个没有混合基因组组成的个体进行全基因组关联分析(GWAS)。研究了五个性状:芒颜色、种子散落性、种子脱粒性、种皮颜色和幼苗高度。对每个亚群体×性状组合进行了GWAS分析,我们鉴定出66个群体特异性性状相关的单核苷酸多态性(SNP)。11个显著的SNP位于一个注释基因中,另外4个SNP靠近一个假定的候选基因(±25 kb)。位于Rc基因内或其附近的SNP对种皮颜色的出现具有特别的预测性,我们的结果表明,不同的亚群体需要不同的SNP才能更好地预测种皮颜色。我们将3K-RGP的数据与一个公开可用的杂草稻数据集进行了比较。两组数据之间在等位基因频率分布、目标SNP的表型-基因型分离以及有无芒的GWAS结果方面存在差异。
本研究中鉴定出的与性状相关的SNP的基因型,特别是那些位于Rc基因内或其附近的SNP,可以开发成诊断性SNP,以追踪田间出现的杂草性状的起源。本研究中使用的两个公开可用数据集结果的差异强调了实验室实验对于基于公开数据确认等位基因挖掘结果的重要性。