Department of Genetics and Plant Breeding, Ch. Charan Singh University, Meerut, UP, India.
State Level Biotechnology Centre, Mahatma Phule Agricultural University, Rahuri, MS, India.
Adv Genet. 2019;104:75-154. doi: 10.1016/bs.adgen.2018.12.001. Epub 2019 Jan 22.
With the availability of DNA-based molecular markers during early 1980s and that of sophisticated statistical tools in late 1980s and later, it became possible to identify genomic regions that control a quantitative trait. The two methods used for this purpose included quantitative trait loci (QTL) interval mapping and genome-wide association mapping/studies (GWAS). Both these methods have their own merits and demerits, so that newer approaches were developed in order to deal with the demerits. We have now entered a post-GWAS era, where either the original data on individual genotypes are being used again keeping in view the results of GWAS or else summary statistics obtained through GWAS is subjected to further analysis. The first half of this review briefly deals with the approaches that were used for GWAS, the GWAS results obtained in some major crops (maize, wheat, rice, sorghum and soybean), their utilization for crop improvement and the improvements made to address the limitations of original GWA studies (computational demand, multiple testing and false discovery, rare marker alleles, etc.). These improvements included the development of multi-locus and multi-trait analysis, joint linkage association mapping, etc. Since originally GWA studies were used for mere identification of marker-trait association for marker-assisted selection, the second half of the review is devoted to activities in post-GWAS era, which include different methods that are being used for identification of causal variants and their prioritization (meta-analysis, pathway-based analysis, methylation QTL), functional characterization of candidate signals, gene- and gene-set based association mapping, GWAS using high dimensional data through machine learning, etc. The last section deals with popular resources available for GWAS in plants in the post-GWAS era and the implications of the results of post-GWAS for crop improvement.
自 20 世纪 80 年代初 DNA 分子标记的出现和 80 年代末及以后复杂统计工具的出现,人们有可能识别控制数量性状的基因组区域。为此目的使用的两种方法包括数量性状位点(QTL)区间作图和全基因组关联作图/研究(GWAS)。这两种方法都有其自身的优点和缺点,因此为了克服缺点,开发了更新的方法。我们现在已经进入了后 GWAS 时代,要么根据 GWAS 的结果再次使用个体基因型的原始数据,要么对通过 GWAS 获得的汇总统计数据进行进一步分析。本文的前半部分简要介绍了用于 GWAS 的方法、在一些主要作物(玉米、小麦、水稻、高粱和大豆)中获得的 GWAS 结果、它们在作物改良中的利用以及为解决原始 GWA 研究的局限性而进行的改进(计算需求、多重测试和错误发现、稀有标记等位基因等)。这些改进包括多基因座和多性状分析、联合连锁关联作图等的发展。由于最初的 GWA 研究仅用于识别标记辅助选择的标记 - 性状关联,因此本文的后半部分致力于后 GWAS 时代的活动,包括用于鉴定因果变异及其优先级的不同方法(荟萃分析、基于途径的分析、甲基化 QTL)、候选信号的功能特征、基于候选信号的基因和基因集关联作图、通过机器学习使用高维数据进行 GWAS 等。最后一部分讨论了后 GWAS 时代植物中用于 GWAS 的流行资源以及后 GWAS 结果对作物改良的影响。