Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico, D.F., Mexico.
Curr Genomics. 2012 May;13(3):225-44. doi: 10.2174/138920212800543066.
Historically in plant breeding a large number of statistical models has been developed and used for studying genotype × environment interaction. These models have helped plant breeders to assess the stability of economically important traits and to predict the performance of newly developed genotypes evaluated under varying environmental conditions. In the last decade, the use of relatively low numbers of markers has facilitated the mapping of chromosome regions associated with phenotypic variability (e.g., QTL mapping) and, to a lesser extent, revealed the differetial response of these chromosome regions across environments (i.e., QTL × environment interaction). QTL technology has been useful for marker-assisted selection of simple traits; however, it has not been efficient for predicting complex traits affected by a large number of loci. Recently the appearance of cheap, abundant markers has made it possible to saturate the genome with high density markers and use marker information to predict genomic breeding values, thus increasing the precision of genetic value prediction over that achieved with the traditional use of pedigree information. Genomic data also allow assessing chromosome regions through marker effects and studying the pattern of covariablity of marker effects across differential environmental conditions. In this review, we outline the most important models for assessing genotype × environment interaction, QTL × environment interaction, and marker effect (gene) × environment interaction. Since analyzing genetic and genomic data is one of the most challenging statistical problems researchers currently face, different models from different areas of statistical research must be attempted in order to make significant progress in understanding genetic effects and their interaction with environment.
从历史上看,植物育种中已经开发和使用了大量的统计模型,用于研究基因型与环境互作。这些模型帮助植物育种者评估经济重要性状的稳定性,并预测在不同环境条件下评估的新开发基因型的表现。在过去十年中,相对较少数量的标记的使用促进了与表型变异相关的染色体区域的图谱绘制(例如,QTL 图谱),并且在较小程度上揭示了这些染色体区域在不同环境下的不同响应(即,QTL×环境互作)。QTL 技术对于简单性状的标记辅助选择很有用;然而,对于预测受大量基因座影响的复杂性状,它并不有效。最近,廉价、丰富的标记的出现使得用高密度标记饱和基因组并利用标记信息预测基因组育种值成为可能,从而提高了遗传值预测的精度,超过了传统使用系谱信息的预测精度。基因组数据还允许通过标记效应评估染色体区域,并研究标记效应在不同环境条件下的协变模式。在这篇综述中,我们概述了评估基因型与环境互作、QTL 与环境互作和标记效应(基因)与环境互作的最重要模型。由于分析遗传和基因组数据是研究人员目前面临的最具挑战性的统计问题之一,因此必须尝试来自统计研究不同领域的不同模型,才能在理解遗传效应及其与环境的相互作用方面取得重大进展。