Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824.
The DOE Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, Michigan, 48824.
Plant Cell. 2020 Jan;32(1):139-151. doi: 10.1105/tpc.19.00332. Epub 2019 Oct 22.
The ability to predict traits from genome-wide sequence information (i.e., genomic prediction) has improved our understanding of the genetic basis of complex traits and transformed breeding practices. Transcriptome data may also be useful for genomic prediction. However, it remains unclear how well transcript levels can predict traits, particularly when traits are scored at different development stages. Using maize () genetic markers and transcript levels from seedlings to predict mature plant traits, we found that transcript and genetic marker models have similar performance. When the transcripts and genetic markers with the greatest weights (i.e., the most important) in those models were used in one joint model, performance increased. Furthermore, genetic markers important for predictions were not close to or identified as regulatory variants for important transcripts. These findings demonstrate that transcript levels are useful for predicting traits and that their predictive power is not simply due to genetic variation in the transcribed genomic regions. Finally, genetic marker models identified only 1 of 14 benchmark flowering-time genes, while transcript models identified 5. These data highlight that, in addition to being useful for genomic prediction, transcriptome data can provide a link between traits and variation that cannot be readily captured at the sequence level.
从全基因组序列信息中预测性状的能力(即基因组预测)提高了我们对复杂性状遗传基础的理解,并改变了育种实践。转录组数据也可能对基因组预测有用。然而,目前尚不清楚转录水平在多大程度上可以预测性状,特别是当性状在不同的发育阶段进行评分时。利用玉米()遗传标记和幼苗的转录水平来预测成熟植物的性状,我们发现转录组和遗传标记模型具有相似的性能。当这些模型中权重最大(即最重要)的转录本和遗传标记被用于一个联合模型中时,性能得到了提高。此外,对预测重要的遗传标记并不接近或被鉴定为重要转录本的调控变体。这些发现表明转录本水平可用于预测性状,并且其预测能力并非仅仅归因于转录基因组区域的遗传变异。最后,遗传标记模型仅鉴定出 14 个基准开花时间基因中的 1 个,而转录本模型鉴定出 5 个。这些数据突出表明,转录组数据除了可用于基因组预测外,还可以提供性状与变异之间的联系,而这种联系在序列水平上难以捕捉。