Edlich-Muth Christian, Muraya Moses M, Altmann Thomas, Selbig Joachim
Bioinformatics Group, Institute for Biochemistry and Biology, University of Potsdam, 14476, Germany; Max Planck Institute of Molecular Plant Physiology, Potsdam 14476, Germany.
Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany.
Biosystems. 2016 Aug;146:102-9. doi: 10.1016/j.biosystems.2016.05.008. Epub 2016 May 19.
Phenomic experiments are carried out in large-scale plant phenotyping facilities that acquire a large number of pictures of hundreds of plants simultaneously. With the aid of automated image processing, the data are converted into genotype-feature matrices that cover many consecutive days of development. Here, we explore the possibility of predicting the biomass of the fully grown plant from early developmental stage image-derived features. We performed phenomic experiments on 195 inbred and 382 hybrid maizes varieties and followed their progress from 16 days after sowing (DAS) to 48 DAS with 129 image-derived features. By applying sparse regression methods, we show that 73% of the variance in hybrid fresh weight of fully-grown plants is explained by about 20 features at the three-leaf-stage or earlier. Dry weight prediction explained over 90% of the variance. When phenomic features of parental inbred lines were used as predictors of hybrid biomass, the proportion of variance explained was 42 and 45%, for fresh weight and dry weight models consisting of 35 and 36 features, respectively. These models were very robust, showing only a small amount of variation in performance over the time scale of the experiment. We also examined mid-parent heterosis in phenomic features. Feature heterosis displayed a large degree of variance which resulted in prediction performance that was less robust than models of either parental or hybrid predictors. Our results show that phenomic prediction is a viable alternative to genomic and metabolic prediction of hybrid performance. In particular, the utility of early-stage parental lines is very encouraging.
表型组学实验在大规模植物表型分析设施中进行,这些设施能同时获取数百株植物的大量图像。借助自动化图像处理技术,数据被转换为涵盖连续多日发育情况的基因型-特征矩阵。在此,我们探讨了根据发育早期阶段图像衍生特征预测成熟植株生物量的可能性。我们对195个自交系和382个杂交玉米品种进行了表型组学实验,并从播种后16天(DAS)到48 DAS跟踪它们的生长进程,共使用了129个图像衍生特征。通过应用稀疏回归方法,我们发现成熟杂交植株鲜重变异的73%可由三叶期或更早阶段的约20个特征来解释。干重预测解释了超过90%的变异。当使用亲本自交系的表型组学特征作为杂交生物量的预测指标时,对于分别由35个和36个特征组成的鲜重和干重模型,所解释的变异比例分别为42%和45%。这些模型非常稳健,在实验时间尺度上性能仅有少量变化。我们还研究了表型组学特征中的中亲杂种优势。特征杂种优势表现出很大程度的变异,导致预测性能不如亲本或杂交预测指标的模型稳健。我们的结果表明,表型组学预测是杂交性能基因组和代谢预测的可行替代方法。特别是,早期亲本系的效用非常令人鼓舞。