Ghysels Sarah, De Baets Bernard, Reheul Dirk, Maenhout Steven
Department of Plants and Crops, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.
Department of Data Analysis and Mathematical Modelling, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.
Front Plant Sci. 2025 Mar 12;16:1549099. doi: 10.3389/fpls.2025.1549099. eCollection 2025.
In the early stages of selection, many plant breeding programmes still rely on visual evaluations of traits by experienced breeders. While this approach has proven to be effective, it requires considerable time, labour and expertise. Moreover, its subjective nature makes it difficult to reproduce and compare evaluations. The field of automated high-throughput phenotyping aims to resolve these issues. A widely adopted strategy uses drone images processed by machine learning algorithms to characterise phenotypes. This approach was used in the present study to assess the dry matter yield of tall fescue and its accuracy was compared to that of the breeder's evaluations, using field measurements as ground truth. RGB images of tall fescue individuals were processed by two types of predictive models: a random forest and convolutional neural network. In addition to computing dry matter yield, the two methods were applied to identify the top 10% highest-yielding plants and predict the breeder's score. The convolutional neural network outperformed the random forest method and exceeded the predictive power of the breeder's eye. It predicted dry matter yield with an R² of 0.62, which surpassed the accuracy of the breeder's score by 8 percentage points. Additionally, the algorithm demonstrated strong performance in identifying top-performing plants and estimating the breeder's score, achieving balanced accuracies of 0.81 and 0.74, respectively. These findings indicate that the tested automated phenotyping approach could not only offer improvements in cost, time efficiency and objectivity, but also enhance selection accuracy. As a result, this technique has the potential to increase overall breeding efficiency, accelerate genetic progress, and shorten the time to market. To conclude, phenotyping by means of RGB-based machine learning models provides a reliable alternative or addition to the visual evaluation of selection candidates in a tall fescue breeding programme.
在选育的早期阶段,许多植物育种计划仍依赖经验丰富的育种者对性状进行视觉评估。虽然这种方法已被证明是有效的,但它需要大量的时间、劳动力和专业知识。此外,其主观性使得评估难以复制和比较。自动化高通量表型分析领域旨在解决这些问题。一种广泛采用的策略是使用通过机器学习算法处理的无人机图像来表征表型。本研究采用这种方法评估高羊茅的干物质产量,并将其准确性与育种者的评估结果进行比较,以田间测量作为地面真值。高羊茅个体的RGB图像由两种预测模型处理:随机森林和卷积神经网络。除了计算干物质产量外,这两种方法还被用于识别产量最高的前10%的植株并预测育种者的评分。卷积神经网络的表现优于随机森林方法,并且超过了育种者肉眼的预测能力。它预测干物质产量的R²为0.62,比育种者评分的准确性高出8个百分点。此外,该算法在识别表现最佳的植株和估计育种者评分方面表现出色,平衡准确率分别达到0.81和0.74。这些发现表明,所测试的自动化表型分析方法不仅可以在成本、时间效率和客观性方面有所改进,还可以提高选择准确性。因此,这项技术有可能提高整体育种效率,加速遗传进展,并缩短上市时间。总之,通过基于RGB的机器学习模型进行表型分析为高羊茅育种计划中对候选选择植株的视觉评估提供了一种可靠的替代方法或补充。