Corcoran Evangeline, Siles Laura, Kurup Smita, Ahnert Sebastian
Environment and Sustainability Theme, AI for Science and Government Programme, The Alan Turing Institute, London, United Kingdom.
Department of Plant Sciences for the Bioeconomy, Rothamsted Research, Harpenden, United Kingdom.
Front Plant Sci. 2023 Feb 24;14:1120182. doi: 10.3389/fpls.2023.1120182. eCollection 2023.
Plant image datasets have the potential to greatly improve our understanding of the phenotypic response of plants to environmental and genetic factors. However, manual data extraction from such datasets are known to be time-consuming and resource intensive. Therefore, the development of efficient and reliable machine learning methods for extracting phenotype data from plant imagery is crucial.
In this paper, a current gold standard computed vision method for detecting and segmenting objects in three-dimensional imagery (StartDist-3D) is applied to X-ray micro-computed tomography scans of oilseed rape () mature pods.
With a relatively minimal training effort, this fine-tuned StarDist-3D model accurately detected (Validation F1-score = 96.3%,Testing F1-score = 99.3%) and predicted the shape (mean matched score = 90%) of seeds.
This method then allowed rapid extraction of data on the number, size, shape, seed spacing and seed location in specific valves that can be integrated into models of plant development or crop yield. Additionally, the fine-tuned StarDist-3D provides an efficient way to create a dataset of segmented images of individual seeds that could be used to further explore the factors affecting seed development, abortion and maturation synchrony within the pod. There is also potential for the fine-tuned Stardist-3D method to be applied to imagery of seeds from other plant species, as well as imagery of similarly shaped plant structures such as beans or wheat grains, provided the structures targeted for detection and segmentation can be described as star-convex polygons.
植物图像数据集有潜力极大地增进我们对植物对环境和遗传因素的表型反应的理解。然而,从这类数据集中手动提取数据既耗时又耗费资源。因此,开发高效且可靠的机器学习方法以从植物图像中提取表型数据至关重要。
本文将一种用于在三维图像中检测和分割物体的当前金标准计算机视觉方法(StartDist-3D)应用于油菜成熟豆荚的X射线显微计算机断层扫描。
经过相对较少的训练,这个微调后的StarDist-3D模型准确地检测出种子(验证F1分数 = 96.3%,测试F1分数 = 99.3%)并预测了种子的形状(平均匹配分数 = 90%)。
该方法随后能够快速提取关于特定荚室中种子数量、大小、形状、种子间距和种子位置的数据,这些数据可整合到植物发育或作物产量模型中。此外,微调后的StarDist-3D提供了一种有效的方法来创建单个种子的分割图像数据集,可用于进一步探索影响种子发育、败育和荚室内成熟同步性的因素。如果目标检测和分割的结构可以描述为星凸多边形,微调后的Stardist-3D方法还有可能应用于其他植物物种种子的图像,以及豆类或麦粒等形状相似的植物结构的图像。