Department of Plant and Environmental Science, University of Copenhagen, Højbakkegårds Alle 13, Tåstrup, 2630, Denmark.
Department of Computer Science, University of Copenhagen, Universitetsparken 1, 2100, Copenhagen, Denmark.
New Phytol. 2022 Oct;236(2):774-791. doi: 10.1111/nph.18387. Epub 2022 Aug 10.
Convolutional neural networks (CNNs) are a powerful tool for plant image analysis, but challenges remain in making them more accessible to researchers without a machine-learning background. We present RootPainter, an open-source graphical user interface based software tool for the rapid training of deep neural networks for use in biological image analysis. We evaluate RootPainter by training models for root length extraction from chicory (Cichorium intybus L.) roots in soil, biopore counting, and root nodule counting. We also compare dense annotations with corrective ones that are added during the training process based on the weaknesses of the current model. Five out of six times the models trained using RootPainter with corrective annotations created within 2 h produced measurements strongly correlating with manual measurements. Model accuracy had a significant correlation with annotation duration, indicating further improvements could be obtained with extended annotation. Our results show that a deep-learning model can be trained to a high accuracy for the three respective datasets of varying target objects, background, and image quality with < 2 h of annotation time. They indicate that, when using RootPainter, for many datasets it is possible to annotate, train, and complete data processing within 1 d.
卷积神经网络(CNNs)是植物图像分析的强大工具,但对于没有机器学习背景的研究人员来说,仍然存在使其更易于使用的挑战。我们提出了 RootPainter,这是一个基于图形用户界面的开源软件工具,用于快速训练深度神经网络,用于生物图像分析。我们通过在土壤中从菊苣(Cichorium intybus L.)根中提取根长、生物孔计数和根瘤计数来评估 RootPainter。我们还比较了密集注释和在训练过程中根据当前模型的弱点添加的纠正注释。在使用 RootPainter 进行训练的模型中,有 5 次中有 6 次使用在 2 小时内创建的纠正注释,其测量结果与手动测量结果高度相关。模型准确性与注释持续时间有显著相关性,表明通过扩展注释可以进一步提高准确性。我们的结果表明,可以使用少于 2 小时的注释时间,对具有不同目标对象、背景和图像质量的三个相应数据集进行高度准确的深度学习模型训练。它们表明,在使用 RootPainter 时,对于许多数据集,可以在 1 天内完成注释、训练和数据处理。