Yan Jingkun, Yan Tianying, Ye Weixin, Lv Xin, Gao Pan, Xu Wei
College of Information Science and Technology, Shihezi University, Shihezi, China.
National-Local Joint Engineering Research Center for Agricultural Big Data, Xinjiang Production and Construction Group, Shihezi, China.
Front Plant Sci. 2023 Jan 31;14:1111175. doi: 10.3389/fpls.2023.1111175. eCollection 2023.
Plant leaf segmentation, especially leaf edge accurate recognition, is the data support for automatically measuring plant phenotypic parameters. However, adjusting the backbone in the current cutting-edge segmentation model for cotton leaf segmentation applications requires various trial and error costs (e.g., expert experience and computing costs). Thus, a simple and effective semantic segmentation architecture (our model) based on the composite backbone was proposed, considering the computational requirements of the mainstream Transformer backbone integrating attention mechanism. The composite backbone was composed of CoAtNet and Xception. CoAtNet integrated the attention mechanism of the Transformers into the convolution operation. The experimental results showed that our model outperformed the benchmark segmentation models PSPNet, DANet, CPNet, and DeepLab v3+ on the cotton leaf dataset, especially on the leaf edge segmentation (MIoU: 0.940, BIoU: 0.608). The composite backbone of our model integrated the convolution of the convolutional neural networks and the attention of the Transformers, which alleviated the computing power requirements of the Transformers under excellent performance. Our model reduces the trial and error cost of adjusting the segmentation model architecture for specific agricultural applications and provides a potential scheme for high-throughput phenotypic feature detection of plants.
植物叶片分割,尤其是叶片边缘的精确识别,是自动测量植物表型参数的数据支撑。然而,在当前用于棉花叶片分割应用的前沿分割模型中调整主干网络需要各种反复试验的成本(例如,专家经验和计算成本)。因此,考虑到集成注意力机制的主流Transformer主干网络的计算需求,提出了一种基于复合主干网络的简单有效的语义分割架构(我们的模型)。该复合主干网络由CoAtNet和Xception组成。CoAtNet将Transformer的注意力机制集成到卷积操作中。实验结果表明,在棉花叶片数据集上,我们的模型优于基准分割模型PSPNet、DANet、CPNet和DeepLab v3+,尤其是在叶片边缘分割方面(平均交并比:0.940,边界交并比:0.608)。我们模型的复合主干网络集成了卷积神经网络的卷积和Transformer的注意力,在性能优异的情况下减轻了对Transformer计算能力的要求。我们的模型降低了针对特定农业应用调整分割模型架构的反复试验成本,并为植物高通量表型特征检测提供了一种潜在方案。