Srivathsan M Sundara, Jenish S Alden, Arvindhan K, Karthik R
School of Electronics Engineering, Vellore Institute of Technology, Chennai, India.
Centre for Cyber Physical Systems, Vellore Institute of Technology, Chennai, India.
Sci Rep. 2025 Apr 6;15(1):11750. doi: 10.1038/s41598-025-95985-w.
Cassava is a tuberous edible plant native to the American tropics and is essential for its versatile applications including cassava flour, bread, tapioca, and laundry starch. Cassava leaf diseases reduce crop yields, elevate production costs, and disrupt market stability. This places significant burdens on farmers and economies while highlighting the need for effective management strategies. Traditional methods of manual disease diagnosis are costly, labor-intensive, and time-consuming. This research aims to address the challenge of accurate disease classification by overcoming the limitations of existing methods, which encounter difficulties with the complexity and variability of leaf disease symptoms. To the best of our knowledge, this is the first study to propose a novel dual-track feature aggregation architecture that integrates the Residual Inception Positional Encoding Attention (RIPEA) Network with EfficientNet for the classification of cassava leaf diseases. The proposed model employs a dual-track feature aggregation architecture which integrates the RIPEA Network with EfficientNet. The RIPEA track extracts significant features by leveraging residual connections for preserving gradients and uses multi-scale feature fusion for combining fine-grained details with broader patterns. It also incorporates Coordinate and Mixed Attention mechanisms which focus on cross-channel and long-range dependencies. The extracted features from both tracks are aggregated for classification. Furthermore, it incorporates an image augmentation method and a cosine decay learning rate schedule to improve model training. This improves the ability of the model to accurately differentiate between Cassava Bacterial Blight (CBB), Brown Streak Disease (CBSD), Green Mottle (CGM), Mosaic Disease (CMD), and healthy leaves, addressing both local textures and global structures. Additionally, to enhance the interpretability of the model, we apply Grad-CAM to provide visual explanations for the model's decision-making process, helping to understand which regions of the leaf images contribute to the classification results. The proposed network achieved a classification accuracy of 93.06%.
木薯是一种原产于美洲热带地区的块茎可食用植物,因其在多种应用中的广泛用途而至关重要,这些应用包括木薯粉、面包、木薯淀粉和洗衣淀粉。木薯叶病会降低作物产量、提高生产成本并扰乱市场稳定。这给农民和经济带来了巨大负担,同时凸显了有效管理策略的必要性。传统的人工疾病诊断方法成本高昂、劳动强度大且耗时。本研究旨在通过克服现有方法的局限性来应对准确疾病分类的挑战,现有方法在叶病症状的复杂性和变异性方面存在困难。据我们所知,这是第一项提出一种新颖的双轨特征聚合架构的研究,该架构将残差 inception 位置编码注意力(RIPEA)网络与 EfficientNet 集成用于木薯叶病的分类。所提出的模型采用了一种双轨特征聚合架构,该架构将 RIPEA 网络与 EfficientNet 集成。RIPEA 轨道通过利用残差连接来保留梯度来提取显著特征,并使用多尺度特征融合将细粒度细节与更广泛的模式相结合。它还结合了坐标和混合注意力机制,这些机制专注于跨通道和长程依赖性。从两个轨道提取的特征被聚合用于分类。此外,它还结合了一种图像增强方法和余弦衰减学习率调度来改进模型训练。这提高了模型准确区分木薯细菌性枯萎病(CBB)、褐色条纹病(CBSD)、绿色斑驳病(CGM)、花叶病(CMD)和健康叶片的能力,同时兼顾了局部纹理和全局结构。此外,为了提高模型的可解释性,我们应用 Grad-CAM 为模型的决策过程提供可视化解释,有助于理解叶图像的哪些区域对分类结果有贡献。所提出的网络实现了 93.06%的分类准确率。