Shandilya Gunjan, Gupta Sheifali, Mohamed Heba G, Bharany Salil, Rehman Ateeq Ur, Hussen Seada
Chitkara University Institute of Engineering and Technology Chitkara University Punjab India.
Department of Electrical Engineering, College of Engineering Princess Nourah Bint Abdulrahman University Riyadh Saudi Arabia.
Food Sci Nutr. 2025 Jun 30;13(7):e70513. doi: 10.1002/fsn3.70513. eCollection 2025 Jul.
Maize crop productivity is significantly impacted by various foliar diseases, emphasizing the need for early, accurate, and automated disease detection methods to enable timely intervention and ensure optimal crop management. Traditional classification techniques often fall short in capturing the complex visual patterns inherent in disease-affected leaf imagery, resulting in limited diagnostic performance. To overcome these limitations, this study introduces a robust hybrid deep learning framework that synergistically combines convolutional neural networks (CNNs) and vision transformers (ViTs) for enhanced maize leaf disease classification. In the proposed architecture, the CNN module effectively extracts fine-grained local features, while the ViT module captures long-range contextual dependencies through self-attention mechanisms. The complementary features obtained from both branches are concatenated and passed through fully connected layers for final classification. Data from Mendeley and Kaggle were used to build and check the model, and the model did this by applying image resizing, data normalization, expanding its training data, and shuffling the data to increase generalization. Additional testing is done on the corn disease and severity (CD&S) dataset, which is separate from the main combined dataset. After validation, the accuracy of the proposed model was 99.15%, and each of its precision, recall, and F1-score equaled 99.13%. To confirm it is statistically reliable, 5-fold cross-validation was performed, reporting on the Kaggle + Mendeley set an average accuracy of 99.06% and on the CD&S dataset 95.93%. As both of these scores are high, it shows that the model works well across other datasets as well. Experiments have shown that Hybrid CNN-ViT works better than standalone CNNs. Dropout regularization and using the RAdam optimizer greatly improved both stability and performance. The model stood out as a reliable, high-accuracy method for discovering maize diseases correctly, which may be valuable in real agricultural settings.
玉米作物的生产力受到多种叶部病害的显著影响,这凸显了需要早期、准确且自动化的病害检测方法,以便能够及时进行干预并确保最佳的作物管理。传统的分类技术在捕捉受病害影响的叶片图像中固有的复杂视觉模式方面往往存在不足,导致诊断性能有限。为了克服这些限制,本研究引入了一个强大的混合深度学习框架,该框架将卷积神经网络(CNN)和视觉Transformer(ViT)协同结合,以增强玉米叶部病害分类。在所提出的架构中,CNN模块有效地提取细粒度的局部特征,而ViT模块通过自注意力机制捕捉长距离上下文依赖关系。从两个分支获得的互补特征被连接起来并通过全连接层进行最终分类。使用来自Mendeley和Kaggle的数据来构建和检验模型,该模型通过应用图像缩放、数据归一化、扩展训练数据以及对数据进行洗牌来提高泛化能力。在与主要组合数据集分开的玉米病害与严重程度(CD&S)数据集上进行了额外测试。经过验证,所提出模型的准确率为99.15%,其精确率、召回率和F1分数均为99.13%。为了确认其在统计上是可靠的,进行了5折交叉验证,在Kaggle + Mendeley集上报告的平均准确率为99.06%,在CD&S数据集上为95.93%。由于这两个分数都很高,表明该模型在其他数据集上也表现良好。实验表明,混合CNN - ViT比单独的CNN效果更好。随机失活正则化和使用RAdam优化器极大地提高了稳定性和性能。该模型作为一种可靠的、高精度的正确发现玉米病害的方法脱颖而出,这在实际农业环境中可能具有重要价值。