Aslam Afira, Usman Syed Muhammad, Zubair Muhammad, Yasin Amanullah, Owais Muhammad, Hussain Irfan
Department of Creative Technologies, Faculty of Computing and Artificial Intelligence, Air University, Islamabad, Pakistan.
Department of Computer Science, Bahria School of Engineering and Applied Sciences, Bahria University, Islamabad, Pakistan.
PLoS One. 2025 May 27;20(5):e0324293. doi: 10.1371/journal.pone.0324293. eCollection 2025.
Cotton is a major cash crop, and increasing its production is extremely important worldwide, especially in agriculture-led economies. The crop is susceptible to various diseases, leading to decreased yields. In recent years, advancements in deep learning methods have enabled researchers to develop automated methods for detecting diseases in cotton crops. Such automation not only assists farmers in mitigating the effects of the disease but also conserves resources in terms of labor and fertilizer costs. However, accurate classification of multiple diseases simultaneously in cotton remains challenging due to multiple factors, including class imbalance, variation in disease symptoms, and the need for real-time detection, as most existing datasets are acquired under controlled conditions. This research proposes a novel method for addressing these challenges and accurately classifying seven classes, including six diseases and a healthy class. We address the class imbalance issue through synthetic data generation using conventional methods like scaling, rotating, transforming, shearing, and zooming and propose a customized StyleGAN for synthetic data generation. After preprocessing, we combine features extracted from MobileNet and VGG16 to create a comprehensive feature vector, passed to three classifiers: Long Short Term Memory Units, Support Vector Machines, and Random Forest. We propose a StackNet-based ensemble classifier that takes the output probabilities of these three classifiers and predicts the class label among six diseases-Bacterial blight, Curl virus, Fusarium wilt, Alternaria, Cercospora, Greymildew-and a healthy class. We trained and tested our method on publicly available datasets, achieving an average accuracy of 97%. Our robust method outperforms state-of-the-art techniques to identify the six diseases and the healthy class.
棉花是一种主要的经济作物,提高其产量在全球范围内极其重要,尤其是在以农业为主导的经济体中。这种作物易受多种疾病影响,导致产量下降。近年来,深度学习方法的进步使研究人员能够开发出用于检测棉花作物疾病的自动化方法。这种自动化不仅有助于农民减轻疾病的影响,还能在劳动力和肥料成本方面节省资源。然而,由于多种因素,包括类别不平衡、疾病症状的变化以及实时检测的需求,同时对棉花中的多种疾病进行准确分类仍然具有挑战性,因为大多数现有数据集是在受控条件下获取的。本研究提出了一种新颖的方法来应对这些挑战,并准确地对七个类别进行分类,包括六种疾病和一个健康类别。我们通过使用缩放、旋转、变换、剪切和缩放等传统方法生成合成数据来解决类别不平衡问题,并提出了一种定制的StyleGAN用于合成数据生成。预处理后,我们将从MobileNet和VGG16中提取的特征相结合,创建一个综合特征向量,传递给三个分类器:长短期记忆单元、支持向量机和随机森林。我们提出了一种基于StackNet的集成分类器,它采用这三个分类器的输出概率,并预测六种疾病——细菌性枯萎病、卷曲病毒、枯萎病、链格孢菌、尾孢菌、灰霉病——和一个健康类别中的类别标签。我们在公开可用的数据集上对我们的方法进行了训练和测试,平均准确率达到了97%。我们强大的方法在识别这六种疾病和健康类别方面优于现有技术。