Shacha Arnob Das, Durjoy Sabbir Hossain, Shikder Md Emon, Kamal Md Mostafa, Shoib Md Mehedi Hasan, Bijoy Md Hasan Imam
Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka 1216, Bangladesh.
Data Brief. 2025 Aug 12;62:111968. doi: 10.1016/j.dib.2025.111968. eCollection 2025 Oct.
The Rose (genus Rosa) has become a significant factor in the Bangladeshi flower industry, both in terms of exports and local consumption. However, rose farming in this country faces serious challenges due to diseases affecting its leaves, which weaken the plants and result in lower flower yields and financial losses for farmers. Rosa (genus Rosa) is one of the most attractive and commercially valuable flower genera. However, agricultural rose production faces several challenges, such as pesticide resistance, which affects plant growth and results in a reduced quantity and quality of healthy flowers. Several natural factors also cause interference with rose production. Most farmers involved in this industry have limited education, which hinders their ability to identify early-stage rose-leaf disease solely through visual inspection. Furthermore, limited communication with agricultural experts exacerbates the situation, leading to delayed interventions and economic losses. This study presents the rose leaf disease dataset, which would help enhance disease tracking, diagnosis, and research in roses. From October 2024 to January 2025, large-scale field surveys were conducted to capture quality images for each condition class in rose leaves. In this paper, four classes comprise 'Black Spot,' 'Insect Hole,' 'Yellow Mosaic Virus,' and 'Healthy,' representing different stages in disease progression. There are 3,228 original images, categorized as follows: Black Spot (409), Insect Hole (453), Yellow Mosaic Virus (680), and Healthy (1,686). During the pre-processing stage, the images are resized to 3000×3000 pixels, and low-quality, duplicate, or irrelevant images are removed to ensure high quality. We have employed various augmentation techniques, including rotation, flipping, contrast adjustment, blurring, shearing, zooming, and noise addition, to increase the dataset size and enhance model generalization. Datasets like this one are in high demand for agricultural research, leading to improved disease management and increased yields. These goals can be achieved through high-accuracy machine-learning models for early disease detection and cause identification. This gives the farmers more time to take necessary actions for disease prevention and pest control. This tech-based system combines the field of agriculture with the cutting edge of computer science and AI, making precision agriculture even more effective and efficient. Our dataset is designed to meet the need for data to train these models and provide a baseline benchmark for disease detection in our specific crop, the Rose. Improvements in different generations of models, as well as numerous other forms of scientific advancements, can lead to further increases in efficiency and ultimately result in better, smarter farms. In our initial testing for categorizing rose leaves, we employed two well-known transfer learning models. Among them, MobileNetV2 performed exceptionally well, achieving an accuracy of 96.79% in image classification. This dataset can be integrated with innovative farming equipment, such as drones and sensors, to monitor large fields in real-time. This dataset serves as a benchmark for training deep learning models, enabling enhanced automated monitoring and decision-making in precision agriculture.
玫瑰(蔷薇属)在孟加拉国花卉产业中已成为一个重要因素,无论是在出口还是本地消费方面。然而,该国的玫瑰种植面临严峻挑战,因为叶片病害会影响植株,削弱其生长势,导致花朵产量降低,给种植户造成经济损失。蔷薇属是最具吸引力且具有商业价值的花卉属之一。然而,玫瑰的农业生产面临诸多挑战,比如病虫害抗药性问题,这会影响植株生长,导致健康花朵的数量和质量下降。一些自然因素也会干扰玫瑰生产。该行业的大多数种植户受教育程度有限,这使得他们仅通过目视检查难以识别玫瑰叶片病害的早期阶段。此外,与农业专家的沟通有限使情况更加恶化,导致干预延迟和经济损失。本研究展示了玫瑰叶片病害数据集,这将有助于加强对玫瑰病害的跟踪、诊断和研究。在2024年10月至2025年1月期间,进行了大规模实地调查,以获取玫瑰叶片各病症类别的高质量图像。本文中的四个类别包括“黑斑病”“虫洞”“黄花叶病毒”和“健康”,代表病害发展的不同阶段。共有3228张原始图像,分类如下:黑斑病(409张)、虫洞(453张)、黄花叶病毒(680张)和健康(1686张)。在预处理阶段,图像被调整为3000×3000像素,并去除低质量、重复或无关的图像以确保高质量。我们采用了多种增强技术,包括旋转、翻转、对比度调整、模糊、剪切、缩放和添加噪声,以增加数据集规模并提高模型泛化能力。这样的数据集在农业研究中需求很大,有助于改善病害管理并提高产量。通过用于早期病害检测和病因识别的高精度机器学习模型可以实现这些目标。这使种植户有更多时间采取必要的病害预防和虫害控制措施。这种基于技术的系统将农业领域与计算机科学和人工智能的前沿技术相结合,使精准农业更加有效和高效。我们的数据集旨在满足训练这些模型的数据需求,并为我们特定作物玫瑰的病害检测提供基线基准。不同代模型的改进以及许多其他形式的科学进步可以进一步提高效率,最终带来更好、更智能的农场。在我们对玫瑰叶片进行分类的初步测试中,我们使用了两个著名的迁移学习模型。其中,MobileNetV2表现出色,在图像分类中准确率达到96.79%。该数据集可与无人机和传感器等创新型农用设备集成,以实时监测大片农田。这个数据集可作为训练深度学习模型的基准,有助于在精准农业中加强自动化监测和决策。