Ahmed Sarder Iftekhar, Ibrahim Muhammad, Nadim Md, Rahman Md Mizanur, Shejunti Maria Mehjabin, Jabid Taskeed, Ali Md Sawkat
Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh.
Department of Computer Science and Engineering, University of Dhaka, Dhaka, Bangladesh.
Data Brief. 2023 Jan 30;47:108941. doi: 10.1016/j.dib.2023.108941. eCollection 2023 Apr.
Agriculture is one of the few remaining sectors that is yet to receive proper attention from the machine learning community. The importance of datasets in the machine learning discipline cannot be overemphasized. The lack of standard and publicly available datasets related to agriculture impedes practitioners of this discipline to harness the full benefit of these powerful computational predictive tools and techniques. To improve this scenario, we develop, to the best of our knowledge, the first-ever standard, ready-to-use, and publicly available dataset of mango leaves. The images are collected from four mango orchards of Bangladesh, one of the top mango-growing countries of the world. The dataset contains 4000 images of about 1800 distinct leaves covering seven diseases. Although the dataset is developed using mango leaves of Bangladesh only, since we deal with diseases that are common across many countries, this dataset is likely to be applicable to identify mango diseases in other countries as well, thereby boosting mango yield. This dataset is expected to draw wide attention from machine learning researchers and practitioners in the field of automated agriculture.
农业是为数不多的尚未受到机器学习社区适当关注的领域之一。数据集在机器学习学科中的重要性再怎么强调也不为过。缺乏与农业相关的标准且公开可用的数据集阻碍了该学科的从业者充分利用这些强大的计算预测工具和技术。为改善这种情况,据我们所知,我们开发了首个标准的、即用型且公开可用的芒果叶数据集。这些图像是从孟加拉国的四个芒果园收集的,孟加拉国是世界上最大的芒果种植国之一。该数据集包含约1800片不同叶子的4000张图像,涵盖七种病害。尽管该数据集仅使用孟加拉国的芒果叶开发,但由于我们处理的是许多国家常见的病害,这个数据集很可能也适用于识别其他国家的芒果病害,从而提高芒果产量。预计该数据集将引起自动化农业领域机器学习研究人员和从业者的广泛关注。