• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BDMANGO:一个基于芒果叶识别芒果品种的图像数据集。

BDMANGO: An image dataset for identifying the variety of mango based on the mango leaves.

作者信息

Islam Mohammad Manzurul, Ahmed Md Jubayer, Shafi Mahmud Bin, Das Aritra, Hasan Md Rakibul, Rafi Abdullah Al, Rashid Mohammad Rifat Ahmmad, Niloy Nishat Tasnim, Ali Md Sawkat, Chowdhury Abdullahi, Rasel Ahmed Abdal Shafi

机构信息

Department of Computer Science and Engineering, East West University, Aftabnagar, Dhaka, Bangladesh.

出版信息

Data Brief. 2024 Dec 19;58:111241. doi: 10.1016/j.dib.2024.111241. eCollection 2025 Feb.

DOI:10.1016/j.dib.2024.111241
PMID:39840229
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11748707/
Abstract

In the field of agriculture, particularly within the context of machine learning applications, quality datasets are essential for advancing research and development. To address the challenges of identifying different mango leaf types and recognizing the diverse and unique characteristics of mango varieties in Bangladesh, a comprehensive and publicly accessible dataset titled "BDMANGO" has been created. This dataset includes images essential for research, featuring six mango varieties: Amrapali, Banana, Chaunsa, Fazli, Haribhanga, and Himsagar, which were collected from different locations. The images were captured using the rear cameras of a Google Pixel 6a and an iPhone XR and were stored in 640 × 480 pixels resolution. Both sides of each mango leaf were photographed against white background to accurately reflect real-world scenarios in mango cultivation fields. The white background was specifically chosen to remove noise in image sample, allowing for accurate feature extraction by machine learning algorithms. This will ensure the trained model's efficacy in identifying a specific mango leaf while implemented alongside any segmentation algorithm. Additionally, image augmentation techniques such as rotation, horizontal flip, vertical flip, width shift, height shift, shear range, and zooming were applied to expand the dataset from 837 original images to a total of 6696 images (837 original image and 5859 augmented images). This expansion significantly enhances the dataset's utility for training, testing, and validating machine learning models designed for classifying mango leaf varieties, thereby supporting research efforts in this domain.

摘要

在农业领域,特别是在机器学习应用的背景下,高质量的数据集对于推动研究与开发至关重要。为应对识别不同芒果叶类型以及识别孟加拉国芒果品种多样且独特特征的挑战,创建了一个名为“BDMANGO”的全面且可公开访问的数据集。该数据集包含研究所需的图像,有六个芒果品种:阿姆拉普利、香蕉、乔恩萨、法兹利、哈里班加和希姆萨加尔,这些图像是从不同地点收集的。图像使用谷歌Pixel 6a和iPhone XR的后置摄像头拍摄,存储分辨率为640×480像素。每张芒果叶的两面都以白色背景拍摄,以准确反映芒果种植园的真实场景。特意选择白色背景是为了去除图像样本中的噪声,以便机器学习算法进行准确的特征提取。这将确保在与任何分割算法一起实施时,训练模型在识别特定芒果叶方面的有效性。此外,还应用了旋转、水平翻转、垂直翻转、宽度偏移、高度偏移、剪切范围和缩放等图像增强技术,将数据集从837张原始图像扩展到总共6696张图像(837张原始图像和5859张增强图像)。这种扩展显著提高了数据集在训练、测试和验证用于分类芒果叶品种的机器学习模型方面的效用,从而支持该领域的研究工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/1ef488d2ae3a/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/4f0c5ffb8224/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/618c307c4c61/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/aa9f3083e468/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/c127a99a33d5/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/1ef488d2ae3a/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/4f0c5ffb8224/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/618c307c4c61/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/aa9f3083e468/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/c127a99a33d5/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f1e1/11748707/1ef488d2ae3a/gr5.jpg

相似文献

1
BDMANGO: An image dataset for identifying the variety of mango based on the mango leaves.BDMANGO:一个基于芒果叶识别芒果品种的图像数据集。
Data Brief. 2024 Dec 19;58:111241. doi: 10.1016/j.dib.2024.111241. eCollection 2025 Feb.
2
Advancing mango leaf variant identification with a robust multi-layer perceptron model.利用稳健的多层感知机模型推进芒果叶变体识别。
Sci Rep. 2024 Nov 9;14(1):27406. doi: 10.1038/s41598-024-74612-0.
3
MangoLeafBD: A comprehensive image dataset to classify diseased and healthy mango leaves.芒果叶BD:一个用于对患病和健康芒果叶进行分类的综合图像数据集。
Data Brief. 2023 Jan 30;47:108941. doi: 10.1016/j.dib.2023.108941. eCollection 2023 Apr.
4
CottonFabricImageBD: An image dataset characterized by the percentage of cotton in a fabric for computer vision-based garment recycling.棉织物图像数据集BD:一个以织物中棉花百分比为特征的图像数据集,用于基于计算机视觉的服装回收。
Data Brief. 2024 Jul 6;55:110712. doi: 10.1016/j.dib.2024.110712. eCollection 2024 Aug.
5
BananaImageBD: A comprehensive banana image dataset for classification of banana varieties and detection of ripeness stages in Bangladesh.香蕉图像数据库(BananaImageBD):一个用于孟加拉国香蕉品种分类和成熟阶段检测的综合香蕉图像数据集。
Data Brief. 2024 Dec 19;58:111239. doi: 10.1016/j.dib.2024.111239. eCollection 2025 Feb.
6
A comprehensive image dataset for the identification of eggplant leaf diseases and computer vision applications.一个用于识别茄子叶部病害和计算机视觉应用的综合图像数据集。
Data Brief. 2025 Jan 31;59:111353. doi: 10.1016/j.dib.2025.111353. eCollection 2025 Apr.
7
SoyNet: A high-resolution Indian soybean image dataset for leaf disease classification.SoyNet:用于叶片病害分类的高分辨率印度大豆图像数据集。
Data Brief. 2023 Jul 26;49:109447. doi: 10.1016/j.dib.2023.109447. eCollection 2023 Aug.
8
A comprehensive image dataset for the identification of lemon leaf diseases and computer vision applications.一个用于识别柠檬叶病害和计算机视觉应用的综合图像数据集。
Data Brief. 2024 Dec 19;58:111244. doi: 10.1016/j.dib.2024.111244. eCollection 2025 Feb.
9
An extensive image dataset for deep learning-based classification of rice kernel varieties in Bangladesh.用于基于深度学习对孟加拉国水稻品种进行分类的大规模图像数据集。
Data Brief. 2024 Nov 6;57:111109. doi: 10.1016/j.dib.2024.111109. eCollection 2024 Dec.
10
BananaLSD: A banana leaf images dataset for classification of banana leaf diseases using machine learning.香蕉叶 LSD:一个用于通过机器学习对香蕉叶疾病进行分类的香蕉叶图像数据集。
Data Brief. 2023 Sep 22;50:109608. doi: 10.1016/j.dib.2023.109608. eCollection 2023 Oct.

引用本文的文献

1
MangoImageBD: An extensive mango image dataset for identification and classification of various mango varieties in Bangladesh.芒果图像数据库(MangoImageBD):一个用于识别和分类孟加拉国各种芒果品种的大型芒果图像数据集。
Data Brief. 2025 Jul 21;62:111908. doi: 10.1016/j.dib.2025.111908. eCollection 2025 Oct.

本文引用的文献

1
MFCIS: an automatic leaf-based identification pipeline for plant cultivars using deep learning and persistent homology.MFCIS:一种基于深度学习和持久同调的用于植物品种自动叶片识别流程
Hortic Res. 2021 Aug 1;8(1):172. doi: 10.1038/s41438-021-00608-w.
2
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.