B R Pushpa, N Manohar, Rani N Shobha
Department of Computer Science, School of Computing, Amrita Vishwa Vidyapeetham, Mysuru, India.
Department of Artificial Intelligence and Data Science, GITAM School of Technology, GITAM (Deemed to be) University, Bengaluru, Karnataka, India.
Data Brief. 2025 Jun 24;61:111825. doi: 10.1016/j.dib.2025.111825. eCollection 2025 Aug.
Star gooseberry provides immense health benefits and is widely recognized in the Indian medicinal system. It holds significant importance in the food production, pharmaceuticals, and cosmetics industries due to the presence of therapeutic and pharmacological properties. Due to its beneficial properties, gooseberry fruit is widely used in treating various ailments. Therefore, cultivating these fruits presents an opportunity to generate revenue, benefiting both farmers and the agricultural sector. The post-harvest process of fruit typically performs the quality assessment by segregating fruits based on visual characteristics, which is tedious and prone to human error. Hence, there is a need to develop an automated computer vision model to assess the fruit quality more accurately. This study focuses on dataset collection, including image samples of both single and multiple-star gooseberry fruits to automate fruit grading. This dataset has been specifically developed for research purposes, contributing to fruit detection, quality assessment, weight estimation, and classification of fruits at various ripeness stages. Further, it provides researchers with an opportunity to develop an automated system for detecting overlapping fruits and touching contours using machine learning, deep learning, and computer vision systems. Image samples of star gooseberry at different growth stages were collected from orchids in Mysuru, India. The dataset, named "AmlaNet" comprises 792 image samples of star gooseberry, captured against a plain background from varying angles, sizes, brightness levels, and distances. The dataset is organized into four folders such as single star gooseberry fruit, multiple fruits, overlapped, and annotated samples of overlapped star gooseberry fruits including fruit samples with different ripeness stages. This publicly accessible dataset is expected to benefit the research community, enabling advancement in computer vision and AI Applications. It can be accessed at DOI: 10.17632/2255bdy9mm.1.
醋栗具有巨大的健康益处,在印度医学体系中广为人知。由于其具有治疗和药理特性,在食品生产、制药和化妆品行业具有重要意义。因其有益特性,醋栗果实被广泛用于治疗各种疾病。因此,种植这些果实为创收提供了机会,对农民和农业部门都有益。果实的采后过程通常通过根据视觉特征对果实进行分类来进行质量评估,这既繁琐又容易出现人为错误。因此,需要开发一种自动化的计算机视觉模型来更准确地评估果实质量。本研究重点在于数据集收集,包括单颗和多颗醋栗果实的图像样本,以实现果实分级自动化。该数据集是专门为研究目的而开发的,有助于果实检测、质量评估、重量估计以及对不同成熟阶段果实的分类。此外,它为研究人员提供了一个机会,利用机器学习、深度学习和计算机视觉系统开发一个用于检测重叠果实和触摸轮廓的自动化系统。从印度迈索尔的果园收集了不同生长阶段的醋栗图像样本。该数据集名为“AmlaNet”,包含792个醋栗图像样本,在纯色背景下从不同角度、大小、亮度水平和距离拍摄。该数据集被组织成四个文件夹,如单颗醋栗果实、多颗果实、重叠以及重叠醋栗果实的标注样本,包括不同成熟阶段的果实样本。这个可公开访问的数据集有望使研究界受益,推动计算机视觉和人工智能应用的发展。可通过DOI: 10.17632/2255bdy9mm.1访问。