Gené-Mola Jordi, Ferrer-Ferrer Mar, Hemming Jochen, van Dalfsen Pieter, de Hoog Dirk, Sanz-Cortiella Ricardo, Rosell-Polo Joan R, Morros Josep-Ramon, Vilaplana Verónica, Ruiz-Hidalgo Javier, Gregorio Eduard
Efficient Use of Water in Agriculture Program, Institute of AgriFood Research and Technology (IRTA), Fruitcentre, Parc Científic i Tecnològic Agroalimentari de Gardeny (PCiTAL), 25003 Lleida, Catalonia, Spain.
Research Group in AgroICT& Precision Agriculture - GRAP, Department of Agricultural and Forest Sciences and Engineering, Universitat de Lleida (UdL) - Agrotecnio-CERCA Center, Lleida, Catalonia, Spain.
Data Brief. 2023 Dec 30;52:110000. doi: 10.1016/j.dib.2023.110000. eCollection 2024 Feb.
The present dataset comprises a collection of RGB-D apple tree images that can be used to train and test computer vision-based fruit detection and sizing methods. This dataset encompasses two distinct sets of data obtained from a Fuji and an Elstar apple orchards. The Fuji apple orchard sub-set consists of 3925 RGB-D images containing a total of 15,335 apples annotated with both modal and amodal apple segmentation masks. Modal masks denote the visible portions of the apples, whereas amodal masks encompass both visible and occluded apple regions. Notably, this dataset is the first public resource to incorporate on-tree fruit amodal masks. This pioneering inclusion addresses a critical gap in existing datasets, enabling the development of robust automatic fruit sizing methods and accurate fruit visibility estimation, particularly in the presence of partial occlusions. Besides the fruit segmentation masks, the dataset also includes the fruit size (calliper) ground truth for each annotated apple. The second sub-set comprises 2731 RGB-D images capturing five Elstar apple trees at four distinct growth stages. This sub-set includes mean diameter information for each tree at every growth stage and serves as a valuable resource for evaluating fruit sizing methods trained with the first sub-set. The present data was employed in the research paper titled "Looking behind occlusions: a study on amodal segmentation for robust on-tree apple fruit size estimation" [1].
本数据集包含一组RGB-D苹果树图像,可用于训练和测试基于计算机视觉的水果检测和尺寸测量方法。该数据集涵盖从富士苹果园和埃尔斯特苹果园获得的两组不同数据。富士苹果园子集由3925张RGB-D图像组成,总共包含15335个苹果,并用模态和非模态苹果分割掩码进行了标注。模态掩码表示苹果的可见部分,而非模态掩码则涵盖可见和被遮挡的苹果区域。值得注意的是,该数据集是第一个纳入树上水果非模态掩码的公共资源。这一开创性的纳入解决了现有数据集中的一个关键空白,能够开发强大的自动水果尺寸测量方法和准确的水果可见性估计,特别是在存在部分遮挡的情况下。除了水果分割掩码外,该数据集还包括每个标注苹果的水果尺寸(卡尺)真值。第二个子集由2731张RGB-D图像组成,在四个不同生长阶段拍摄了五棵埃尔斯特苹果树。该子集包括每个生长阶段每棵树的平均直径信息,是评估用第一个子集训练的水果尺寸测量方法的宝贵资源。本数据被用于题为《透过遮挡看背后:关于稳健的树上苹果果实尺寸估计的非模态分割研究》的研究论文[1]。