Vélez Sergio, Ariza-Sentís Mar, Triviño Mario, Cob-Parro Antonio Carlos, Mila Miquel, Valente João
JRU Drone Technology, Department of Architectural Constructions and I.C.T., University of Burgos, Burgos, 09001, Spain.
Information Technology Group, Wageningen University & Research, Wageningen, 6708 PB, the Netherlands.
Heliyon. 2025 Feb 6;11(4):e42525. doi: 10.1016/j.heliyon.2025.e42525. eCollection 2025 Feb 28.
Viticulture benefits significantly from rapid grape bunch identification and counting, enhancing yield and quality. Recent technological and machine learning advancements, particularly in deep learning, have provided the tools necessary to create more efficient, automated processes that significantly reduce the time and effort required for these tasks. On one hand, drone, or Unmanned Aerial Vehicles (UAV) imagery combined with deep learning algorithms has revolutionised agriculture by automating plant health classification, disease identification, and fruit detection. However, these advancements often remain inaccessible to farmers due to their reliance on specialized hardware like ground robots or UAVs. On the other hand, most farmers have access to smartphones. This article proposes a novel approach combining UAVs and smartphone technologies. An AI-based framework is introduced, integrating a 5-stage AI pipeline combining object detection and pixel-level segmentation algorithms to automatically detect grape bunches in smartphone images of a commercial vineyard with vertical trellis training. By leveraging UAV-captured data for training, the proposed model not only accelerates the detection process but also enhances the accuracy and adaptability of grape bunch detection across different devices, surpassing the efficiency of traditional and purely UAV-based methods. To this end, using a dataset of UAV videos recorded during early growth stages in July (BBCH77-BBCH79), the X-Decoder segments vegetation in the front of the frames from their background and surroundings. X-Decoder is particularly advantageous because it can be seamlessly integrated into the AI pipeline without requiring changes to how data is captured, making it more versatile than other methods. Then, YOLO is trained using the videos and further applied to images taken by farmers with common smartphones (Xiaomi Poco X3 Pro and iPhone X). In addition, a web app was developed to connect the system with mobile technology easily. The proposed approach achieved a precision of 0.92 and recall of 0.735, with an F1 score of 0.82 and an Average Precision (AP) of 0.802 under different operation conditions, indicating high accuracy and reliability in detecting grape bunches. In addition, the AI-detected grape bunches were compared with the actual ground truth, achieving an R value as high as 0.84, showing the robustness of the system. This study highlights the potential of using smartphone imaging and web applications together, making an effort to integrate these models into a real platform for farmers, offering a practical, affordable, accessible, and scalable solution. While smartphone-based image collection for model training is labour-intensive and costly, incorporating UAV data accelerates the process, facilitating the creation of models that generalise across diverse data sources and platforms. This blend of UAV efficiency and smartphone precision significantly cuts vineyard monitoring time and effort.
葡萄栽培从快速的葡萄串识别和计数中受益匪浅,这有助于提高产量和品质。近期的技术和机器学习进展,尤其是深度学习方面的进展,提供了创建更高效、自动化流程所需的工具,这些流程能显著减少完成这些任务所需的时间和精力。一方面,无人机或无人驾驶飞行器(UAV)图像与深度学习算法相结合,通过实现植物健康分类、病害识别和果实检测的自动化,彻底改变了农业。然而,由于依赖地面机器人或无人机等专业硬件,这些进展对农民来说往往遥不可及。另一方面,大多数农民都能使用智能手机。本文提出了一种将无人机和智能手机技术相结合的新方法。引入了一个基于人工智能的框架,该框架集成了一个5阶段的人工智能管道,结合了目标检测和像素级分割算法,以自动检测商业葡萄园垂直棚架栽培下智能手机图像中的葡萄串。通过利用无人机捕获的数据进行训练,所提出的模型不仅加速了检测过程,还提高了葡萄串检测在不同设备上的准确性和适应性,超越了传统方法和单纯基于无人机方法的效率。为此,使用7月早期生长阶段(BBCH77 - BBCH79)记录的无人机视频数据集,X - Decoder将帧前的植被与其背景和周围环境分割开来。X - Decoder特别有优势,因为它可以无缝集成到人工智能管道中,而无需改变数据采集方式,使其比其他方法更具通用性。然后,使用这些视频对YOLO进行训练,并进一步应用于农民使用普通智能手机(小米Poco X3 Pro和iPhone X)拍摄的图像。此外,还开发了一个网络应用程序,以便轻松地将系统与移动技术连接起来。所提出的方法在不同操作条件下实现了0.92的精度、0.735的召回率、0.82的F1分数和0.802的平均精度(AP),表明在检测葡萄串方面具有很高的准确性和可靠性。此外,将人工智能检测到的葡萄串与实际地面真值进行比较,获得了高达0.84的R值,显示了该系统的稳健性。这项研究突出了同时使用智能手机成像和网络应用程序的潜力,努力将这些模型集成到一个面向农民的真实平台上,提供了一个实用、经济实惠、易于使用且可扩展的解决方案。虽然基于智能手机的图像采集用于模型训练既费力又昂贵,但纳入无人机数据加速了这一过程,有助于创建能够在不同数据源和平台上通用的模型。无人机效率和智能手机精度的这种结合显著减少了葡萄园监测的时间和精力。