Xie Shuang, Sun Hongwei
School of Automation, Hangzhou Dianzi University, Hangzhou 310083, China.
Sensors (Basel). 2023 Jul 21;23(14):6576. doi: 10.3390/s23146576.
Tea bud target detection is essential for mechanized selective harvesting. To address the challenges of low detection precision caused by the complex backgrounds of tea leaves, this paper introduces a novel model called Tea-YOLOv8s. First, multiple data augmentation techniques are employed to increase the amount of information in the images and improve their quality. Then, the Tea-YOLOv8s model combines deformable convolutions, attention mechanisms, and improved spatial pyramid pooling, thereby enhancing the model's ability to learn complex object invariance, reducing interference from irrelevant factors, and enabling multi-feature fusion, resulting in improved detection precision. Finally, the improved YOLOv8 model is compared with other models to validate the effectiveness of the proposed improvements. The research results demonstrate that the Tea-YOLOv8s model achieves a mean average precision of 88.27% and an inference time of 37.1 ms, with an increase in the parameters and calculation amount by 15.4 M and 17.5 G, respectively. In conclusion, although the proposed approach increases the model's parameters and calculation amount, it significantly improves various aspects compared to mainstream YOLO detection models and has the potential to be applied to tea buds picked by mechanization equipment.
茶芽目标检测对于机械化选择性采摘至关重要。为应对茶叶复杂背景导致检测精度低的挑战,本文介绍了一种名为Tea-YOLOv8s的新型模型。首先,采用多种数据增强技术来增加图像中的信息量并提高其质量。然后,Tea-YOLOv8s模型结合了可变形卷积、注意力机制和改进的空间金字塔池化,从而增强了模型学习复杂物体不变性的能力,减少无关因素的干扰,并实现多特征融合,进而提高检测精度。最后,将改进后的YOLOv8模型与其他模型进行比较,以验证所提改进措施的有效性。研究结果表明,Tea-YOLOv8s模型的平均精度达到88.27%,推理时间为37.1毫秒,参数和计算量分别增加了15.4M和17.5G。总之,尽管所提方法增加了模型的参数和计算量,但与主流YOLO检测模型相比,在各方面都有显著改进,并且有潜力应用于机械化设备采摘的茶芽。