Suppr超能文献

基于深度学习的田间柑橘果实检测与跟踪

Deep-learning-based in-field citrus fruit detection and tracking.

作者信息

Zhang Wenli, Wang Jiaqi, Liu Yuxin, Chen Kaizhen, Li Huibin, Duan Yulin, Wu Wenbin, Shi Yun, Guo Wei

机构信息

Information department, Beijing University of Technology, Beijing, 100022, China.

Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.

出版信息

Hortic Res. 2022 Feb 11;9. doi: 10.1093/hr/uhac003.

Abstract

Fruit yield estimation is crucial to establish fruit harvesting and marketing strategies. Recently, computer vision and deep learning techniques have been used to estimate citrus fruit yield and have exhibited a notable fruit detection ability. However, computer-vision-based citrus fruit counting has two key limitations: inconsistent fruit detection accuracy and double-counting of the same fruit. Using oranges as the experimental material, this paper proposes a deep-learning-based orange counting algorithm using video sequences to help overcome these problems. The algorithm consists of two sub-algorithms, OrangeYolo for fruit detection and OrangeSort for fruit tracking. The OrangeYolo backbone network is partially based on the YOLOv3 algorithm and improved upon to detect small object fruits at multiple scales. The network structure was adjusted to detect small-scale targets while enabling multiscale target detection. A channel attention and spatial attention multiscale fusion module was introduced to fuse the semantic features of the deep network with the shallow textural detail features. OrangeYolo can reach mean Average Precision (mAP) to 0.957 in the citrus dataset, which is higher than the 0.905, 0.911, and 0.917 that the YOLOv3, YOLOv4 and YOLOv5 algorithms. OrangeSort was designed to alleviate the double-counting problem of occluded fruits. A specific tracking region counting strategy and tracking algorithm based on motion displacement estimation are established. Six video sequences, which were taken from two fields containing 22 trees, were used as a validation dataset. The proposed method showed better performance (Mean Absolute Error(MAE) = 0.081, Standard Deviation(SD) = 0.08) compared to video-based manual counting and demonstrated more accurate results compared with existing standard Sort and DeepSort (MAE = 0.45, 1.212; SD = 0.4741, 1.3975; respectively).

摘要

水果产量估计对于制定水果收获和营销策略至关重要。最近,计算机视觉和深度学习技术已被用于估计柑橘类水果产量,并展现出显著的水果检测能力。然而,基于计算机视觉的柑橘类水果计数存在两个关键限制:水果检测精度不一致以及同一水果的重复计数。本文以橙子为实验材料,提出一种基于深度学习的橙子计数算法,利用视频序列来帮助克服这些问题。该算法由两个子算法组成,用于水果检测的OrangeYolo和用于水果跟踪的OrangeSort。OrangeYolo骨干网络部分基于YOLOv3算法,并进行了改进以在多个尺度上检测小目标水果。调整了网络结构以检测小尺度目标,同时实现多尺度目标检测。引入了通道注意力和空间注意力多尺度融合模块,将深度网络的语义特征与浅层纹理细节特征进行融合。OrangeYolo在柑橘数据集上的平均精度均值(mAP)可达0.957,高于YOLOv3、YOLOv4和YOLOv5算法的0.905、0.911和0.917。OrangeSort旨在缓解被遮挡水果的重复计数问题。建立了一种基于运动位移估计的特定跟踪区域计数策略和跟踪算法。从包含22棵树的两个果园拍摄的六个视频序列用作验证数据集。与基于视频的人工计数相比,所提出的方法表现出更好的性能(平均绝对误差(MAE)=0.081,标准差(SD)=0.08),并且与现有的标准Sort和DeepSort相比,结果更准确(MAE分别为0.45、1.212;SD分别为0.4741、1.3975)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a81b/9113225/23dfde45e02f/uhac003f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验