Ta-YOLO: overcoming target blocked challenges in greenhouse tomato detection and counting.

Authors

Zhao Yun, Chen Yijia, Xu Xing, He Yong, Gan Hao, Wu Na, Wang Zhechen, Sun Xi, Wang Yali, Skobelev Petr, Mi Yanan

Affiliations

School of Artificial Intelligence and Information Engineering, Zhejiang University of Science and Technology, Hangzhou, China.

College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, China.

Publication information

Front Plant Sci. 2025 Jul 8;16:1618214. doi: 10.3389/fpls.2025.1618214. eCollection 2025.

DOI:10.3389/fpls.2025.1618214

PMID:40697876

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12279841/

Abstract

Screening and cultivating healthy small tomatoes and accurately predicting their yields are crucial for sustaining the tomato industry's economy. In field scenarios, however, counting small tomato fruits is often hindered by environmental factors such as leaf shading. To address this challenge, this study proposes the Ta-YOLO modeling framework, which aims to improve the efficiency and accuracy of small tomato fruit detection. We captured images of small tomatoes at various stages of ripeness in real-world settings and compiled them into datasets for training and testing the model. First, we used the Space-to-Depth module to exploit the implicit features of the images while keeping the backbone network lightweight. Next, we developed a novel pyramid pooling module (DASPPF) that captures global information through average pooling, reducing the impact of edge and background noise on detection. We also added a tiny-target detection head alongside the original detection head, enabling multi-scale detection of small tomatoes. To further sharpen the model's focus on relevant information and improve its ability to recognize small targets, we designed a multi-dimensional attention structure (CSAM) that generates more informative feature maps. Finally, we proposed the EWDIoU bounding box loss function, which uses a 2D Gaussian distribution to improve the model's accuracy and robustness. In our experiments, Ta-YOLO had 10.58M parameters, 14.4G FLOPs, and a speed of 131.58 FPS, and reached a mean average precision (mAP) of 84.4%. The model counts tomatoes at different maturity levels more effectively, which helps improve the efficiency of small tomato production and planting.
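The abstract does not spell out how the Space-to-Depth module is built, so the following is only a minimal sketch of the generic space-to-depth rearrangement, not the authors' implementation. It assumes a block size of 2 and a plain nested-list image layout (height x width x channels); the function name `space_to_depth` and the channel ordering are illustrative choices. The idea is that every 2 x 2 patch of pixels is stacked along the channel axis, so the spatial resolution halves without discarding any pixel values.

```python
def space_to_depth(image, block=2):
    """Rearrange an H x W x C nested list into (H/block) x (W/block) x (C*block*block).

    Hypothetical helper for illustration; the channel order used here
    (row-major within each patch) is an assumption.
    """
    height, width = len(image), len(image[0])
    if height % block or width % block:
        raise ValueError("height and width must be divisible by block")
    out = []
    for top in range(0, height, block):
        row = []
        for left in range(0, width, block):
            cell = []
            # Stack every pixel of the block x block patch along the channel axis.
            for dy in range(block):
                for dx in range(block):
                    cell.extend(image[top + dy][left + dx])
            row.append(cell)
        out.append(row)
    return out


# A 4 x 4 single-channel image whose pixel values are 0..15.
image = [[[4 * r + c] for c in range(4)] for r in range(4)]
packed = space_to_depth(image)  # 2 x 2 spatial grid, 4 channels per cell
```

In this toy case the top-left 2 x 2 patch (values 0, 1, 4, 5) becomes the four channels of the top-left output cell. Unlike strided convolution or pooling, the downsampling step itself loses no information, which is presumably why it suits small targets.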

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3054/12279841/390148acb025/fpls-16-1618214-g001.jpg
