Chen Bo-Jin, Bu Jun-Yan, Xia Jun-Lin, Li Ming-Xuan, Su Wen-Hao
College of Engineering, China Agricultural University, 17 Qinghua East Road, Haidian, Beijing 100083, China.
Plants (Basel). 2025 Aug 20;14(16):2587. doi: 10.3390/plants14162587.
Accurate detection of cherry tomato clusters and their ripeness stages is critical for the development of intelligent harvesting systems in modern agriculture. In response to the challenges posed by occlusion, overlapping clusters, and subtle ripeness variations under complex greenhouse environments, an improved YOLO11-based deep convolutional neural network detection model, called AFBF-YOLO, is proposed in this paper. First, a dataset comprising 486 RGB images and over 150,000 annotated instances was constructed and augmented, covering four ripeness stages and fruit clusters. Then, based on YOLO11, the ACmix attention mechanism was incorporated to strengthen feature representation under occluded and cluttered conditions. Additionally, a novel neck structure, FreqFusion-BiFPN, was designed to improve multi-scale feature fusion through frequency-aware filtering. Finally, a refined loss function, Inner-Focaler-IoU, was applied to enhance bounding box localization by emphasizing inner-region overlap and focusing on difficult samples. Experimental results show that AFBF-YOLO achieves a precision of 81.2%, a recall of 81.3%, and an mAP@0.5 of 85.6%, outperforming multiple mainstream YOLO series. High accuracy across ripeness stages and low computational complexity indicate it excels in simultaneous detection of cherry tomato fruit bunches and fruit maturity, supporting automated maturity assessment and robotic harvesting in precision agriculture.
准确检测樱桃番茄簇及其成熟阶段对于现代农业智能收获系统的发展至关重要。针对复杂温室环境下遮挡、簇重叠以及成熟度细微变化带来的挑战,本文提出了一种基于YOLO11改进的深度卷积神经网络检测模型,称为AFBF-YOLO。首先,构建并扩充了一个包含486张RGB图像和超过150,000个标注实例的数据集,涵盖四个成熟阶段和果实簇。然后,基于YOLO11,引入了ACmix注意力机制,以加强在遮挡和杂乱条件下的特征表示。此外,设计了一种新颖的颈部结构FreqFusion-BiFPN,通过频率感知滤波来改进多尺度特征融合。最后,应用了一种改进的损失函数Inner-Focaler-IoU,通过强调内部区域重叠和关注困难样本,来增强边界框定位。实验结果表明,AFBF-YOLO的精度达到81.2%,召回率达到81.3%,mAP@0.5为85.6%,优于多个主流YOLO系列。在不同成熟阶段都具有高精度且计算复杂度低,表明它在同时检测樱桃番茄果串和果实成熟度方面表现出色,支持精准农业中的自动成熟度评估和机器人收获。