Ren Jie, Wang Wendong, Tian Yuan, He Jinrong
College of Mathematics and Computer Science, Yan'an University, Yan'an, Shaanxi, China.
Front Plant Sci. 2025 Aug 25;16:1616165. doi: 10.3389/fpls.2025.1616165. eCollection 2025.
To address the challenge of real-time kiwifruit detection in trellised orchards, this paper proposes YOLOv10-Kiwi, a lightweight detection model optimized for resource-constrained devices. First, a more compact network is developed by adjusting the scaling factors of the YOLOv10n architecture. Second, to further reduce model complexity, a novel C2fDualHet module is proposed by integrating two consecutive Heterogeneous Kernel Convolution (HetConv) layers as a replacement for the traditional Bottleneck structure. This replacement enables parallel processing and enhances feature extraction efficiency. By combining heterogeneous kernels in sequence, C2fDualHet captures both local and global features while significantly lowering parameter count and computational cost. To mitigate potential accuracy loss due to lightweighting, a Cross-Channel Fusion Module (CCFM) is introduced in the neck network. This module incorporates four additional convolutional layers to adjust channel dimensions and strengthen cross-channel information flow, thereby enhancing multi-scale feature integration. In addition, a MPDIoU loss function is introduced to overcome the limitations of the traditional CIoU in terms of aspect ratio mismatch and bounding box regression, accelerating convergence and improving detection accuracy. Experimental results demonstrate that YOLOv10-Kiwi achieves a model size of only 2.02 MB, with 0.51M parameters and 2.1 GFLOPs, representing reductions of 80.34%, 81.11%, and 68.18%, respectively, compared to the YOLOv10n baseline. On a self-constructed kiwifruit dataset, the model achieves 93.6% mAP@50 and an inference speed of 74 FPS. YOLOv10-Kiwi offers an efficient solution for automated kiwifruit detection on low-power agricultural robots.
为应对棚架果园中猕猴桃实时检测的挑战,本文提出了YOLOv10-Kiwi,这是一种针对资源受限设备优化的轻量级检测模型。首先,通过调整YOLOv10n架构的缩放因子来开发更紧凑的网络。其次,为进一步降低模型复杂度,提出了一种新颖的C2fDualHet模块,该模块集成了两个连续的异构内核卷积(HetConv)层,以替代传统的瓶颈结构。这种替代实现了并行处理并提高了特征提取效率。通过顺序组合异构内核,C2fDualHet在显著降低参数数量和计算成本的同时,捕获了局部和全局特征。为减轻轻量化带来的潜在精度损失,在颈部网络中引入了跨通道融合模块(CCFM)。该模块包含四个额外的卷积层来调整通道维度并加强跨通道信息流,从而增强多尺度特征融合。此外,引入了MPDIoU损失函数来克服传统CIoU在宽高比不匹配和边界框回归方面的局限性,加速收敛并提高检测精度。实验结果表明,YOLOv10-Kiwi的模型大小仅为2.02MB,有0.51M个参数和2.1 GFLOPs,与YOLOv10n基线相比,分别减少了80.34%、81.11%和68.18%。在自建的猕猴桃数据集上,该模型实现了93.6%的mAP@50和74 FPS的推理速度。YOLOv10-Kiwi为低功耗农业机器人上的猕猴桃自动检测提供了一种高效解决方案。