Yu Zhenghong, Wang Yangxu, Ye Jianxiong, Liufu Shengjie, Lu Dunlu, Zhu Xiuli, Yang Zhongming, Tan Qingji
College of Robotics, Guangdong Polytechnic of Science and Technology, Zhuhai, China.
Department of Network Technology, Guangzhou Institute of Software Engineering, Conghua, China.
Front Plant Sci. 2024 Feb 20;15:1320109. doi: 10.3389/fpls.2024.1320109. eCollection 2024.
Soybean pod count is one of the crucial indicators of soybean yield. Nevertheless, due to the challenges associated with counting pods, such as crowded and uneven pod distribution, existing pod counting models prioritize accuracy over efficiency, which does not meet the requirements for lightweight and real-time tasks.
To address this goal, we have designed a deep convolutional network called PodNet. It employs a lightweight encoder and an efficient decoder that effectively decodes both shallow and deep information, alleviating the indirect interactions caused by information loss and degradation between non-adjacent levels.
We utilized a high-resolution dataset of soybean pods from field harvesting to evaluate the model's generalization ability. Through experimental comparisons between manual counting and model yield estimation, we confirmed the effectiveness of the PodNet model. The experimental results indicate that PodNet achieves an R of 0.95 for the prediction of soybean pod quantities compared to ground truth, with only 2.48M parameters, which is an order of magnitude lower than the current SOTA model YOLO POD, and the FPS is much higher than YOLO POD.
Compared to advanced computer vision methods, PodNet significantly enhances efficiency with almost no sacrifice in accuracy. Its lightweight architecture and high FPS make it suitable for real-time applications, providing a new solution for counting and locating dense objects.
大豆荚果数量是大豆产量的关键指标之一。然而,由于荚果计数存在挑战,如荚果分布拥挤且不均匀,现有的荚果计数模型更注重准确性而非效率,这无法满足轻量级和实时任务的要求。
为实现这一目标,我们设计了一种名为PodNet的深度卷积网络。它采用了轻量级编码器和高效解码器,能有效解码浅层和深层信息,减轻非相邻层之间信息丢失和退化导致的间接交互。
我们利用来自田间收获的大豆荚果高分辨率数据集评估模型的泛化能力。通过人工计数与模型产量估计的实验比较,我们证实了PodNet模型的有效性。实验结果表明,与真实值相比,PodNet在预测大豆荚果数量时的R值达到0.95,参数仅为248万个,比当前的最优模型YOLO POD低一个数量级,且帧率远高于YOLO POD。
与先进的计算机视觉方法相比,PodNet在几乎不牺牲准确性的情况下显著提高了效率。其轻量级架构和高帧率使其适用于实时应用,为密集物体的计数和定位提供了新的解决方案。