College of Robotics, Guangdong Polytechnic of Science and Technology, Zhuhai, Guangdong, China.
Department of Network Technology, Guangzhou Institute of Software Engineering, Conghua, Guangdong, China.
PLoS One. 2024 Oct 29;19(10):e0307643. doi: 10.1371/journal.pone.0307643. eCollection 2024.
With the development of deep learning technology, object detection has been widely applied in various fields. However, in cross-dataset object detection, conventional deep learning models often face performance degradation issues. This is particularly true in the agricultural field, where there is a multitude of crop types and a complex and variable environment. Existing technologies still face performance bottlenecks when dealing with diverse scenarios. To address these issues, this study proposes a lightweight, cross-dataset enhanced object detection method for the agricultural domain based on YOLOv9, named Multi-Adapt Recognition-YOLOv9 (MAR-YOLOv9). The traditional 32x downsampling Backbone network has been optimized, and a 16x downsampling Backbone network has been innovatively designed. A more streamlined and lightweight Main Neck structure has been introduced, along with innovative methods for feature extraction, up-sampling, and Concat connection. The hybrid connection strategy allows the model to flexibly utilize features from different levels. This solves the issues of increased training time and redundant weights caused by the detection neck and auxiliary branch structures in traditional YOLOv9, enabling MAR-YOLOv9 to maintain high performance while reducing the model's computational complexity and improving detection speed, making it more suitable for real-time detection tasks. In comparative experiments on four plant datasets, MAR-YOLOv9 improved the mAP@0.5 accuracy by 39.18% compared to seven mainstream object detection algorithms, and by 1.28% compared to the YOLOv9 model. At the same time, the model size was reduced by 9.3%, and the number of model layers was decreased, reducing computational costs and storage requirements. Additionally, MAR-YOLOv9 demonstrated significant advantages in detecting complex agricultural images, providing an efficient, lightweight, and adaptable solution for object detection tasks in the agricultural field. The curated data and code can be accessed at the following link: https://github.com/YangxuWangamI/MAR-YOLOv9.
随着深度学习技术的发展,目标检测已经在各个领域得到了广泛的应用。然而,在跨数据集目标检测中,传统的深度学习模型通常面临性能下降的问题。这在农业领域尤为明显,因为那里有多种作物类型和复杂多变的环境。现有的技术在处理多样化的场景时仍然面临性能瓶颈。为了解决这些问题,本研究提出了一种基于 YOLOv9 的轻量级跨数据集增强农业领域目标检测方法,名为 Multi-Adapt Recognition-YOLOv9(MAR-YOLOv9)。优化了传统的 32x 下采样 Backbone 网络,创新设计了 16x 下采样 Backbone 网络。引入了更精简的轻量级 Main Neck 结构,以及创新的特征提取、上采样和 Concat 连接方法。混合连接策略允许模型灵活地利用来自不同层次的特征。这解决了传统 YOLOv9 中检测颈和辅助分支结构导致的训练时间增加和权重冗余问题,使 MAR-YOLOv9 在保持高性能的同时降低模型的计算复杂度,提高检测速度,使其更适合实时检测任务。在四个植物数据集上的对比实验中,MAR-YOLOv9 与七种主流目标检测算法相比,mAP@0.5 的精度提高了 39.18%,与 YOLOv9 模型相比提高了 1.28%。同时,模型大小减小了 9.3%,模型层数减少,降低了计算成本和存储需求。此外,MAR-YOLOv9 在检测复杂的农业图像方面表现出显著的优势,为农业领域的目标检测任务提供了高效、轻量级和适应性强的解决方案。已整理的数据和代码可在以下链接访问:https://github.com/YangxuWangamI/MAR-YOLOv9。