Xu Haobin, Zhang Xianhua, Shen Weilin, Lin Zhiqiang, Liu Shuang, Jia Qi, Li Honglong, Zheng Jingyuan, Zhong Fenglin
College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China.
Fujian Agricultural Machinery Extension Station, Fuzhou 350002, China.
Plants (Basel). 2024 Nov 27;13(23):3329. doi: 10.3390/plants13233329.
As a crop with significant medicinal value and nutritional components, the market demand for bitter melon continues to grow. The diversity of bitter melon shapes has a direct impact on its market acceptance and consumer preferences, making precise identification of bitter melon germplasm resources crucial for breeding work. To address the limitations of time-consuming and less accurate traditional manual identification methods, there is a need to enhance the automation and intelligence of bitter melon phenotype detection. This study developed a bitter melon phenotype detection model named CSW-YOLO. By incorporating the ConvNeXt V2 module to replace the backbone network of YOLOv8, the model's focus on critical target features is enhanced. Additionally, the SimAM attention mechanism was introduced to compute attention weights for neurons without increasing the parameter count, further enhancing the model's recognition accuracy. Finally, WIoUv3 was introduced as the bounding box loss function to improve the model's convergence speed and positioning capabilities. The model was trained and tested on a bitter melon image dataset, achieving a precision of 94.6%, a recall of 80.6%, a mAP50 of 96.7%, and an F1 score of 87.04%. These results represent improvements of 8.5%, 0.4%, 11.1%, and 4% in precision, recall, mAP50, and F1 score, respectively, over the original YOLOv8 model. Furthermore, the effectiveness of the improvements was validated through heatmap analysis and ablation experiments, demonstrating that the CSW-YOLO model can more accurately focus on target features, reduce false detection rates, and enhance generalization capabilities. Comparative tests with various mainstream deep learning models also proved the superior performance of CSW-YOLO in bitter melon phenotype detection tasks. This research provides an accurate and reliable method for bitter melon phenotype identification and also offers technical support for the visual detection technologies of other agricultural products.
苦瓜作为一种具有重要药用价值和营养成分的作物,市场对其需求持续增长。苦瓜形状的多样性直接影响其市场接受度和消费者偏好,因此精确鉴定苦瓜种质资源对于育种工作至关重要。为解决传统人工鉴定方法耗时且准确性较低的局限性,有必要提高苦瓜表型检测的自动化和智能化水平。本研究开发了一种名为CSW-YOLO的苦瓜表型检测模型。通过引入ConvNeXt V2模块替换YOLOv8的骨干网络,增强了模型对关键目标特征的关注。此外,引入SimAM注意力机制,在不增加参数数量的情况下计算神经元的注意力权重,进一步提高了模型的识别准确率。最后,引入WIoUv3作为边界框损失函数,提高了模型的收敛速度和定位能力。该模型在苦瓜图像数据集上进行训练和测试,精度达到94.6%,召回率为80.6%,mAP50为96.7%,F1分数为87.04%。与原始YOLOv8模型相比,这些结果在精度、召回率、mAP50和F1分数方面分别提高了8.5%、0.4%、11.1%和4%。此外,通过热图分析和消融实验验证了改进的有效性,表明CSW-YOLO模型能够更准确地关注目标特征,降低误检率,并增强泛化能力。与各种主流深度学习模型的对比测试也证明了CSW-YOLO在苦瓜表型检测任务中的优越性能。本研究为苦瓜表型鉴定提供了一种准确可靠的方法,也为其他农产品的视觉检测技术提供了技术支持。