Jiang Liubing, Mu Yujie, Che Li, Wu Yongman
School of Information and Communication, Guilin University of Electronic Technology, Guilin, 541004, China.
School of Computer and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China.
Sci Rep. 2024 Aug 3;14(1):18013. doi: 10.1038/s41598-024-68878-7.
As the pillar industry of coastal areas, aquaculture needs artificial intelligence technology to promote economic development. To realize the automation of the aquaculture industry, this paper proposes a new underwater object detection network: WBi-YOLOSF. It realizes the automatic classification and detection of aquatic products, improves the production efficiency of the aquaculture industry, and promotes its economic development. This paper creates an image dataset containing 15 aquatic products to lay the data foundation for model training. In the data preprocessing part, an underwater image enhancement algorithm is proposed to improve the quality of the data set effectively. Aiming at the problem of high false detection rate and missed detection rate of underwater dense small targets, a new data enhancement method was proposed to improve the training set's data quality comprehensively. Inspired by the weighted bidirectional feature pyramid network, this paper proposes a new feature extraction network: AU-BiFPN, which solves the gradient problem caused by the network hierarchy's deepening on enhancing the network's multi-scale feature fusion. The AU-BiFPN network structure is embedded into the YOLO series network framework, significantly improving the basic network's feature extraction and feature fusion ability and dramatically improving the prediction accuracy without affecting the network inference speed. Here, the swarm intelligence algorithm is introduced to optimize the model hyperparameters, accelerating the convergence speed of model training and significantly reducing the computational cost. At the same time, the model's accuracy is improved by a cliff. In addition, the Funnel Activation is introduced in the network's backbone, and the simple, parameter-free attention module is integrated, effectively improving the accuracy and speed of the model prediction. Ablation and comparison experiments show the effectiveness and superiority of the proposed model. Verified by the mean average precision and frame rate evaluation indicators, the experimental results of the WBi-YOLOSF target detection network can reach 0.982 and 203 frames per second, which are 1.4% and five frames per second higher than the network with the second score. In summary, this method can quickly and accurately identify aquatic products, realize real-time target detection of aquatic products, and lay the foundation for developing an aquaculture automation system.
作为沿海地区的支柱产业,水产养殖需要人工智能技术来推动经济发展。为实现水产养殖业的自动化,本文提出了一种新的水下目标检测网络:WBi - YOLOSF。它实现了水产品的自动分类与检测,提高了水产养殖业的生产效率,推动了其经济发展。本文创建了一个包含15种水产品的图像数据集,为模型训练奠定数据基础。在数据预处理部分,提出了一种水下图像增强算法,有效提高了数据集的质量。针对水下密集小目标误检率和漏检率高的问题,提出了一种新的数据增强方法,全面提高训练集的数据质量。受加权双向特征金字塔网络的启发,本文提出了一种新的特征提取网络:AU - BiFPN,解决了网络层次加深对增强网络多尺度特征融合造成的梯度问题。将AU - BiFPN网络结构嵌入到YOLO系列网络框架中,显著提高了基础网络的特征提取和特征融合能力,在不影响网络推理速度的情况下大幅提高了预测精度。在此,引入群智能算法优化模型超参数,加快了模型训练的收敛速度,显著降低了计算成本。同时,模型的准确率有大幅提高。此外,在网络主干中引入了Funnel Activation,并集成了简单、无参数的注意力模块,有效提高了模型预测的准确率和速度。消融实验和对比实验表明了所提模型的有效性和优越性。经平均精度均值和帧率评估指标验证,WBi - YOLOSF目标检测网络的实验结果可达0.982和每秒203帧,分别比得分第二的网络高1.4%和每秒5帧。综上所述,该方法能够快速准确地识别水产品,实现水产品的实时目标检测,为开发水产养殖自动化系统奠定了基础。