Liu Liang, Wu Junfeng, Zhao Haiyan, Kong Han, Zheng Tao, Qu Boyu, Yu Hong
College of Information Engineering, Dalian Ocean University, Dalian, China.
Dalian Key Laboratory of Smart Fisheries, Dalian Ocean University, Dalian, China.
J Fish Biol. 2025 Mar;106(3):908-920. doi: 10.1111/jfb.15992. Epub 2024 Nov 25.
Underwater fish object detection serves as a pivotal research direction in marine biology, aquaculture management, and computer vision, yet it poses substantial challenges due to the complexity of underwater environments, occultations, and the small-sized and frequently moving fish in aquaculture. Addressing these challenges, we propose a novel underwater fish object detection algorithm named Fish-Finder. First, we engendered a structure titled "C2fBF," utilizing the dual-path routing attention protocol of BiFormer. The primary objective of this structure is to alleviate the perturbations induced by underwater intricacies during the phase of downsampling in the backbone network, thereby discerning and conserving finer contextual features. Subsequently, we co-opted the RepGFPN method within our neck network-a distinctive approach that adeptly merges high-level semantic constructs with low-level spatial specifics, thus fortifying its multi-scale detection prowess. Then, in an endeavor to diminish the sensitivity toward positional aberrations during the detection of diminutive aquatic creatures, we incorporated a novel bounding box regression loss function, the Wasserstein loss, to the existing CIoU. This innovative function gauges the congruity between the predicted bounding box Gaussian distribution and the reference bounding box Gaussian distribution. Finally, in regard to the dataset, we independently assembled a specific dataset termed "SmallFish." This unique dataset, meticulously designed for the detection of small-scale fish within intricate underwater settings, includes 5000 annotated images of small fish. Experimental results demonstrate that, compared to the state-of-the-art detection methods, our proposed method improves the accuracy by and , and mean average precision (mAP) increases and in public dataset Kaggle-Fish and our SmallFish dataset, respectively.
水下鱼类目标检测是海洋生物学、水产养殖管理和计算机视觉中的一个关键研究方向,但由于水下环境的复杂性、遮挡以及水产养殖中鱼类体型小且频繁移动,这一任务面临着巨大挑战。为应对这些挑战,我们提出了一种名为Fish-Finder的新型水下鱼类目标检测算法。首先,我们利用BiFormer的双路径路由注意力协议生成了一种名为“C2fBF”的结构。该结构的主要目标是减轻骨干网络下采样阶段水下复杂性引起的干扰,从而识别并保留更精细的上下文特征。随后,我们在颈部网络中采用了RepGFPN方法——一种独特的方法,能够巧妙地将高级语义结构与低级空间细节融合,从而增强其多尺度检测能力。然后,为了降低在检测小型水生生物时对位置偏差的敏感性,我们在现有的CIoU基础上引入了一种新的边界框回归损失函数——Wasserstein损失。这种创新函数衡量预测边界框高斯分布与参考边界框高斯分布之间的一致性。最后,关于数据集,我们独立组装了一个名为“SmallFish”的特定数据集。这个独特的数据集是为在复杂水下环境中检测小型鱼类而精心设计的,包括5000张小鱼的标注图像。实验结果表明,与现有最先进的检测方法相比,我们提出的方法在公共数据集Kaggle-Fish和我们的SmallFish数据集中,准确率分别提高了[X]和[X],平均精度均值(mAP)分别提高了[X]和[X]。