Ma Jianchao, Guo Jiayuan, Zheng Xiaolong, Fang Chaoyang
School of Geography and Environment, Jiangxi Normal University, Nanchang 330022, China.
Key Laboratory of Poyang Lake Wetland and Watershed Research, Ministry of Education, Jiangxi Normal University, Nanchang 330022, China.
Animals (Basel). 2024 Nov 21;14(23):3353. doi: 10.3390/ani14233353.
Poyang Lake is the largest freshwater lake in China and plays a significant ecological role. Deep-learning-based video surveillance can effectively monitor bird species on the lake, contributing to the local biodiversity preservation. To address the challenges of multi-scale object detection against complex backgrounds, such as a high density and severe occlusion, we propose a new model known as the YOLOv8-bird model. First, we use Receptive-Field Attention convolution, which improves the model's ability to capture and utilize image information. Second, we redesign a feature fusion network, termed the DyASF-P2, which enhances the network's ability to capture small object features and reduces the target information loss. Third, a lightweight detection head is designed to effectively reduce the model's size without sacrificing the precision. Last, the Inner-ShapeIoU loss function is proposed to address the multi-scale bird localization challenge. Experimental results on the PYL-5-2023 dataset demonstrate that the YOLOv8-bird model achieves precision, recall, mAP@0.5, and mAP@0.5:0.95 scores of 94.6%, 89.4%, 94.8%, and 70.4%, respectively. Additionally, the model outperforms other mainstream object detection models in terms of accuracy. These results indicate that the proposed YOLOv8-bird model is well-suited for bird detection and counting tasks, which enable it to support biodiversity monitoring in the complex environment of Poyang Lake.
鄱阳湖是中国最大的淡水湖,发挥着重要的生态作用。基于深度学习的视频监控能够有效监测鄱阳湖的鸟类物种,有助于当地生物多样性保护。为应对复杂背景下多尺度目标检测的挑战,如高密度和严重遮挡等问题,我们提出了一种名为YOLOv8-鸟类模型的新模型。首先,我们使用感受野注意力卷积,提高模型捕捉和利用图像信息的能力。其次,我们重新设计了一个特征融合网络,称为DyASF-P2,增强网络捕捉小目标特征的能力并减少目标信息损失。第三,设计了一个轻量级检测头,在不牺牲精度的情况下有效减小模型大小。最后,提出了Inner-ShapeIoU损失函数来应对多尺度鸟类定位挑战。在PYL-5-2023数据集上的实验结果表明,YOLOv8-鸟类模型的精度、召回率、mAP@0.5和mAP@0.5:0.95分数分别达到94.6%、89.4%、94.8%和70.4%。此外,该模型在准确率方面优于其他主流目标检测模型。这些结果表明,所提出的YOLOv8-鸟类模型非常适合鸟类检测和计数任务,使其能够支持鄱阳湖复杂环境下的生物多样性监测。