School of Engineering, Dali University, Dali, China.
Institute of Eastern Himalayan Biodiversity Research, Dali University, Dali, China.
PLoS One. 2024 Aug 7;19(8):e0306530. doi: 10.1371/journal.pone.0306530. eCollection 2024.
Heatmap-based cattle pose estimation methods suffer from high network complexity and low detection speed. Addressing the issue of cattle pose estimation for complex scenarios without heatmaps, an end-to-end, lightweight cattle pose estimation network utilizing a reparameterized network and an attention mechanism is proposed to improve the overall network performance. The EfficientRepBiPAN (Efficient Representation Bi-Directional Progressive Attention Network) module, incorporated into the neck network, adeptly captures target features across various scales while also mitigating model redundancy. Moreover, a 3D parameterless SimAM (Similarity-based Attention Mechanism) attention mechanism is introduced into the backbone to capture richer directional and positional feature information. We constructed 6846 images to evaluate the performance of the model. The experimental results demonstrate that the proposed network outperforms the baseline method with a 4.3% increase in average accuracy at OKS = 0.5 on the test set. The proposed network reduces the number of floating-point computations by 1.0 G and the number of parameters by 0.16 M. Through comparative evaluations with heatmap and regression-based models such as HRNet, HigherHRNet, DEKR, DEKRv2, and YOLOv5-pose, our method improves AP0.5 by at least 0.4%, reduces the number of parameters by at least 0.4%, and decreases the amount of computation by at least 1.0 GFLOPs, achieving a harmonious balance between accuracy and efficiency. This method can serve as a theoretical reference for estimating cattle poses in various livestock industries.
基于热图的牛体姿态估计方法存在网络复杂度高、检测速度慢的问题。针对复杂场景下无热图的牛体姿态估计问题,提出了一种端到端的轻量级牛体姿态估计网络,利用重参数化网络和注意力机制提高整体网络性能。在颈部网络中加入高效表示双向渐进注意力网络(EfficientRepBiPAN)模块,能够在不同尺度上捕捉目标特征,同时减少模型冗余。此外,在骨干网络中引入了一种无 3D 参数的基于相似性的注意力机制(SimAM)注意力机制,用于捕捉更丰富的方向和位置特征信息。我们构建了 6846 张图像来评估模型的性能。实验结果表明,所提出的网络在测试集上 OKS=0.5 时的平均准确率提高了 4.3%,优于基线方法。所提出的网络将浮点运算次数减少了 1.0G,参数量减少了 0.16M。通过与 HRNet、HigherHRNet、DEKR、DEKRv2 和 YOLOv5-pose 等基于热图和回归的模型进行对比评估,我们的方法在 AP0.5 上至少提高了 0.4%,参数量至少减少了 0.4%,浮点运算次数至少减少了 1.0G,在准确性和效率之间取得了良好的平衡。该方法可为各种畜牧业中牛体姿态估计提供理论参考。