College of Mechanical and Electrical Engineering, Inner Mongolia Agricultural University, Inner Mongolia, Hohhot 010018, China; Inner Mongolia Engineering Research Center for Intelligent Facilities in Prataculture and Livestock Breeding, Inner Mongolia, Hohhot 010018, China.
College of Mechanical and Electrical Engineering, Inner Mongolia Agricultural University, Inner Mongolia, Hohhot 010018, China; Inner Mongolia Engineering Research Center for Intelligent Facilities in Prataculture and Livestock Breeding, Inner Mongolia, Hohhot 010018, China.
Animal. 2023 Aug;17(8):100886. doi: 10.1016/j.animal.2023.100886. Epub 2023 Jun 15.
Accurate identification of individual animals plays a pivotal role in enhancing animal welfare and optimising farm production. Although Radio Frequency Identification technology has been widely applied in animal identification, this method still exhibits several limitations that make it difficult to meet current practical application requirements. In this study, we proposed ViT-Sheep, a sheep face recognition model based on the Vision Transformer (ViT) architecture, to facilitate precise animal management and enhance livestock welfare. Compared to Convolutional Neural Network (CNN), ViT is renowned for its competitive performance. The experimental procedure of this study consisted of three main steps. Firstly, we collected face images of 160 experimental sheep to construct the sheep face image dataset. Secondly, we developed two sets of sheep face recognition models based on CNN and ViT, respectively. To enhance the ability to learn sheep face biological features, we proposed targeted improvement strategies for the sheep face recognition model. Specifically, we introduced the LayerScale module into the encoder of the ViT-Base-16 model and employed transfer learning to improve recognition accuracy. Finally, we compared the training results of different recognition models and the ViT-Sheep model. The results demonstrated that our proposed method achieved the highest performance on the sheep face image dataset, with a recognition accuracy of 97.9%. This study demonstrates that ViT can successfully achieve sheep face recognition tasks with good robustness. Furthermore, the findings of this research will promote the practical application of artificial intelligence animal recognition technology in sheep production.
准确识别个体动物对于提高动物福利和优化农场生产至关重要。尽管射频识别技术已广泛应用于动物识别,但该方法仍存在一些局限性,难以满足当前实际应用的要求。本研究提出了基于视觉Transformer(ViT)架构的羊脸识别模型 ViT-Sheep,以促进精确的动物管理和提高牲畜福利。与卷积神经网络(CNN)相比,ViT 以其出色的性能而闻名。本研究的实验过程包括三个主要步骤。首先,我们收集了 160 只实验羊的面部图像,构建了羊脸图像数据集。其次,我们分别基于 CNN 和 ViT 开发了两套羊脸识别模型。为了提高学习羊脸生物特征的能力,我们针对羊脸识别模型提出了有针对性的改进策略。具体来说,我们在 ViT-Base-16 模型的编码器中引入了 LayerScale 模块,并采用迁移学习来提高识别精度。最后,我们比较了不同识别模型和 ViT-Sheep 模型的训练结果。结果表明,我们提出的方法在羊脸图像数据集上取得了最高的性能,识别准确率达到 97.9%。本研究表明,ViT 可以成功地完成羊脸识别任务,具有良好的鲁棒性。此外,本研究的结果将促进人工智能动物识别技术在绵羊生产中的实际应用。