Primate Models for Behavioural Evolution Lab, Institute of Cognitive and Evolutionary Anthropology, University of Oxford, Oxford, UK.
Visual Geometry Group, Department of Engineering Science, University of Oxford, Oxford, UK.
Sci Adv. 2019 Sep 4;5(9):eaaw0736. doi: 10.1126/sciadv.aaw0736. eCollection 2019 Sep.
Video recording is now ubiquitous in the study of animal behavior, but its analysis on a large scale is prohibited by the time and resources needed to manually process large volumes of data. We present a deep convolutional neural network (CNN) approach that provides a fully automated pipeline for face detection, tracking, and recognition of wild chimpanzees from long-term video records. In a 14-year dataset yielding 10 million face images from 23 individuals over 50 hours of footage, we obtained an overall accuracy of 92.5% for identity recognition and 96.2% for sex recognition. Using the identified faces, we generated co-occurrence matrices to trace changes in the social network structure of an aging population. The tools we developed enable easy processing and annotation of video datasets, including those from other species. Such automated analysis unveils the future potential of large-scale longitudinal video archives to address fundamental questions in behavior and conservation.
视频记录在动物行为研究中已经无处不在,但由于手动处理大量数据所需的时间和资源,其大规模分析受到限制。我们提出了一种深度卷积神经网络(CNN)方法,为从长期视频记录中检测、跟踪和识别野生黑猩猩提供了一个完全自动化的流水线。在一个 14 年的数据集里,我们从 23 只黑猩猩的 50 多小时的视频中获得了 1000 万张面部图像,身份识别的整体准确率为 92.5%,性别识别的准确率为 96.2%。利用识别出的面部图像,我们生成了共现矩阵,以追踪老龄化群体社会网络结构的变化。我们开发的工具可以方便地处理和注释视频数据集,包括来自其他物种的数据集。这种自动化分析揭示了大规模纵向视频档案在解决行为和保护方面基本问题方面的未来潜力。