Weinstein Ben G, Garner Lindsey, Saccomanno Vienna R, Steinkraus Ashley, Ortega Andrew, Brush Kristen, Yenni Glenda, McKellar Ann E, Converse Rowan, Lippitt Christopher D, Wegmann Alex, Holmes Nick D, Edney Alice J, Hart Tom, Jessopp Mark J, Clarke Rohan H, Marchowski Dominik, Senyondo Henry, Dotson Ryan, White Ethan P, Frederick Peter, Ernest S K Morgan
Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, Florida, USA.
California Oceans Program, The Nature Conservancy, Sacramento, California, USA.
Ecol Appl. 2022 Dec;32(8):e2694. doi: 10.1002/eap.2694. Epub 2022 Aug 10.
Advances in artificial intelligence for computer vision hold great promise for increasing the scales at which ecological systems can be studied. The distribution and behavior of individuals is central to ecology, and computer vision using deep neural networks can learn to detect individual objects in imagery. However, developing supervised models for ecological monitoring is challenging because it requires large amounts of human-labeled training data, requires advanced technical expertise and computational infrastructure, and is prone to overfitting. This limits application across space and time. One solution is developing generalized models that can be applied across species and ecosystems. Using over 250,000 annotations from 13 projects from around the world, we develop a general bird detection model that achieves over 65% recall and 50% precision on novel aerial data without any local training despite differences in species, habitat, and imaging methodology. Fine-tuning this model with only 1000 local annotations increases these values to an average of 84% recall and 69% precision by building on the general features learned from other data sources. Retraining from the general model improves local predictions even when moderately large annotation sets are available and makes model training faster and more stable. Our results demonstrate that general models for detecting broad classes of organisms using airborne imagery are achievable. These models can reduce the effort, expertise, and computational resources necessary for automating the detection of individual organisms across large scales, helping to transform the scale of data collection in ecology and the questions that can be addressed.
计算机视觉领域的人工智能进展有望提升生态系统的研究规模。个体的分布和行为是生态学的核心,而使用深度神经网络的计算机视觉能够学会在图像中检测个体对象。然而,开发用于生态监测的监督模型具有挑战性,因为它需要大量人工标注的训练数据,需要先进的技术专长和计算基础设施,并且容易出现过拟合。这限制了其在时空上的应用。一种解决方案是开发可应用于不同物种和生态系统的通用模型。利用来自全球13个项目的超过25万个标注,我们开发了一种通用鸟类检测模型,该模型在新颖的航空数据上实现了超过65%的召回率和50%的精确率,尽管物种、栖息地和成像方法存在差异,但无需任何本地训练。通过利用从其他数据源学到的通用特征,仅用1000个本地标注对该模型进行微调,可将这些值平均提高到84%的召回率和69%的精确率。即使有适度大的标注集,从通用模型重新训练也能改善本地预测,并使模型训练更快、更稳定。我们的结果表明,使用航空图像检测广泛类别的生物体的通用模型是可以实现的。这些模型可以减少在大尺度上自动检测个体生物体所需的工作量、专业知识和计算资源,有助于改变生态学中的数据收集规模以及可解决的问题。