Robillard Alexander J, Trizna Michael G, Ruiz-Tafur Morgan, Dávila Panduro Edgard Leonardo, de Santana C David, White Alexander E, Dikow Rebecca B, Deichmann Jessica L
Data Science Lab Office of the Chief Information Officer, Smithsonian Institution Washington District of Columbia USA.
Center for Conservation and Sustainability Smithsonian National Zoo and Conservation Biology Institute Washington District of Columbia USA.
Ecol Evol. 2023 May 1;13(5):e9987. doi: 10.1002/ece3.9987. eCollection 2023 May.
Given the sharp increase in agricultural and infrastructure development and the paucity of widespread data available to support conservation management decisions, a more rapid and accurate tool for identifying fish fauna in the world's largest freshwater ecosystem, the Amazon, is needed. Current strategies for identification of freshwater fishes require high levels of training and taxonomic expertise for morphological identification or genetic testing for species recognition at a molecular level. To overcome these challenges, we built an image masking model (U-Net) and a convolutional neural net (CNN) to classify Amazonian fish in photographs. Fish used to generate training data were collected and photographed in tributaries in seasonally flooded forests of the upper Morona River valley in Loreto, Peru in 2018 and 2019. Species identifications in the training images ( = 3068) were verified by expert ichthyologists. These images were supplemented with photographs taken of additional Amazonian fish specimens housed in the ichthyological collection of the Smithsonian's National Museum of Natural History. We generated a CNN model that identified 33 genera of fishes with a mean accuracy of 97.9%. Wider availability of accurate freshwater fish image recognition tools, such as the one described here, will enable fishermen, local communities, and citizen scientists to more effectively participate in collecting and sharing data from their territories to inform policy and management decisions that impact them directly.
鉴于农业和基础设施发展的急剧增加以及缺乏广泛可用的数据来支持保护管理决策,需要一种更快、更准确的工具来识别世界上最大的淡水生态系统——亚马逊河中鱼类区系。当前识别淡水鱼的策略需要高水平的训练和分类学专业知识来进行形态学鉴定,或在分子水平上进行基因测试以识别物种。为了克服这些挑战,我们构建了一个图像掩膜模型(U-Net)和一个卷积神经网络(CNN)来对照片中的亚马逊鱼类进行分类。用于生成训练数据的鱼类于2018年和2019年在秘鲁洛雷托莫罗纳河上游流域季节性淹没森林的支流中收集并拍照。训练图像(=3068张)中的物种鉴定由鱼类专家进行了验证。这些图像还补充了史密森尼国家自然历史博物馆鱼类学收藏中保存的其他亚马逊鱼类标本的照片。我们生成了一个CNN模型,该模型能够识别33个鱼类属,平均准确率为97.9%。像这里描述的这种准确的淡水鱼图像识别工具的更广泛应用,将使渔民、当地社区和公民科学家能够更有效地参与收集和分享来自他们所在地区的数据,为直接影响他们的政策和管理决策提供信息。