Improve automatic detection of animal call sequences with temporal context.

Author affiliations

K. Lisa Yang Center for Conservation Bioacoustics, Cornell Lab of Ornithology, Cornell University, Ithaca, NY, USA.

Marine Mammal Institute, Department of Fisheries, Wildlife, and Conservation Sciences, Oregon State University, Corvallis, OR, USA.

Publication information

J R Soc Interface. 2021 Jul;18(180):20210297. doi: 10.1098/rsif.2021.0297. Epub 2021 Jul 21.

PMID:34283944

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8292017/

Abstract

Many animals rely on long-form communication, in the form of songs, for vital functions such as mate attraction and territorial defence. We explored the prospect of improving automatic recognition performance by using the temporal context inherent in song. The ability to accurately detect sequences of calls has implications for conservation and biological studies. We show that the performance of a convolutional neural network (CNN), designed to detect song notes (calls) in short-duration audio segments, can be improved by combining it with a recurrent network designed to process sequences of learned representations from the CNN on a longer time scale. The combined system of independently trained CNN and long short-term memory (LSTM) network models exploits the temporal patterns between song notes. We demonstrate the technique using recordings of fin whale (Balaenoptera physalus) songs, which comprise patterned sequences of characteristic notes. We evaluated several variants of the CNN + LSTM network. Relative to the baseline CNN model, the CNN + LSTM models reduced performance variance, offering a 9-17% increase in area under the precision-recall curve and a 9-18% increase in peak F1-scores. These results show that the inclusion of temporal information may offer a valuable pathway for improving the automatic recognition and transcription of wildlife recordings.
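The abstract reports its gains in two detector metrics: area under the precision-recall curve and peak F1-score. As a rough illustration of how such numbers can be obtained from per-segment detector scores, the following is a minimal sketch in plain Python. It is not the authors' evaluation code: the function names, the toy scores and labels, and the step-wise (average-precision style) integration of the curve are all assumptions made for illustration, since the abstract does not state how the area was computed.

```python
def precision_recall_points(scores, labels):
    """Return (precision, recall, threshold) triples, one per distinct score.

    scores: detector outputs, higher means "more likely a call" (hypothetical).
    labels: 1 for a true call, 0 for background.
    """
    total_pos = sum(labels)
    if total_pos == 0:
        raise ValueError("need at least one positive label")
    # Rank segments from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = []
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Consume every segment tied at this score before emitting a point.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((tp / (tp + fp), tp / total_pos, threshold))
    return points


def area_under_pr(points):
    """Step-wise area: sum of precision times the increase in recall."""
    area = 0.0
    prev_recall = 0.0
    for precision, recall, _ in points:
        area += precision * (recall - prev_recall)
        prev_recall = recall
    return area


def peak_f1(points):
    """Return (best F1, threshold at which it occurs)."""
    best_f1, best_threshold = 0.0, None
    for precision, recall, threshold in points:
        if precision + recall > 0:
            f1 = 2 * precision * recall / (precision + recall)
            if f1 > best_f1:
                best_f1, best_threshold = f1, threshold
    return best_f1, best_threshold


# Toy data, invented for this example only.
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0, 0]

points = precision_recall_points(scores, labels)
ap = area_under_pr(points)
best_f1, best_threshold = peak_f1(points)
print(f"area under PR curve: {ap:.4f}")
print(f"peak F1: {best_f1:.4f} at threshold {best_threshold}")
```

For this toy input the step-wise area is 11/12 and the peak F1 is 6/7, reached at a threshold of 0.6. Comparing a baseline detector with a context-aware one would amount to running the same functions on each model's scores over the same labelled segments.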

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/e8d5e4351f40/rsif20210297f01.jpg
