• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用时间上下文提高动物叫声序列的自动检测。

Improve automatic detection of animal call sequences with temporal context.

机构信息

K. Lisa Yang Center for Conservation Bioacoustics, Cornell Lab of Ornithology, Cornell University, Ithaca, NY, USA.

Marine Mammal Institute, Department of Fisheries, Wildlife, and Conservation Sciences, Oregon State University, Corvallis, OR, USA.

出版信息

J R Soc Interface. 2021 Jul;18(180):20210297. doi: 10.1098/rsif.2021.0297. Epub 2021 Jul 21.

DOI:10.1098/rsif.2021.0297
PMID:34283944
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8292017/
Abstract

Many animals rely on long-form communication, in the form of songs, for vital functions such as mate attraction and territorial defence. We explored the prospect of improving automatic recognition performance by using the temporal context inherent in song. The ability to accurately detect sequences of calls has implications for conservation and biological studies. We show that the performance of a convolutional neural network (CNN), designed to detect song notes (calls) in short-duration audio segments, can be improved by combining it with a recurrent network designed to process sequences of learned representations from the CNN on a longer time scale. The combined system of independently trained CNN and long short-term memory (LSTM) network models exploits the temporal patterns between song notes. We demonstrate the technique using recordings of fin whale () songs, which comprise patterned sequences of characteristic notes. We evaluated several variants of the CNN + LSTM network. Relative to the baseline CNN model, the CNN + LSTM models reduced performance variance, offering a 9-17% increase in area under the precision-recall curve and a 9-18% increase in peak F1-scores. These results show that the inclusion of temporal information may offer a valuable pathway for improving the automatic recognition and transcription of wildlife recordings.

摘要

许多动物依赖于长形式的通讯,例如歌曲,以实现重要的功能,如吸引配偶和防御领地。我们探索了利用歌曲中固有的时间上下文来提高自动识别性能的可能性。准确检测呼叫序列的能力对保护和生物研究具有重要意义。我们表明,设计用于在短持续时间音频段中检测歌曲音符(呼叫)的卷积神经网络(CNN)的性能可以通过将其与循环网络相结合来提高,该网络旨在在更长的时间尺度上处理从 CNN 学习的表示序列。独立训练的 CNN 和长短时记忆 (LSTM) 网络模型的组合系统利用了歌曲音符之间的时间模式。我们使用长须鲸()歌曲的录音演示了该技术,其中包括特征音符的模式序列。我们评估了 CNN + LSTM 网络的几种变体。与基线 CNN 模型相比,CNN + LSTM 模型降低了性能方差,在精度-召回曲线下面积方面提高了 9-17%,在峰值 F1 得分方面提高了 9-18%。这些结果表明,包含时间信息可能为提高野生动物录音的自动识别和转录提供有价值的途径。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/b2ab108775c3/rsif20210297f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/e8d5e4351f40/rsif20210297f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/ed4161e3bd79/rsif20210297f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/64a76f55bdba/rsif20210297f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/362bfc491762/rsif20210297f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/a6b3eded137f/rsif20210297f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/d2fadaf75be1/rsif20210297f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/0870097db8bc/rsif20210297f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/24185cba2f7a/rsif20210297f08.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/73f2c7a4eb03/rsif20210297f09.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/4147970454b0/rsif20210297f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/e96af587934b/rsif20210297f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/8fd44b302e96/rsif20210297f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/ddb56048ee09/rsif20210297f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/b2ab108775c3/rsif20210297f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/e8d5e4351f40/rsif20210297f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/ed4161e3bd79/rsif20210297f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/64a76f55bdba/rsif20210297f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/362bfc491762/rsif20210297f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/a6b3eded137f/rsif20210297f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/d2fadaf75be1/rsif20210297f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/0870097db8bc/rsif20210297f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/24185cba2f7a/rsif20210297f08.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/73f2c7a4eb03/rsif20210297f09.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/4147970454b0/rsif20210297f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/e96af587934b/rsif20210297f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/8fd44b302e96/rsif20210297f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/ddb56048ee09/rsif20210297f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a755/8292017/b2ab108775c3/rsif20210297f14.jpg

相似文献

1
Improve automatic detection of animal call sequences with temporal context.利用时间上下文提高动物叫声序列的自动检测。
J R Soc Interface. 2021 Jul;18(180):20210297. doi: 10.1098/rsif.2021.0297. Epub 2021 Jul 21.
2
Automatic detection and classification of baleen whale social calls using convolutional neural networks.使用卷积神经网络自动检测和分类须鲸的社会叫声。
J Acoust Soc Am. 2021 May;149(5):3635. doi: 10.1121/10.0005047.
3
Temporal indexing of medical entity in Chinese clinical notes.中文临床记录中医疗实体的时间索引。
BMC Med Inform Decis Mak. 2019 Jan 31;19(Suppl 1):17. doi: 10.1186/s12911-019-0735-x.
4
Automatic monitoring and detection of tail-biting behavior in groups of pigs using video-based deep learning methods.使用基于视频的深度学习方法对猪群中的咬尾行为进行自动监测和检测。
Front Vet Sci. 2023 Jan 11;9:1099347. doi: 10.3389/fvets.2022.1099347. eCollection 2022.
5
Detecting, classifying, and counting blue whale calls with Siamese neural networks.用孪生神经网络检测、分类和计数蓝鲸叫声。
J Acoust Soc Am. 2021 May;149(5):3086. doi: 10.1121/10.0004828.
6
A Novel Gait Phase Recognition Method Based on DPF-LSTM-CNN Using Wearable Inertial Sensors.基于穿戴式惯性传感器的 DPF-LSTM-CNN 的新型步态相位识别方法。
Sensors (Basel). 2023 Jun 26;23(13):5905. doi: 10.3390/s23135905.
7
Temporal evolution of the Mediterranean fin whale song.地中海长须鲸歌声的时间演化。
Sci Rep. 2022 Aug 9;12(1):13565. doi: 10.1038/s41598-022-15379-0.
8
A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance.深度学习模型在不同类别不平衡程度的非结构化医疗记录文本分类中的对比研究。
BMC Med Res Methodol. 2022 Jul 2;22(1):181. doi: 10.1186/s12874-022-01665-y.
9
Cohort selection for clinical trials using hierarchical neural network.使用分层神经网络进行临床试验的队列选择。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1203-1208. doi: 10.1093/jamia/ocz099.
10
An End-to-End Multi-Channel Convolutional Bi-LSTM Network for Automatic Sleep Stage Detection.端到端多通道卷积双向长短时记忆网络在自动睡眠分期检测中的应用。
Sensors (Basel). 2023 May 21;23(10):4950. doi: 10.3390/s23104950.

引用本文的文献

1
Multi-year soundscape recordings and automated call detection reveals varied impact of moonlight on calling activity of neotropical forest katydids.多年的声景记录和自动鸣叫检测揭示了月光对新热带森林蟋蟀鸣叫活动的不同影响。
Philos Trans R Soc Lond B Biol Sci. 2024 Jun 24;379(1904):20230110. doi: 10.1098/rstb.2023.0110. Epub 2024 May 6.
2
Extensive data engineering to the rescue: building a multi-species katydid detector from unbalanced, atypical training datasets.广泛的数据工程来救援:从不平衡、非典型的训练数据集中构建一个多物种蝈蝈探测器。
Philos Trans R Soc Lond B Biol Sci. 2024 Jun 24;379(1904):20230444. doi: 10.1098/rstb.2023.0444. Epub 2024 May 6.
3

本文引用的文献

1
Exploring movement patterns and changing distributions of baleen whales in the western North Atlantic using a decade of passive acoustic data.利用十年来的被动声学数据探索北大西洋西部须鲸的运动模式和分布变化。
Glob Chang Biol. 2020 Sep;26(9):4812-4840. doi: 10.1111/gcb.15191. Epub 2020 Jul 12.
2
Fin whale acoustic presence and song characteristics in seas to the southwest of Portugal.葡萄牙西南部海域长须鲸的声学存在及歌声特征
J Acoust Soc Am. 2020 Apr;147(4):2235. doi: 10.1121/10.0001066.
3
Deep neural networks for automated detection of marine mammal species.
ORCA-SPY enables killer whale sound source simulation, detection, classification and localization using an integrated deep learning-based segmentation.
ORCA-SPY 使用基于集成深度学习的分割实现虎鲸声源模拟、检测、分类和定位。
Sci Rep. 2023 Jul 10;13(1):11106. doi: 10.1038/s41598-023-38132-7.
4
Temporal evolution of the Mediterranean fin whale song.地中海长须鲸歌声的时间演化。
Sci Rep. 2022 Aug 9;12(1):13565. doi: 10.1038/s41598-022-15379-0.
5
Automated identification of chicken distress vocalizations using deep learning models.使用深度学习模型自动识别鸡的 distress 叫声。
J R Soc Interface. 2022 Jun;19(191):20210921. doi: 10.1098/rsif.2021.0921. Epub 2022 Jun 29.
6
Computational bioacoustics with deep learning: a review and roadmap.深度学习的计算生物声学:综述与路线图。
PeerJ. 2022 Mar 21;10:e13152. doi: 10.7717/peerj.13152. eCollection 2022.
7
A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets.一种用于在大型水下声数据集分类鲸目动物声纳 click 的机器学习管道。
PLoS Comput Biol. 2021 Dec 3;17(12):e1009613. doi: 10.1371/journal.pcbi.1009613. eCollection 2021 Dec.
用于自动检测海洋哺乳动物物种的深度神经网络。
Sci Rep. 2020 Jan 17;10(1):607. doi: 10.1038/s41598-020-57549-y.
4
Deep Machine Learning Techniques for the Detection and Classification of Sperm Whale Bioacoustics.深度学习技术在抹香鲸生物声学检测与分类中的应用。
Sci Rep. 2019 Aug 29;9(1):12588. doi: 10.1038/s41598-019-48909-4.
5
Convolutional neural network for detecting odontocete echolocation clicks.用于检测齿鲸类回声定位点击的卷积神经网络。
J Acoust Soc Am. 2019 Jan;145(1):EL7. doi: 10.1121/1.5085647.
6
Automatic classification of grouper species by their sounds using deep neural networks.利用深度神经网络对石斑鱼进行自动分类。
J Acoust Soc Am. 2018 Sep;144(3):EL196. doi: 10.1121/1.5054911.
7
Automatic detection and classification of marmoset vocalizations using deep and recurrent neural networks.使用深度和循环神经网络自动检测和分类狨猴叫声。
J Acoust Soc Am. 2018 Jul;144(1):478. doi: 10.1121/1.5047743.
8
Fin whale density and distribution estimation using acoustic bearings derived from sparse arrays.利用稀疏阵声 bearings 估算长须鲸的密度和分布。
J Acoust Soc Am. 2018 May;143(5):2980. doi: 10.1121/1.5031111.
9
Spatial and temporal trends in fin whale vocalizations recorded in the NE Pacific Ocean between 2003-2013.2003年至2013年间在东北太平洋记录的长须鲸发声的时空趋势。
PLoS One. 2017 Oct 26;12(10):e0186127. doi: 10.1371/journal.pone.0186127. eCollection 2017.
10
Fin whale song variability in southern California and the Gulf of California.南加利福尼亚和加利福尼亚湾长须鲸歌声的可变性。
Sci Rep. 2017 Aug 31;7(1):10126. doi: 10.1038/s41598-017-09979-4.