• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习技术在抹香鲸生物声学检测与分类中的应用。

Deep Machine Learning Techniques for the Detection and Classification of Sperm Whale Bioacoustics.

机构信息

Radcliffe Institute for Advanced Study, Harvard University, Cambridge, MA, USA.

Department of Computing, Imperial College, London, MA, UK.

出版信息

Sci Rep. 2019 Aug 29;9(1):12588. doi: 10.1038/s41598-019-48909-4.

DOI:10.1038/s41598-019-48909-4
PMID:31467331
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6715799/
Abstract

We implemented Machine Learning (ML) techniques to advance the study of sperm whale (Physeter macrocephalus) bioacoustics. This entailed employing Convolutional Neural Networks (CNNs) to construct an echolocation click detector designed to classify spectrograms generated from sperm whale acoustic data according to the presence or absence of a click. The click detector achieved 99.5% accuracy in classifying 650 spectrograms. The successful application of CNNs to clicks reveals the potential of future studies to train CNN-based architectures to extract finer-scale details from cetacean spectrograms. Long short-term memory and gated recurrent unit recurrent neural networks were trained to perform classification tasks, including (1) "coda type classification" where we obtained 97.5% accuracy in categorizing 23 coda types from a Dominica dataset containing 8,719 codas and 93.6% accuracy in categorizing 43 coda types from an Eastern Tropical Pacific (ETP) dataset with 16,995 codas; (2) "vocal clan classification" where we obtained 95.3% accuracy for two clan classes from Dominica and 93.1% for four ETP clan types; and (3) "individual whale identification" where we obtained 99.4% accuracy using two Dominica sperm whales. These results demonstrate the feasibility of applying ML to sperm whale bioacoustics and establish the validity of constructing neural networks to learn meaningful representations of whale vocalizations.

摘要

我们采用机器学习 (ML) 技术来推进抹香鲸 (Physeter macrocephalus) 生物声学的研究。这包括使用卷积神经网络 (CNNs) 构建一个回声定位点击探测器,旨在根据声纳数据中是否存在点击来对声谱图进行分类。点击探测器在对 650 个声谱图进行分类时达到了 99.5%的准确率。CNN 在点击分类中的成功应用表明,未来的研究有可能训练基于 CNN 的架构,从鲸目动物的声谱图中提取更精细的细节。长短期记忆和门控循环单元递归神经网络被训练来执行分类任务,包括 (1) “尾音类型分类”,我们在包含 8719 个尾音的多米尼加数据集和包含 16995 个尾音的东热带太平洋 (ETP) 数据集上,分别以 97.5%和 93.6%的准确率对 23 种和 43 种尾音类型进行了分类;(2) “发声族群分类”,在多米尼加,两种族群类型的准确率为 95.3%,而在 ETP,四种族群类型的准确率为 93.1%;以及 (3) “个体鲸鱼识别”,使用来自多米尼加的两只抹香鲸,我们获得了 99.4%的准确率。这些结果证明了将 ML 应用于抹香鲸生物声学的可行性,并建立了构建神经网络以学习鲸鱼发声有意义表示的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/95380e7e4d2e/41598_2019_48909_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/29529e325a1c/41598_2019_48909_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/49530c3ca945/41598_2019_48909_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/cd9f8e6483f6/41598_2019_48909_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/95380e7e4d2e/41598_2019_48909_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/29529e325a1c/41598_2019_48909_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/49530c3ca945/41598_2019_48909_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/cd9f8e6483f6/41598_2019_48909_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/42df/6715799/95380e7e4d2e/41598_2019_48909_Fig4_HTML.jpg

相似文献

1
Deep Machine Learning Techniques for the Detection and Classification of Sperm Whale Bioacoustics.深度学习技术在抹香鲸生物声学检测与分类中的应用。
Sci Rep. 2019 Aug 29;9(1):12588. doi: 10.1038/s41598-019-48909-4.
2
The active space of sperm whale codas: inter-click information for intra-unit communication.抹香鲸尾音的活跃空间:单元内通讯的点击间信息。
J Exp Biol. 2024 Feb 15;227(4). doi: 10.1242/jeb.246442. Epub 2024 Feb 21.
3
Sperm whale codas may encode individuality as well as clan identity.抹香鲸的信号可能编码了个体特征以及群体身份。
J Acoust Soc Am. 2016 May;139(5):2860. doi: 10.1121/1.4949478.
4
Evidence from sperm whale clans of symbolic marking in non-human cultures.非人类文化中以抹香鲸族群为证的象征标记。
Proc Natl Acad Sci U S A. 2022 Sep 13;119(37):e2201692119. doi: 10.1073/pnas.2201692119. Epub 2022 Sep 8.
5
The function of male sperm whale slow clicks in a high latitude habitat: communication, echolocation, or prey debilitation?雄性抹香鲸在高纬度栖息地发出低频咔嗒声的功能:是用于交流、回声定位还是使猎物虚弱?
J Acoust Soc Am. 2013 May;133(5):3135-44. doi: 10.1121/1.4795798.
6
Identification of western North Atlantic odontocete echolocation click types using machine learning and spatiotemporal correlates.利用机器学习和时空关联识别北大西洋西部齿鲸的回声定位咔哒声类型。
PLoS One. 2022 Mar 24;17(3):e0264988. doi: 10.1371/journal.pone.0264988. eCollection 2022.
7
Click detection rate variability of central North Pacific sperm whales from passive acoustic towed arrays.从中太平洋被动声学拖曳阵获取的北太平洋中央座头鲸的点击探测率变化。
J Acoust Soc Am. 2024 Apr 1;155(4):2627-2635. doi: 10.1121/10.0025540.
8
Using a coherent hydrophone array for observing sperm whale range, classification, and shallow-water dive profiles.使用相干水听器阵列来观测抹香鲸的活动范围、分类以及浅水潜水剖面。
J Acoust Soc Am. 2014 Jun;135(6):3352-63. doi: 10.1121/1.4874601.
9
Foraging activity of sperm whales (Physeter macrocephalus) off the east coast of New Zealand.新西兰东海岸的抹香鲸(Physeter macrocephalus)觅食活动。
Sci Rep. 2019 Aug 21;9(1):12182. doi: 10.1038/s41598-019-48417-5.
10
Off-axis effects on the multi-pulse structure of sperm whale coda clicks.离轴对抹香鲸尾声滴答声多脉冲结构的影响。
J Acoust Soc Am. 2009 Mar;125(3):1768-73. doi: 10.1121/1.3075598.

引用本文的文献

1
Deep learning-based morphological analysis of human sperm.基于深度学习的人类精子形态分析
Med Biol Eng Comput. 2025 Sep 2. doi: 10.1007/s11517-025-03418-7.
2
AI-Powered Vocalization Analysis in Poultry: Systematic Review of Health, Behavior, and Welfare Monitoring.家禽中基于人工智能的发声分析:健康、行为和福利监测的系统综述
Sensors (Basel). 2025 Jun 29;25(13):4058. doi: 10.3390/s25134058.
3
Artistic Value as a New Paradigm to Promote Ocean Conservation.艺术价值作为促进海洋保护的新范式。

本文引用的文献

1
Human impact erodes chimpanzee behavioral diversity.人类活动破坏了黑猩猩的行为多样性。
Science. 2019 Mar 29;363(6434):1453-1455. doi: 10.1126/science.aau4532. Epub 2019 Mar 7.
2
Animal cultures matter for conservation.动物文化对保护至关重要。
Science. 2019 Mar 8;363(6431):1032-1034. doi: 10.1126/science.aaw3557. Epub 2019 Feb 26.
3
Automatic acoustic estimation of sperm whale size distributions achieved through machine recognition of on-axis clicks.通过对轴上咔嗒声的机器识别实现对抹香鲸大小分布的自动声学估计。
Hist Philos Life Sci. 2025 Jul 2;47(3):33. doi: 10.1007/s40656-025-00679-1.
4
Evaluating the performance of automated detection systems for long-term monitoring of delphinids in diverse marine soundscapes.评估自动检测系统在不同海洋声景中对海豚进行长期监测的性能。
PLoS One. 2025 Jun 11;20(6):e0323768. doi: 10.1371/journal.pone.0323768. eCollection 2025.
5
Multiscale Sample Entropy-Based Feature Extraction with Gaussian Mixture Model for Detection and Classification of Blue Whale Vocalization.基于多尺度样本熵特征提取与高斯混合模型的蓝鲸发声检测与分类
Entropy (Basel). 2025 Mar 28;27(4):355. doi: 10.3390/e27040355.
6
Automatic detection for bioacoustic research: a practical guide from and for biologists and computer scientists.生物声学研究中的自动检测:面向生物学家和计算机科学家的实用指南
Biol Rev Camb Philos Soc. 2025 Apr;100(2):620-646. doi: 10.1111/brv.13155. Epub 2024 Oct 17.
7
Elephants and algorithms: a review of the current and future role of AI in elephant monitoring.大象与算法:人工智能在大象监测中的当前和未来作用综述。
J R Soc Interface. 2023 Nov;20(208):20230367. doi: 10.1098/rsif.2023.0367. Epub 2023 Nov 15.
8
Towards automated long-term acoustic monitoring of endangered river dolphins: a case study in the Brazilian Amazon floodplains.迈向濒危河豚自动长期声学监测:巴西亚马逊泛滥平原案例研究。
Sci Rep. 2023 Jul 27;13(1):10801. doi: 10.1038/s41598-023-36518-1.
9
Applications of Machine Learning in Chemical and Biological Oceanography.机器学习在化学与生物海洋学中的应用。
ACS Omega. 2023 Apr 27;8(18):15831-15853. doi: 10.1021/acsomega.2c06441. eCollection 2023 May 9.
10
There You Are! Automated Detection of Indris' Songs on Features Extracted from Passive Acoustic Recordings.你在这里!基于从被动声学记录中提取的特征自动检测懒猴的歌声。
Animals (Basel). 2023 Jan 9;13(2):241. doi: 10.3390/ani13020241.
J Acoust Soc Am. 2018 Dec;144(6):3485. doi: 10.1121/1.5082291.
4
Kinship influences sperm whale social organization within, but generally not among, social units.亲属关系影响抹香鲸在社会单元内部的社会组织,但通常不影响不同社会单元之间的社会组织。
R Soc Open Sci. 2018 Aug 29;5(8):180914. doi: 10.1098/rsos.180914. eCollection 2018 Aug.
5
Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review.用于图像分类的深度卷积神经网络:全面综述
Neural Comput. 2017 Sep;29(9):2352-2449. doi: 10.1162/NECO_a_00990. Epub 2017 Jun 9.
6
Cultural turnover among Galápagos sperm whales.加拉帕戈斯抹香鲸的文化更替
R Soc Open Sci. 2016 Oct 19;3(10):160615. doi: 10.1098/rsos.160615. eCollection 2016 Oct.
7
Socially segregated, sympatric sperm whale clans in the Atlantic Ocean.大西洋中社会隔离的、同域的抹香鲸族群。
R Soc Open Sci. 2016 Jun 8;3(6):160061. doi: 10.1098/rsos.160061. eCollection 2016 Jun.
8
Sperm whale codas may encode individuality as well as clan identity.抹香鲸的信号可能编码了个体特征以及群体身份。
J Acoust Soc Am. 2016 May;139(5):2860. doi: 10.1121/1.4949478.
9
Individual, unit and vocal clan level identity cues in sperm whale codas.抹香鲸尾音中个体、群体和声学群落层面的身份线索。
R Soc Open Sci. 2016 Jan 20;3(1):150372. doi: 10.1098/rsos.150372. eCollection 2016 Jan.
10
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.