• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于海豚发声自动分类的多类卷积神经网络方法。

Multiclass CNN Approach for Automatic Classification of Dolphin Vocalizations.

作者信息

Di Nardo Francesco, De Marco Rocco, Li Veli Daniel, Screpanti Laura, Castagna Benedetta, Lucchetti Alessandro, Scaradozzi David

机构信息

Dipartimento di Ingegneria dell'informazione, Università Politecnica delle Marche, 60131 Ancona, Italy.

Institute of Biological Resources and Marine Biotechnology (IRBIM), National Research Council (CNR), 60125 Ancona, Italy.

出版信息

Sensors (Basel). 2025 Apr 16;25(8):2499. doi: 10.3390/s25082499.

DOI:10.3390/s25082499
PMID:40285189
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12031246/
Abstract

Monitoring dolphins in the open sea is essential for understanding their behavior and the impact of human activities on the marine ecosystems. Passive Acoustic Monitoring (PAM) is a non-invasive technique for tracking dolphins, providing continuous data. This study presents a novel approach for classifying dolphin vocalizations from a PAM acoustic recording using a convolutional neural network (CNN). Four types of common bottlenose dolphin () vocalizations were identified from underwater recordings: whistles, echolocation clicks, burst pulse sounds, and feeding buzzes. To enhance classification performances, edge-detection filters were applied to spectrograms, with the aim of removing unwanted noise components. A dataset of nearly 10,000 spectrograms was used to train and test the CNN through a 10-fold cross-validation procedure. The results showed that the CNN achieved an average accuracy of 95.2% and an F1-score of 87.8%. The class-specific results showed a high accuracy for whistles (97.9%), followed by echolocation clicks (94.5%), feeding buzzes (94.0%), and burst pulse sounds (92.3%). The highest F1-score was obtained for whistles, exceeding 95%, while the other three vocalization typologies maintained an F1-score above 80%. This method provides a promising step toward improving the passive acoustic monitoring of dolphins, contributing to both species conservation and the mitigation of conflicts with fisheries.

摘要

在公海监测海豚对于了解它们的行为以及人类活动对海洋生态系统的影响至关重要。被动声学监测(PAM)是一种用于追踪海豚的非侵入性技术,可提供连续数据。本研究提出了一种使用卷积神经网络(CNN)从PAM声学记录中对海豚叫声进行分类的新方法。从水下记录中识别出了四种常见宽吻海豚的叫声:哨声、回声定位咔哒声、脉冲猝发声和摄食嗡声。为了提高分类性能,将边缘检测滤波器应用于频谱图,以去除不需要的噪声成分。通过10折交叉验证程序,使用一个近10000个频谱图的数据集来训练和测试CNN。结果表明,CNN的平均准确率达到95.2%,F1分数为87.8%。特定类别的结果显示,哨声的准确率很高(97.9%),其次是回声定位咔哒声(94.5%)、摄食嗡声(94.0%)和脉冲猝发声(92.3%)。哨声获得了最高的F1分数,超过95%,而其他三种叫声类型的F1分数保持在80%以上。该方法为改进海豚的被动声学监测迈出了有希望的一步,有助于物种保护和缓解与渔业的冲突。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/52931a133a36/sensors-25-02499-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/2d6891e0d85f/sensors-25-02499-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/831c3e7c8403/sensors-25-02499-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/52931a133a36/sensors-25-02499-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/2d6891e0d85f/sensors-25-02499-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/831c3e7c8403/sensors-25-02499-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/85b3/12031246/52931a133a36/sensors-25-02499-g003.jpg

相似文献

1
Multiclass CNN Approach for Automatic Classification of Dolphin Vocalizations.用于海豚发声自动分类的多类卷积神经网络方法。
Sensors (Basel). 2025 Apr 16;25(8):2499. doi: 10.3390/s25082499.
2
A WAV file dataset of bottlenose dolphin whistles, clicks, and pulse sounds during trawling interactions.在拖网作业过程中宽吻海豚哨声、咔哒声和脉冲声的 WAV 文件数据集。
Sci Data. 2023 Sep 22;10(1):650. doi: 10.1038/s41597-023-02547-8.
3
Description and classification of echolocation clicks of Indian Ocean humpback (Sousa plumbea) and Indo-Pacific bottlenose (Tursiops aduncus) dolphins from Menai Bay, Zanzibar, East Africa.描述和分类印度洋驼背豚( Sousa plumbea )和东非桑给巴尔梅奈湾印度太平洋瓶鼻海豚( Tursiops aduncus )的回声定位点击声。
PLoS One. 2020 Mar 13;15(3):e0230319. doi: 10.1371/journal.pone.0230319. eCollection 2020.
4
Observational study on the non-linear response of dolphins to the presence of vessels.海豚对船只存在的非线性反应的观测研究。
Sci Rep. 2024 Mar 13;14(1):6062. doi: 10.1038/s41598-024-56654-6.
5
Bottlenose dolphins exchange signature whistles when meeting at sea.宽吻海豚在海上相遇时会交换标志性的哨声。
Proc Biol Sci. 2012 Jul 7;279(1738):2539-45. doi: 10.1098/rspb.2011.2537. Epub 2012 Feb 29.
6
Directional properties of bottlenose dolphin (Tursiops truncatus) clicks, burst-pulse, and whistle sounds.宽吻海豚(Tursiops truncatus)咔哒声、爆发脉冲和哨声的指向性特征。
J Acoust Soc Am. 2012 Feb;131(2):1613-21. doi: 10.1121/1.3676694.
7
Discriminating features of echolocation clicks of melon-headed whales (Peponocephala electra), bottlenose dolphins (Tursiops truncatus), and Gray's spinner dolphins (Stenella longirostris longirostris).瓜头鲸(Peponocephala electra)、宽吻海豚(Tursiops truncatus)和长嘴海豚(Stenella longirostris longirostris)回声定位声纳脉冲的鉴别特征。
J Acoust Soc Am. 2010 Oct;128(4):2212-24. doi: 10.1121/1.3479549.
8
Changes in whistle structure of resident bottlenose dolphins in relation to underwater noise and boat traffic.海豚鸣叫声结构变化与水下噪声和船只交通的关系
Mar Pollut Bull. 2016 Apr 15;105(1):193-8. doi: 10.1016/j.marpolbul.2016.02.030. Epub 2016 Feb 23.
9
Increased number of whistles of bottlenose dolphins, Tursiops truncatus, arising from interaction with people.宽吻海豚(Tursiops truncatus)与人类互动导致口哨声数量增加。
J Vet Med Sci. 2007 Feb;69(2):165-70. doi: 10.1292/jvms.69.165.
10
Linking the sounds of dolphins to their locations and behavior using video and multichannel acoustic recordings.利用视频和多通道声学记录将海豚的声音与其位置和行为联系起来。
J Acoust Soc Am. 2002 Oct;112(4):1692-701. doi: 10.1121/1.1494805.

本文引用的文献

1
A WAV file dataset of bottlenose dolphin whistles, clicks, and pulse sounds during trawling interactions.在拖网作业过程中宽吻海豚哨声、咔哒声和脉冲声的 WAV 文件数据集。
Sci Data. 2023 Sep 22;10(1):650. doi: 10.1038/s41597-023-02547-8.
2
Automated detection of dolphin whistles with convolutional networks and transfer learning.利用卷积网络和迁移学习自动检测海豚哨声。
Front Artif Intell. 2023 Jan 26;6:1099022. doi: 10.3389/frai.2023.1099022. eCollection 2023.
3
Vocal universals and geographic variations in the acoustic repertoire of the common bottlenose dolphin.
常见宽吻海豚声学曲目中的语音共性和地理差异。
Sci Rep. 2021 Jun 4;11(1):11847. doi: 10.1038/s41598-021-90710-9.
4
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions.深度学习综述:概念、卷积神经网络架构、挑战、应用及未来方向。
J Big Data. 2021;8(1):53. doi: 10.1186/s40537-021-00444-8. Epub 2021 Mar 31.
5
Beluga whale acoustic signal classification using deep learning neural network models.使用深度学习神经网络模型对白鲸声学信号进行分类。
J Acoust Soc Am. 2020 Mar;147(3):1834. doi: 10.1121/10.0000921.
6
Deep neural networks for automated detection of marine mammal species.用于自动检测海洋哺乳动物物种的深度神经网络。
Sci Rep. 2020 Jan 17;10(1):607. doi: 10.1038/s41598-020-57549-y.
7
Forward shift of feeding buzz components of dolphins and belugas during associative learning reveals a likely connection to reward expectation, pleasure and brain dopamine activation.在联想学习过程中,海豚和白鲸进食嗡嗡声成分的向前偏移揭示了其与奖励期望、愉悦感及大脑多巴胺激活之间可能存在的联系。
J Exp Biol. 2014 Aug 15;217(Pt 16):2910-9. doi: 10.1242/jeb.100511.
8
The encoding of individual identity in dolphin signature whistles: how much information is needed?海豚特征哨声中个体身份的编码:需要多少信息?
PLoS One. 2013 Oct 23;8(10):e77671. doi: 10.1371/journal.pone.0077671. eCollection 2013.
9
Automatic detection and classification of odontocete whistles.齿鲸哨声的自动检测和分类。
J Acoust Soc Am. 2013 Sep;134(3):2427-37. doi: 10.1121/1.4816555.
10
Listening to the Deep: live monitoring of ocean noise and cetacean acoustic signals.倾听深海:海洋噪声和鲸类声信号的实时监测。
Mar Pollut Bull. 2011;63(1-4):18-26. doi: 10.1016/j.marpolbul.2011.04.038.