A practical guide for generating unsupervised, spectrogram-based latent space representations of animal vocalizations.

Affiliations

Department for the Ecology of Animal Societies, Max Planck Institute of Animal Behavior, Constance, Germany.

Department of Biology, University of Konstanz, Constance, Germany.

Publication information

J Anim Ecol. 2022 Aug;91(8):1567-1581. doi: 10.1111/1365-2656.13754. Epub 2022 Jun 9.

DOI: 10.1111/1365-2656.13754
PMID: 35657634

Abstract

The manual detection, analysis and classification of animal vocalizations in acoustic recordings is laborious and requires expert knowledge. Hence, there is a need for objective, generalizable methods that detect underlying patterns in these data, categorize sounds into distinct groups and quantify similarities between them. Among the computational methods proposed for this purpose, neighbourhood-based dimensionality reduction of spectrograms, which produces a latent space representation of calls, stands out for its conceptual simplicity and effectiveness.

Using a dataset of manually annotated meerkat (Suricata suricatta) vocalizations, we demonstrate how this method can be used to obtain meaningful latent space representations that reflect the established taxonomy of call types. We analyse strengths and weaknesses of the proposed approach, give recommendations for its usage and show application examples, such as the classification of ambiguous calls and the detection of mislabelled calls.

All analyses are accompanied by example code to help researchers realize the potential of this method for the study of animal vocalizations.
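To make the workflow in the abstract concrete, the following is a minimal sketch and not the authors' accompanying code. The abstract does not name a specific algorithm, so Laplacian eigenmaps on a k-nearest-neighbour graph is swapped in here as one simple neighbourhood-based method that needs only NumPy; libraries such as UMAP or t-SNE are common alternatives. Synthetic sine "calls" stand in for real recordings, and all function names, parameters and the weak background graph weight are assumptions made for illustration. The last function shows one plausible way to flag possibly mislabelled calls: compare each call's label with the majority label of its nearest neighbours in latent space.

```python
import numpy as np

def log_spectrogram(signal, n_fft=64, hop=32):
    """Log-magnitude STFT of a 1-D signal, shape (n_frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft)
    starts = range(0, len(signal) - n_fft + 1, hop)
    frames = np.array([signal[s:s + n_fft] * window for s in starts])
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))

def knn(X, k):
    """Indices of the k nearest other rows (Euclidean) for each row of X."""
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    np.fill_diagonal(dist, np.inf)  # a point is not its own neighbour
    return np.argsort(dist, axis=1)[:, :k]

def laplacian_eigenmap(X, k=5, n_components=2, background=1e-3):
    """Embed the rows of X using eigenvectors of a kNN-graph Laplacian.

    `background` is an illustrative assumption: a weak all-to-all weight
    that keeps the graph connected when call types are well separated.
    """
    n = len(X)
    W = np.full((n, n), background)
    np.fill_diagonal(W, 0.0)
    rows = np.repeat(np.arange(n), k)
    W[rows, knn(X, k).ravel()] = 1.0
    W = np.maximum(W, W.T)  # symmetrise the neighbourhood graph
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L_sym = np.eye(n) - W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    _, vecs = np.linalg.eigh(L_sym)  # eigenvalues in ascending order
    # Skip the first (trivial) eigenvector, rescale the following ones.
    return vecs[:, 1:n_components + 1] * d_inv_sqrt[:, None]

def flag_mislabelled(latent, labels, k=3):
    """Indices whose label differs from the majority label of their k
    nearest latent neighbours. Use an odd k with two classes to avoid ties."""
    labels = np.asarray(labels)
    flagged = []
    for i, nbrs in enumerate(knn(latent, k)):
        values, counts = np.unique(labels[nbrs], return_counts=True)
        if values[np.argmax(counts)] != labels[i]:
            flagged.append(i)
    return flagged

# Toy data: two synthetic "call types" that differ in pitch (hypothetical).
rng = np.random.default_rng(0)
t = np.arange(512)

def toy_call(freq):
    """Noisy sine wave; freq is in cycles per sample."""
    return np.sin(2 * np.pi * freq * t) + 0.1 * rng.standard_normal(t.size)

freqs = [0.05] * 20 + [0.20] * 20
labels = ["low"] * 20 + ["high"] * 20
labels[7] = "high"  # deliberately mislabel one call

# One flattened spectrogram per call, then a 2-D latent representation.
features = np.array([log_spectrogram(toy_call(f)).ravel() for f in freqs])
latent = laplacian_eigenmap(features, k=5, n_components=2)
print(flag_mislabelled(latent, labels, k=5))
```

If the two toy call types separate cleanly in the latent space, call 7 would be among the flagged indices, although this depends on the embedding and is not guaranteed. With real recordings, the spectrogram settings, the neighbourhood size and the choice of embedding method would all need tuning.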

Similar articles

1
A practical guide for generating unsupervised, spectrogram-based latent space representations of animal vocalizations.
J Anim Ecol. 2022 Aug;91(8):1567-1581. doi: 10.1111/1365-2656.13754. Epub 2022 Jun 9.
2
The function of nonlinear phenomena in meerkat alarm calls.
Biol Lett. 2011 Feb 23;7(1):47-9. doi: 10.1098/rsbl.2010.0537. Epub 2010 Jul 21.
3
Vocal complexity in the long calls of Bornean orangutans.
PeerJ. 2024 May 14;12:e17320. doi: 10.7717/peerj.17320. eCollection 2024.
4
Mapping vocal interactions in space and time differentiates signal broadcast versus signal exchange in meerkat groups.
Philos Trans R Soc Lond B Biol Sci. 2024 Jul 8;379(1905):20230188. doi: 10.1098/rstb.2023.0188. Epub 2024 May 20.
5
Meerkat close calling patterns are linked to sex, social category, season and wind, but not fecal glucocorticoid metabolite concentrations.
PLoS One. 2017 May 3;12(5):e0175371. doi: 10.1371/journal.pone.0175371. eCollection 2017.
6
Motivation before meaning: motivational information encoded in meerkat alarm calls develops earlier than referential information.
Am Nat. 2007 Jun;169(6):758-67. doi: 10.1086/516719. Epub 2007 Apr 4.
7
Moving calls: a vocal mechanism underlying quorum decisions in cohesive groups.
Proc Biol Sci. 2011 May 22;278(1711):1482-8. doi: 10.1098/rspb.2010.1739. Epub 2010 Nov 3.
8
Discrete call types referring to predation risk enhance the efficiency of the meerkat sentinel system.
Sci Rep. 2017 Mar 17;7:44436. doi: 10.1038/srep44436.
9
Call order within vocal sequences of meerkats contains temporary contextual and individual information.
BMC Biol. 2020 Sep 9;18(1):119. doi: 10.1186/s12915-020-00847-8.
10
Utilizing DeepSqueak for automatic detection and classification of mammalian vocalizations: a case study on primate vocalizations.
Sci Rep. 2021 Dec 27;11(1):24463. doi: 10.1038/s41598-021-03941-1.

Cited by

1
AI-Powered Vocalization Analysis in Poultry: Systematic Review of Health, Behavior, and Welfare Monitoring.
Sensors (Basel). 2025 Jun 29;25(13):4058. doi: 10.3390/s25134058.
2
Representation of high-dimensional cell morphology and morphodynamics in 2D latent space.
Phys Biol. 2025 Apr 24;22(3). doi: 10.1088/1478-3975/adcd37.
3
Vocal repertoire and individuality in the plains zebra ().
R Soc Open Sci. 2024 Jul 10;11(7):240477. doi: 10.1098/rsos.240477. eCollection 2024 Jul.
4
Mapping vocal interactions in space and time differentiates signal broadcast versus signal exchange in meerkat groups.
Philos Trans R Soc Lond B Biol Sci. 2024 Jul 8;379(1905):20230188. doi: 10.1098/rstb.2023.0188. Epub 2024 May 20.
5
Finding the semantic similarity in single-particle diffraction images using self-supervised contrastive projection learning.
NPJ Comput Mater. 2023;9(1):24. doi: 10.1038/s41524-023-00966-0. Epub 2023 Feb 16.
6
Acoustic features as a tool to visualize and explore marine soundscapes: Applications illustrated using marine mammal passive acoustic monitoring datasets.
Ecol Evol. 2024 Feb 21;14(2):e10951. doi: 10.1002/ece3.10951. eCollection 2024 Feb.
7
Vocal complexity in a socially complex corvid: gradation, diversity and lack of common call repertoire in male rooks.
R Soc Open Sci. 2024 Jan 10;11(1):231713. doi: 10.1098/rsos.231713. eCollection 2024 Jan.
8
Deep audio embeddings for vocalisation clustering.
PLoS One. 2023 Jul 10;18(7):e0283396. doi: 10.1371/journal.pone.0283396. eCollection 2023.
9
Improving the workflow to crack Small, Unbalanced, Noisy, but Genuine (SUNG) datasets in bioacoustics: The case of bonobo calls.
PLoS Comput Biol. 2023 Apr 13;19(4):e1010325. doi: 10.1371/journal.pcbi.1010325. eCollection 2023 Apr.
10
Multi-level combinatoriality in magpie non-song vocalizations.
J R Soc Interface. 2023 Feb;20(199):20220679. doi: 10.1098/rsif.2022.0679. Epub 2023 Feb 1.