使用动态时间规整和隐马尔可夫模型从连续录音中自动识别鸟鸣元素：一项比较研究。

Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study.

作者信息

Kogan J A, Margoliash D

机构信息

Department of Organismal Biology and Anatomy, University of Chicago, Illinois 60637, USA.

出版信息

J Acoust Soc Am. 1998 Apr;103(4):2185-96. doi: 10.1121/1.421364.

DOI:10.1121/1.421364

PMID:9566338

Abstract

The performance of two techniques is compared for automated recognition of bird song units from continuous recordings. The advantages and limitations of dynamic time warping (DTW) and hidden Markov models (HMMs) are evaluated on a large database of male songs of zebra finches (Taeniopygia guttata) and indigo buntings (Passerina cyanea), which have different types of vocalizations and have been recorded under different laboratory conditions. Depending on the quality of recordings and complexity of song, the DTW-based technique gives excellent to satisfactory performance. Under challenging conditions such as noisy recordings or presence of confusing short-duration calls, good performance of the DTW-based technique requires careful selection of templates that may demand expert knowledge. Because HMMs are trained, equivalent or even better performance of HMMs can be achieved based only on segmentation and labeling of constituent vocalizations, albeit with many more training examples than DTW templates. One weakness in HMM performance is the misclassification of short-duration vocalizations or song units with more variable structure (e.g., some calls, and syllables of plastic songs). To address these and other limitations, new approaches for analyzing bird vocalizations are discussed.

摘要

对两种技术从连续录音中自动识别鸟鸣单元的性能进行了比较。在一个包含斑胸草雀（Taeniopygia guttata）和靛蓝彩鹀（Passerina cyanea）雄鸟鸣唱的大型数据库上评估了动态时间规整（DTW）和隐马尔可夫模型（HMM）的优缺点，这两种鸟类具有不同类型的发声，且在不同实验室条件下进行了录音。根据录音质量和歌声复杂度，基于DTW的技术表现优异至令人满意。在具有挑战性的条件下，如嘈杂录音或存在令人混淆的短时长叫声时，基于DTW的技术要取得良好性能需要仔细选择模板，这可能需要专业知识。由于HMM是经过训练的，基于组成发声的分割和标记，HMM可以实现同等甚至更好的性能，尽管训练示例比DTW模板多得多。HMM性能的一个弱点是对短时长发声或结构更具变异性的歌声单元（例如，一些叫声和可塑性歌声的音节）的错误分类。为了解决这些及其他限制，讨论了分析鸟鸣的新方法。

相似文献

Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study.使用动态时间规整和隐马尔可夫模型从连续录音中自动识别鸟鸣元素：一项比较研究。

J Acoust Soc Am. 1998 Apr;103(4):2185-96. doi: 10.1121/1.421364.

Template-based automatic recognition of birdsong syllables from continuous recordings.基于模板的从连续录音中自动识别鸟鸣音节

J Acoust Soc Am. 1996 Aug;100(2 Pt 1):1209-19. doi: 10.1121/1.415968.

Testosterone facilitates some conspecific song discriminations in castrated zebra finches (Taeniopygia guttata).睾酮促进阉割的斑胸草雀（Taeniopygia guttata）对同种鸣叫声的某些辨别。

Proc Natl Acad Sci U S A. 1992 Feb 15;89(4):1376-8. doi: 10.1073/pnas.89.4.1376.

A robust automatic birdsong phrase classification: A template-based approach.一种强大的自动鸟鸣短语分类：基于模板的方法。

J Acoust Soc Am. 2016 Nov;140(5):3691. doi: 10.1121/1.4966592.

Song perception during the sensitive period of song learning in zebra finches (Taeniopygia guttata).斑胸草雀（Taeniopygia guttata）鸣叫学习敏感期的鸣叫感知

J Comp Psychol. 2006 May;120(2):79-88. doi: 10.1037/0735-7036.120.2.79.

Role of gender, season, and familiarity in discrimination of conspecific song by zebra finches (Taeniopygia guttata).性别、季节和熟悉程度在斑胸草雀（Taeniopygia guttata）对同种鸟鸣声辨别中的作用。

Proc Natl Acad Sci U S A. 1992 Feb 15;89(4):1368-71. doi: 10.1073/pnas.89.4.1368.

Cross-fostering diminishes song discrimination in zebra finches (Taeniopygia guttata).交叉寄养会降低斑胸草雀（Taeniopygia guttata）的鸣声辨别能力。

Anim Cogn. 2009 May;12(3):481-90. doi: 10.1007/s10071-008-0209-5. Epub 2009 Jan 7.

Global synchronous response to autogenous song in zebra finch HVc.斑胸草雀HVC对自身鸣叫的全局同步反应。

J Neurophysiol. 1994 Nov;72(5):2105-23. doi: 10.1152/jn.1994.72.5.2105.

Neural auditory selectivity develops in parallel with song.神经听觉选择性与鸣唱同步发展。

J Neurobiol. 2005 Mar;62(4):469-81. doi: 10.1002/neu.20115.

Comparisons of different methods to train a young zebra finch (Taeniopygia guttata) to learn a song.训练幼年斑胸草雀（斑胸草雀）学习歌曲的不同方法比较。

J Physiol Paris. 2013 Jun;107(3):210-8. doi: 10.1016/j.jphysparis.2012.08.003. Epub 2012 Sep 8.

引用本文的文献

Embedding stochastic dynamics of the environment in spontaneous activity by prediction-based plasticity.通过基于预测的可塑性将环境的随机动力学嵌入自发活动中。

Elife. 2025 Jun 11;13:RP95243. doi: 10.7554/eLife.95243.

Song in a Social and Sexual Context: Vocalizations Signal Identity and Rank in Both Sexes of a Cooperative Breeder.社交与性情境中的鸣叫：发声在一种合作繁殖鸟类的两性中均能传递身份和等级信息

Front Ecol Evol. 2016 May;4. doi: 10.3389/fevo.2016.00046. Epub 2016 May 3.

Recent Advances at the Interface of Neuroscience and Artificial Neural Networks.神经科学与人工神经网络的界面的最新进展。

J Neurosci. 2022 Nov 9;42(45):8514-8523. doi: 10.1523/JNEUROSCI.1503-22.2022.

Automated annotation of birdsong with a neural network that segments spectrograms.使用对声谱图进行分割的神经网络自动标注鸟鸣。

Elife. 2022 Jan 20;11:e63853. doi: 10.7554/eLife.63853.

Toward a Computational Neuroethology of Vocal Communication: From Bioacoustics to Neurophysiology, Emerging Tools and Future Directions.迈向声音交流的计算神经行为学：从生物声学至神经生理学，新兴工具与未来方向。

Front Behav Neurosci. 2021 Dec 20;15:811737. doi: 10.3389/fnbeh.2021.811737. eCollection 2021.

Neurally driven synthesis of learned, complex vocalizations.神经驱动的学习型复杂发声合成。

Curr Biol. 2021 Aug 9;31(15):3419-3425.e5. doi: 10.1016/j.cub.2021.05.035. Epub 2021 Jun 16.

Finding, visualizing, and quantifying latent structure across diverse animal vocal repertoires.发现、可视化和量化不同动物声谱中的潜在结构。

PLoS Comput Biol. 2020 Oct 15;16(10):e1008228. doi: 10.1371/journal.pcbi.1008228. eCollection 2020 Oct.

Recognition of bird species based on spike model using bird dataset.基于鸟类数据集的尖峰模型对鸟类物种的识别。

Data Brief. 2020 Feb 20;29:105301. doi: 10.1016/j.dib.2020.105301. eCollection 2020 Apr.

ORCA-SPOT: An Automatic Killer Whale Sound Detection Toolkit Using Deep Learning.ORCA-SPOT：一个使用深度学习的自动虎鲸声音检测工具包。

Sci Rep. 2019 Jul 29;9(1):10997. doi: 10.1038/s41598-019-47335-w.

Quantitative acoustic differentiation of cryptic species illustrated with King and Clapper rails.以王秧鸡和长嘴秧鸡为例说明隐存种的定量声学鉴别

Ecol Evol. 2018 Nov 20;8(24):12821-12831. doi: 10.1002/ece3.4711. eCollection 2018 Dec.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用动态时间规整和隐马尔可夫模型从连续录音中自动识别鸟鸣元素：一项比较研究。

Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献