Suppr超能文献

鸟类发声通讯基本单元的识别、分析与特征描述:以白喉红臀鹎为例的案例研究

Identification, Analysis and Characterization of Base Units of Bird Vocal Communication: The White Spectacled Bulbul () as a Case Study.

作者信息

Marck Aya, Vortman Yoni, Kolodny Oren, Lavner Yizhar

机构信息

The Department of Ecology, Evolution and Behavior, The Hebrew University of Jerusalem, Jerusalem, Israel.

Department of Animal Sciences, Hula Research Center, Tel-Hai College, Tel-Hai, Israel.

出版信息

Front Behav Neurosci. 2022 Feb 14;15:812939. doi: 10.3389/fnbeh.2021.812939. eCollection 2021.

Abstract

Animal vocal communication is a broad and multi-disciplinary field of research. Studying various aspects of communication can provide key elements for understanding animal behavior, evolution, and cognition. Given the large amount of acoustic data accumulated from automated recorders, for which manual annotation and analysis is impractical, there is a growing need to develop algorithms and automatic methods for analyzing and identifying animal sounds. In this study we developed an automatic detection and analysis system based on audio signal processing algorithms and deep learning that is capable of processing and analyzing large volumes of data without human bias. We selected the White Spectacled Bulbul () as our bird model because it has a complex vocal communication system with a large repertoire which is used by both sexes, year-round. It is a common, widespread passerine in Israel, which is relatively easy to locate and record in a broad range of habitats. Like many passerines, the Bulbul's vocal communication consists of two primary hierarchies of utterances, and . To extract each of these units' characteristics, the fundamental frequency contour was modeled using a low degree Legendre polynomial, enabling it to capture the different patterns of variation from different vocalizations, so that each pattern could be effectively expressed using very few coefficients. In addition, a mel-spectrogram was computed for each unit, and several features were extracted both in the time-domain (e.g., zero-crossing rate and energy) and frequency-domain (e.g., spectral centroid and spectral flatness). We applied both linear and non-linear dimensionality reduction algorithms on feature vectors and validated the findings that were obtained manually, namely by listening and examining the spectrograms visually. Using these algorithms, we show that the Bulbul has a complex vocabulary of more than 30 words, that there are multiple syllables that are combined in different words, and that a particular syllable can appear in several words. Using our system, researchers will be able to analyze hundreds of hours of audio recordings, to obtain objective evaluation of repertoires, and to identify different vocal units and distinguish between them, thus gaining a broad perspective on bird vocal communication.

摘要

动物声音交流是一个广泛的多学科研究领域。研究交流的各个方面可以为理解动物行为、进化和认知提供关键要素。鉴于从自动记录仪积累了大量声学数据,手动标注和分析不切实际,因此越来越需要开发用于分析和识别动物声音的算法和自动方法。在本研究中,我们基于音频信号处理算法和深度学习开发了一个自动检测和分析系统,该系统能够处理和分析大量数据而不受人为偏差影响。我们选择白眼镜鹎作为我们的鸟类模型,因为它具有复杂的声音交流系统,有大量的发声曲目,两性全年都使用。它是以色列常见且分布广泛的雀形目鸟类,在广泛的栖息地中相对容易定位和记录。像许多雀形目鸟类一样,鹎的声音交流由两种主要的发声层次组成,即 和 。为了提取这些单元各自的特征,使用低阶勒让德多项式对基频轮廓进行建模,使其能够捕捉不同发声的不同变化模式,从而可以用很少的系数有效地表达每种模式。此外,为每个单元计算了梅尔频谱图,并在时域(例如过零率和能量)和频域(例如谱质心和谱平坦度)中提取了几个特征。我们对特征向量应用了线性和非线性降维算法,并验证了通过人工(即通过聆听和直观检查频谱图)获得的结果。使用这些算法,我们表明鹎有一个由30多个单词组成的复杂词汇表,有多个音节组合在不同的单词中,并且一个特定的音节可以出现在几个单词中。使用我们的系统,研究人员将能够分析数百小时的音频记录,获得对发声曲目的客观评估,并识别不同的声音单元并区分它们,从而对鸟类声音交流有更广泛的了解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/839f/8884146/5e220728867b/fnbeh-15-812939-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验