ERATO, Okanoya Emotional Information Project, Japan Science Technology Agency, Wako, Saitama, Japan.
PLoS One. 2011;6(9):e24516. doi: 10.1371/journal.pone.0024516. Epub 2011 Sep 7.
Complex sequencing rules observed in birdsongs provide an opportunity to investigate the neural mechanism for generating complex sequential behaviors. To relate the findings from studying birdsongs to other sequential behaviors such as human speech and musical performance, it is crucial to characterize the statistical properties of the sequencing rules in birdsongs. However, the properties of the sequencing rules in birdsongs have not yet been fully addressed. In this study, we investigate the statistical properties of the complex birdsong of the Bengalese finch (Lonchura striata var. domestica). Based on manual-annotated syllable labeles, we first show that there are significant higher-order context dependencies in Bengalese finch songs, that is, which syllable appears next depends on more than one previous syllable. We then analyze acoustic features of the song and show that higher-order context dependencies can be explained using first-order hidden state transition dynamics with redundant hidden states. This model corresponds to hidden Markov models (HMMs), well known statistical models with a large range of application for time series modeling. The song annotation with these models with first-order hidden state dynamics agreed well with manual annotation, the score was comparable to that of a second-order HMM, and surpassed the zeroth-order model (the Gaussian mixture model; GMM), which does not use context information. Our results imply that the hierarchical representation with hidden state dynamics may underlie the neural implementation for generating complex behavioral sequences with higher-order dependencies.
鸟类鸣叫中的复杂序列规则为研究生成复杂序列行为的神经机制提供了机会。为了将鸟类鸣叫研究中的发现与其他序列行为(如人类言语和音乐演奏)联系起来,刻画鸟类鸣叫中序列规则的统计特性至关重要。然而,鸟类鸣叫中序列规则的特性尚未得到充分研究。在这项研究中,我们研究了孟加拉雀(Lonchura striata var. domestica)复杂鸣叫的统计特性。基于手动标注的音节标签,我们首先表明,孟加拉雀鸣叫中存在显著的高阶上下文依赖性,即下一个出现的音节不仅取决于前一个音节,还取决于前多个音节。然后,我们分析了歌曲的声学特征,并表明高阶上下文依赖性可以用具有冗余隐藏状态的一阶隐状态转移动力学来解释。该模型对应于隐马尔可夫模型(HMM),这是一种具有广泛应用范围的时间序列建模的统计模型。这些具有一阶隐状态动力学的模型对歌曲的标注与手动标注非常吻合,其得分可与二阶 HMM 相媲美,且优于不使用上下文信息的零阶模型(高斯混合模型;GMM)。我们的结果表明,具有隐状态动力学的层次表示可能是生成具有高阶依赖性的复杂行为序列的神经实现基础。