Kershenbaum Arik, Sayigh Laela S, Janik Vincent M
National Institute for Mathematical and Biological Synthesis, Knoxville, Tennessee, United States of America ; Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel.
PLoS One. 2013 Oct 23;8(10):e77671. doi: 10.1371/journal.pone.0077671. eCollection 2013.
Bottlenose dolphins (Tursiops truncatus) produce many vocalisations, including whistles that are unique to the individual producing them. Such "signature whistles" play a role in individual recognition and maintaining group integrity. Previous work has shown that humans can successfully group the spectrographic representations of signature whistles according to the individual dolphins that produced them. However, attempts at using mathematical algorithms to perform a similar task have been less successful. A greater understanding of the encoding of identity information in signature whistles is important for assessing similarity of whistles and thus social influences on the development of these learned calls. We re-examined 400 signature whistles from 20 individual dolphins used in a previous study, and tested the performance of new mathematical algorithms. We compared the measure used in the original study (correlation matrix of evenly sampled frequency measurements) to one used in several previous studies (similarity matrix of time-warped whistles), and to a new algorithm based on the Parsons code, used in music retrieval databases. The Parsons code records the direction of frequency change at each time step, and is effective at capturing human perception of music. We analysed similarity matrices from each of these three techniques, as well as a random control, by unsupervised clustering using three separate techniques: k-means clustering, hierarchical clustering, and an adaptive resonance theory neural network. For each of the three clustering techniques, a seven-level Parsons algorithm provided better clustering than the correlation and dynamic time warping algorithms, and was closer to the near-perfect visual categorisations of human judges. Thus, the Parsons code captures much of the individual identity information present in signature whistles, and may prove useful in studies requiring quantification of whistle similarity.
宽吻海豚(瓶鼻海豚)会发出多种声音,包括每个个体所特有的哨声。这种“特征哨声”在个体识别和维持群体完整性方面发挥着作用。先前的研究表明,人类能够根据发出特征哨声的个体,成功地将特征哨声的频谱表示进行分组。然而,使用数学算法执行类似任务的尝试却不太成功。更深入地了解特征哨声中身份信息的编码,对于评估哨声的相似性以及这些习得叫声发展过程中的社会影响至关重要。我们重新审视了先前一项研究中使用的来自20只个体宽吻海豚的400个特征哨声,并测试了新数学算法的性能。我们将原始研究中使用的测量方法(均匀采样频率测量的相关矩阵)与先前几项研究中使用的一种方法(时间规整后的哨声相似性矩阵),以及一种基于帕森斯编码的新算法进行了比较,该算法用于音乐检索数据库。帕森斯编码记录了每个时间步长的频率变化方向,并且在捕捉人类对音乐的感知方面很有效。我们使用三种单独的技术——k均值聚类、层次聚类和自适应共振理论神经网络,通过无监督聚类分析了这三种技术以及随机对照的相似性矩阵。对于这三种聚类技术中的每一种,七级帕森斯算法都比相关算法和动态时间规整算法提供了更好的聚类效果,并且更接近人类评判者近乎完美的视觉分类。因此,帕森斯编码捕捉到了特征哨声中存在的许多个体身份信息,并且可能在需要量化哨声相似性的研究中证明是有用的。