Crampton William G R, Davis Justin K, Lovejoy Nathan R, Pensky Marianna
Department of Biology, University of Central Florida, Orlando, FL 32816-2368, USA.
J Physiol Paris. 2008 Jul-Nov;102(4-6):304-21. doi: 10.1016/j.jphysparis.2008.10.001. Epub 2008 Nov 1.
Evolutionary studies of communication can benefit from classification procedures that allow individual animals to be assigned to groups (e.g. species) on the basis of high-dimension data representing their signals. Prior to classification, signals are usually transformed by a signal processing procedure into structural features. Applications of these signal processing procedures to animal communication have been largely restricted to the manual or semi-automated identification of landmark features from graphical representations of signals. Nonetheless, theory predicts that automated time-frequency-based digital signal processing (DSP) procedures can represent signals more efficiently (using fewer features) than can landmark procedures or frequency-based DSP - allowing more accurate classification. Moreover, DSP procedures are objective in that they require little previous knowledge of signal diversity, and are relatively free from potentially ungrounded assumptions of cross-taxon homology. Using a model data set of electric organ discharge waveforms from five sympatric species of the electric fish Gymnotus, we adopted an exhaustive simulation approach to investigate the classificatory performance of different signal processing procedures. We considered a landmark procedure, a frequency-based DSP procedure (the fast Fourier transform), and two kinds of time-frequency-based DSP procedures (a short-time Fourier transform, and several implementations of the discrete wavelet transform -DWT). The features derived from each of these signal processing procedures were then subjected to dimension reduction procedures to separate those features which permit the most effective discrimination among groups of signalers. We considered four alternative dimension reduction methods. Finally, each combination of reduced data was submitted to classification by linear discriminant analysis. Our results support theoretical predictions that time-frequency DSP procedures (especially DWT) permit more efficient discrimination of groups. The performance of signal processing was found to depend largely upon the dimension reduction procedure employed, and upon the number of resulting features. Because the best combinations of procedures are dataset-dependent and difficult to predict, we conclude that simulations of the kind described here, or at least simplified versions of them, should be routinely executed before classification of animal signals - especially unfamiliar ones.
通信的进化研究可以从分类程序中受益,这些程序允许根据代表动物信号的高维数据将个体动物归为不同的组(如物种)。在分类之前,信号通常通过信号处理程序转换为结构特征。这些信号处理程序在动物通信中的应用很大程度上局限于从信号的图形表示中手动或半自动识别标志性特征。尽管如此,理论预测,基于时频的自动数字信号处理(DSP)程序比标志性程序或基于频率的DSP程序能更有效地(使用更少的特征)表示信号,从而实现更准确的分类。此外,DSP程序具有客观性,因为它们几乎不需要信号多样性的先验知识,并且相对不受跨分类群同源性潜在无根据假设的影响。我们使用来自电鱼裸背电鳗属五个同域物种的电器官放电波形的模型数据集,采用详尽的模拟方法来研究不同信号处理程序的分类性能。我们考虑了一种标志性程序、一种基于频率的DSP程序(快速傅里叶变换)以及两种基于时频的DSP程序(短时傅里叶变换和离散小波变换-DWT的几种实现)。然后,对从这些信号处理程序中得出的特征进行降维处理,以分离出那些能够最有效地区分信号发送者群体的特征。我们考虑了四种替代降维方法。最后,将降维后的数据的每种组合提交给线性判别分析进行分类。我们的结果支持理论预测,即时频DSP程序(尤其是DWT)能够更有效地区分群体。发现信号处理的性能在很大程度上取决于所采用的降维程序以及所得特征的数量。由于最佳的程序组合取决于数据集且难以预测,我们得出结论,在此类动物信号分类之前,尤其是对于不熟悉的信号,应常规执行此处所述的模拟或至少其简化版本。