Fort A, Manfredi C
Electronic Engineering Department, University of Florence, Italy.
Med Eng Phys. 1998 Sep;20(6):432-42. doi: 10.1016/s1350-4533(98)00045-9.
This paper aims at estimating the fundamental frequency (pitch) and the vocal tract resonant frequencies (formants) from newborn infant cry signals. Such parameters are of interest in exploring brain function at early stages of child development, for the timely diagnosis of neonatal disease and malformation. The paper compares a spectral parametric technique and the cepstrum approach, extending previous results. The parametric technique is based on autoregressive models whose order is adaptively estimated on subsequent signal frames by means of a new method. This allows the correct tracking of pitch and formant variations with time. The traditional cepstrum approach is modified in order to follow signal variability. In particular, the cepstrum spectral resolution is improved by applying the chirp Z-transform (CZT) and by adaptively varying the 'lifter' length. The two methods are tested on simulated data, as far as robustness to noise and spectral resolution are concerned, and are then applied to real baby cry data.
本文旨在从新生儿啼哭信号中估计基频(音高)和声道共振频率(共振峰)。在探索儿童发育早期阶段的脑功能、及时诊断新生儿疾病和畸形方面,这些参数具有重要意义。本文比较了一种谱参数技术和倒谱方法,并扩展了先前的结果。参数技术基于自回归模型,其阶数通过一种新方法在后续信号帧上进行自适应估计。这使得能够正确跟踪音高和共振峰随时间的变化。传统的倒谱方法经过修改以适应信号的变化。特别是,通过应用线性调频Z变换(CZT)和自适应改变“升余弦加权”长度来提高倒谱的频谱分辨率。就对噪声的鲁棒性和频谱分辨率而言,这两种方法在模拟数据上进行了测试,然后应用于真实的婴儿啼哭数据。