Fort A, Ismaelli A, Manfredi C, Bruscaglioni P
Electronic Engineering Department, University of Florence, Italy.
Med Eng Phys. 1996 Dec;18(8):677-91. doi: 10.1016/s1350-4533(96)00020-3.
The present paper addresses the issue of correctly estimating the peaks in the speech envelope (formants) occurring in newborn infant cry. Clinical studies have shown that the analysis of such spectral characteristics is a helpful noninvasive diagnostic tool. In fact it can be applied to explore brain function at very early stage of child development, for a timely diagnosis of neonatal disease and malformation. The paper focuses on the performance comparison between some classical parametric and non-parametric estimation techniques particularly well suited for the present application, specifically the LP, ARX and cepstrum approaches. It is shown that, if the model order is correctly chosen, parametric methods are in general more reliable and robust against noise, but exhibit a less uniform behaviour than cepstrum. The methods are compared also in terms of tracking capability, since the signals under study are nonstationary. Both simulated and real signals are used in order to outline the relevant features of the proposed approaches.
本文探讨了正确估计新生儿啼哭语音包络(共振峰)中峰值的问题。临床研究表明,对这种频谱特征的分析是一种有用的非侵入性诊断工具。事实上,它可用于在儿童发育的早期阶段探索脑功能,以便及时诊断新生儿疾病和畸形。本文重点比较了一些特别适用于本应用的经典参数估计和非参数估计技术的性能,具体为线性预测(LP)、自回归外生(ARX)和倒谱方法。结果表明,如果正确选择模型阶数,参数方法通常对噪声更可靠、更稳健,但与倒谱相比,其行为不太均匀。由于所研究的信号是非平稳的,因此还从跟踪能力方面对这些方法进行了比较。为了概述所提方法的相关特征,使用了模拟信号和真实信号。