Behroozmand Roozbeh, Almasganj Farshad
Faculty of Biomedical Engineering, Amirkabir University of Technology, Tehran, Iran.
Comput Biol Med. 2007 Apr;37(4):474-85. doi: 10.1016/j.compbiomed.2006.08.016. Epub 2006 Oct 10.
Unilateral vocal fold paralysis (UVFP) is one of the most severe types of neurogenic laryngeal disorder in which the patients, due to their vocal cords malfunction, are confronted by some serious problems. As the effect of such pathologies would be significantly evident in the reduced quality and feature variation of dysphonic voices, this study is designed to scrutinize the piecewise variation of some specific types of these features, known as energy and entropy, all over the frequency range of pathological speech signals. In order to do so, the wavelet-packet coefficients, in five consecutive levels of decomposition, are used to extract the energy and entropy measures at different spectral sub-bands. As the decomposition procedure leads to a set of high-dimensional feature vectors, genetic algorithm is invoked to search for a group of optimal sub-band indexes for which the extracted features result in the highest recognition rate for pathological and normal subjects' classification. The results of our simulations, using support vector machine classifier, show that the highest recognition rate, for both optimized energy and entropy measures, is achieved at the fifth level of wavelet-packet decomposition. It is also found that entropy feature, with the highest recognition rate of 100% vs. 93.62% for energy, is more prominent in discriminating patients with UVFP from normal subjects. Therefore, entropy feature, in comparison with energy, demonstrates a more efficient description of such pathological voices and provides us a valuable tool for clinical diagnosis of unilateral laryngeal paralysis.
单侧声带麻痹(UVFP)是最严重的神经性喉部疾病之一,患者由于声带功能障碍而面临一些严重问题。由于此类病症的影响在发音障碍声音的质量下降和特征变化中会明显体现,本研究旨在仔细检查这些特定类型特征(即能量和熵)在病理性语音信号整个频率范围内的分段变化。为此,在连续五级分解中使用小波包系数来提取不同频谱子带处的能量和熵测度。由于分解过程会产生一组高维特征向量,因此调用遗传算法来搜索一组最优子带索引,对于这些索引,提取的特征能使病理性和正常受试者分类的识别率最高。我们使用支持向量机分类器进行模拟的结果表明,对于优化后的能量和熵测度,在小波包分解的第五级时识别率最高。还发现,熵特征在区分UVFP患者与正常受试者方面更为突出,其最高识别率为100%,而能量的最高识别率为93.62%。因此,与能量相比,熵特征能更有效地描述此类病理性声音,并为单侧喉麻痹的临床诊断提供了一个有价值的工具。