de Abreu Samuel Ribeiro, Sousa Estevão Silvestre da Silva, de Moraes Ronei Marcos, Lopes Leonardo Wanderley
Graduate Program in Decision Models and Health, Statistics Departament, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil.
Statistics Departament, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil.
J Voice. 2025 Jan;39(1):1-9. doi: 10.1016/j.jvoice.2022.07.002. Epub 2022 Aug 23.
To identify and evaluate the best set of acoustic measures to discriminate among healthy, rough, breathy, and strained voices.
This study used the vocal samples of the sustained /ε/ vowel from 251 patients with the vocal complaints, among which 51, 80, 63, and 57 patients exhibited healthy, rough, breathy, and strained voices, respectively. Twenty-two acoustic measures were extracted, and feature selection was applied to reduce the number of combinations of acoustic measures and obtain an optimal subset of measures according to the information gain attribute ranking algorithm. To classify signals as a function of predominant voice quality, a feedforward neural network was applied using a Levenberg-Marquardt supervised learning algorithm.
The best results were obtained from 11 combinations, with each combination presenting six acoustic measures. Kappa indices ranged from 0.7527 to 0.7743, the overall hit rates are 81.67%-83.27%, and the hit rates of healthy, rough, breathy, and strained voices are 74.51%-84.31%, 78.75%-90.00%, 85.71%-98.41%, and 68.42%-82.46%, respectively.
We obtained the best results from 11 combinations, with each combination exhibiting six acoustic measures for discriminating among healthy, rough, breathy, and strained voices. These sets exhibited good Kappa performance and a good overall hit rate. The hit rate varied between acceptable and good for healthy voices, acceptable and excellent for rough voices, good and excellent for breathy voices, and poor and good for strained voices.
识别并评估用于区分健康嗓音、粗糙嗓音、气息声嗓音和紧张嗓音的最佳声学指标组合。
本研究使用了251例有嗓音问题患者的持续发/ε/元音的嗓音样本,其中分别有51例、80例、63例和57例患者表现出健康嗓音、粗糙嗓音、气息声嗓音和紧张嗓音。提取了22项声学指标,并应用特征选择来减少声学指标的组合数量,并根据信息增益属性排名算法获得指标的最佳子集。为了根据主要嗓音质量对信号进行分类,使用Levenberg-Marquardt监督学习算法应用前馈神经网络。
从11种组合中获得了最佳结果,每种组合呈现6项声学指标。卡帕指数范围为0.7527至0.7743,总体命中率为81.67%-83.27%,健康嗓音、粗糙嗓音、气息声嗓音和紧张嗓音的命中率分别为74.51%-84.31%、78.75%-90.00%、85.71%-98.41%和68.42%-82.46%。
我们从11种组合中获得了最佳结果,每种组合展示了6项用于区分健康嗓音、粗糙嗓音、气息声嗓音和紧张嗓音的声学指标。这些组合表现出良好的卡帕性能和较高的总体命中率。健康嗓音的命中率在可接受和良好之间,粗糙嗓音的命中率在可接受和优秀之间,气息声嗓音的命中率在良好和优秀之间,紧张嗓音的命中率在较差和良好之间。