Suppr超能文献

基于倒谱和谱的语音连续语音分析的预测价值和判别能力。

Predictive value and discriminant capacity of cepstral- and spectral-based measures during continuous speech.

机构信息

Department of Communication Sciences and Disorders, Syracuse University, Syracuse, New York 13244, USA.

出版信息

J Voice. 2013 Jul;27(4):393-400. doi: 10.1016/j.jvoice.2013.02.005. Epub 2013 May 16.

Abstract

OBJECTIVES/HYPOTHESIS: The purpose of this study was to determine the relative strength of various cepstral- and spectral-based measures for predicting dysphonia severity and differentiating voice quality types.

STUDY DESIGN

Prospective, quasi-experimental research design.

METHODS

Twenty-eight dysphonic speakers and 14 normal speakers were included in this study. Among the dysphonic speakers, 14 had a predominant voice quality of breathiness and 14 had a predominant voice quality of roughness. Cepstral and spectral analyses of the first and second sentences of the Rainbow passage were performed, along with perceptual ratings of overall dysphonia severity. Linear regression was performed to determine the predictive capacity of each variable for dysphonia severity, and discriminant analysis determined the combination of variables that optimally differentiated the three voice quality types.

RESULTS

A four-factor model that incorporated the cepstral- and spectral-based measures produced an R value of 0.899, explaining 81% of the variance in auditory-perceptual dysphonia severity. Cepstral peak prominence (CPP) showed the greatest predictive contribution to dysphonia severity in the regression model. The discriminant analysis produced two discriminant functions that included both CPP and its standard deviation (CPP SD) as significant contributors (P < 0.001), with an overall classification accuracy for the combined functions of 79%.

CONCLUSIONS

Acoustic measures reflecting the distribution of harmonic energy and low- to high-frequency energy in continuous speech, along with the variability (standard deviations) of each, were highly predictive of dysphonia severity when combined in a multivariate linear model. Cepstral-based measures showed the highest capacity to discriminate voice quality types, with better classification accuracy for normal and dysphonic-breathy than for dysphonic-rough voices.

摘要

目的/假设:本研究旨在确定各种倒谱和光谱测量在预测嗓音障碍严重程度和区分嗓音质量类型方面的相对优势。

研究设计

前瞻性、准实验研究设计。

方法

本研究纳入了 28 名嗓音障碍患者和 14 名正常嗓音者。在嗓音障碍患者中,14 名患者的嗓音质量主要为气息声,14 名患者的嗓音质量主要为粗糙声。对 Rainbow 短文的首句和次句进行了倒谱和光谱分析,并对整体嗓音障碍严重程度进行了感知评估。进行线性回归以确定每个变量对嗓音障碍严重程度的预测能力,判别分析确定了最佳区分三种嗓音质量类型的变量组合。

结果

包含倒谱和光谱测量的四因素模型产生了 0.899 的 R 值,解释了听觉感知嗓音障碍严重程度 81%的方差。倒谱峰突出度(CPP)在回归模型中对嗓音障碍严重程度的预测贡献最大。判别分析产生了两个判别函数,其中包括 CPP 和其标准差(CPP SD)作为重要贡献者(P < 0.001),组合函数的整体分类准确率为 79%。

结论

反映连续语音中谐波能量和低至高频率能量分布的声学测量值,以及每个测量值的变异性(标准差),当结合在多元线性模型中时,对嗓音障碍严重程度具有高度的预测能力。基于倒谱的测量值在区分嗓音质量类型方面显示出最高的能力,对正常嗓音和嗓音障碍性气息声的分类准确率优于嗓音障碍性粗糙声。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验