Speech tasks and interrater reliability in perceptual voice evaluation.

Author information

Lu Fang-Ling, Matteson Samuel

Affiliations

Department of Speech and Hearing Sciences, University of North Texas, Denton, Texas.

Department of Physics, University of North Texas, Denton, Texas.

Publication information

J Voice. 2014 Nov;28(6):725-32. doi: 10.1016/j.jvoice.2014.01.018. Epub 2014 May 17.

Abstract

OBJECTIVE/HYPOTHESIS

The optimal selection of speech task is essential for more reliable perceptual ratings and a better understanding of the perceptual qualities of pathologic voices. Nevertheless, researchers have rarely explored this issue using the GRBAS scale. This study investigates the effect of speech task selection on interrater reliability during perceptual voice assessment.

STUDY DESIGN

Experimental study.

METHODS

Sixty subjects (39 dysphonic subjects and 21 normal controls) performed 13 speech tasks: three 5-second sustained vowels (/ɑ/, /i/, and /u/), each at three pitch levels (high, habitual, and low); maximum phonation of the vowel /ɑ/; a pitch glide; counting from 1 to 10; and oral reading of the Rainbow Passage. A group of 18 graduate students in speech-language pathology served as perceptual judges and rated the severity of dysphonia in the speech samples on three parameters of the GRBAS scale: Grade, Roughness, and Breathiness. The formalism of the AC1 statistic proposed by Gwet was applied to determine relative reliability between the speech tasks and the raters.
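To make the reliability measure concrete, the following is a minimal sketch of the standard unweighted, multi-rater form of Gwet's AC1. It is not the authors' implementation, which the abstract does not describe. The function name `gwet_ac1`, the input layout (one list of category labels per speech sample), and the toy ratings are all assumptions made for illustration; the ratings are invented and are not study data. The four categories assume that each GRBAS parameter is scored from 0 to 3.

```python
from collections import Counter


def gwet_ac1(ratings, n_categories):
    """Unweighted Gwet's AC1 for any number of raters (illustrative sketch).

    ratings: one list per speech sample, holding the category each rater
             assigned; a missing rating is simply left out of the list.
    n_categories: number of categories on the rating scale (at least 2).
    """
    if n_categories < 2:
        raise ValueError("AC1 needs at least two categories")

    # Per-sample tallies: how many raters chose each category.
    counts = [Counter(row) for row in ratings if len(row) > 0]
    n = len(counts)
    if n == 0:
        raise ValueError("no rated samples")

    # Observed agreement: share of agreeing rater pairs, averaged over
    # the samples that were scored by at least two raters.
    pairwise = []
    for tally in counts:
        r_i = sum(tally.values())
        if r_i >= 2:
            agreeing = sum(r * (r - 1) for r in tally.values())
            pairwise.append(agreeing / (r_i * (r_i - 1)))
    if not pairwise:
        raise ValueError("no sample has two or more ratings")
    p_a = sum(pairwise) / len(pairwise)

    # Category prevalence: mean fraction of raters choosing each category.
    prevalence = Counter()
    for tally in counts:
        r_i = sum(tally.values())
        for category, r in tally.items():
            prevalence[category] += r / r_i / n

    # Chance agreement under Gwet's model; unused categories contribute 0.
    p_e = sum(p * (1 - p) for p in prevalence.values()) / (n_categories - 1)

    return (p_a - p_e) / (1 - p_e)


# Hypothetical Grade scores (0-3) from four raters on five samples.
toy_grade_ratings = [
    [0, 0, 0, 0],
    [1, 1, 2, 1],
    [3, 3, 3, 2],
    [2, 2, 2, 2],
    [0, 1, 0, 0],
]
print(round(gwet_ac1(toy_grade_ratings, n_categories=4), 3))
```

Under these assumptions, comparing speech tasks would amount to computing one such coefficient per task and per GRBAS parameter and ranking the results.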

RESULTS

The counting task and the sustained vowel /ɑ/ in the high, habitual, and low registers exhibited the greatest reproducibility and consequently the highest reliability statistics.

CONCLUSIONS

The counting task and sustained /ɑ/ phonation are the optimal tasks for perceptual voice judgment with regard to interrater reliability. Future perceptual studies may benefit from this finding when determining the relationship between speech task selection and the validity of any given perceptual rating system in terms of sensitivity and specificity.
