Speech tasks and interrater reliability in perceptual voice evaluation.

Author information

Lu Fang-Ling, Matteson Samuel

Affiliations

Department of Speech and Hearing Sciences, University of North Texas, Denton, Texas.

Department of Physics, University of North Texas, Denton, Texas.

Publication information

J Voice. 2014 Nov;28(6):725-32. doi: 10.1016/j.jvoice.2014.01.018. Epub 2014 May 17.

DOI:10.1016/j.jvoice.2014.01.018

PMID:24841668

Abstract

OBJECTIVE/HYPOTHESIS

The optimal selection of speech task is essential for more reliable perceptual ratings and a better understanding of the perceptual qualities of pathologic voices. Nevertheless, researchers have rarely explored this issue using the GRBAS scale. This study investigates the effect of speech task selection on interrater reliability during perceptual voice assessment.

STUDY DESIGN

Experimental study.

METHODS

Sixty subjects (39 dysphonic subjects and 21 normal controls) performed 13 speech tasks: three 5-second sustained vowel sounds (/ɑ/, /i/, and /u/), each at three pitch levels (high, habitual, and low); maximum phonation of the vowel /ɑ/; a pitch glide; counting from 1 to 10; and oral reading of the Rainbow Passage. A group of 18 graduate students in speech-language pathology served as perceptual judges and rated the dysphonic severity of the speech samples on three parameters of the GRBAS scale: Grade, Roughness, and Breathiness. The formalism of the AC1 statistic proposed by Gwet was applied to determine relative reliability between the speech tasks and the raters.
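
For readers unfamiliar with the AC1 statistic, the following is a minimal sketch of how an unweighted, multi-rater AC1 coefficient might be computed for one speech task. It is not the authors' implementation: the function name `gwet_ac1`, the data layout (one list of ratings per speech sample), the treatment of the 0 to 3 GRBAS scores as nominal categories, and the assumption that every sample is rated by at least two raters are all illustrative choices. The abstract does not state whether a weighted variant was used.

```python
from collections import Counter


def gwet_ac1(ratings, categories):
    """Unweighted multi-rater AC1 agreement coefficient (illustrative sketch).

    ratings:    list of samples; each sample is a list of category labels,
                one label per rater who scored that sample.
    categories: all possible labels (e.g. [0, 1, 2, 3] for a GRBAS parameter).
    """
    q = len(categories)
    n = len(ratings)
    if q < 2 or n == 0:
        raise ValueError("need at least two categories and one sample")

    agreement_sum = 0.0
    prevalence_sum = {k: 0.0 for k in categories}

    for sample in ratings:
        r = len(sample)
        if r < 2:
            raise ValueError("each sample needs at least two ratings")
        counts = Counter(sample)
        # Proportion of rater pairs that agree on this sample.
        agreement_sum += sum(c * (c - 1) for c in counts.values()) / (r * (r - 1))
        # Share of raters choosing each category for this sample.
        for k in categories:
            prevalence_sum[k] += counts.get(k, 0) / r

    p_a = agreement_sum / n                              # observed agreement
    pi = [prevalence_sum[k] / n for k in categories]     # mean category shares
    p_e = sum(p * (1 - p) for p in pi) / (q - 1)         # chance agreement
    return (p_a - p_e) / (1 - p_e)


if __name__ == "__main__":
    # Hypothetical Grade ratings (0-3) from three raters on four samples.
    counting_task = [[0, 0, 0], [2, 2, 1], [3, 3, 3], [1, 1, 1]]
    print(round(gwet_ac1(counting_task, [0, 1, 2, 3]), 3))
```

Computing the coefficient separately for each task and each GRBAS parameter would allow the tasks to be ranked by interrater agreement, which is the kind of comparison the study describes.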

RESULTS

The counting task and the sustained vowel /ɑ/ in high, habitual, and low registers exhibited the greatest reproducibility and, consequently, the highest reliability statistics.

CONCLUSIONS

The counting task and sustained /ɑ/ phonation are the optimal tasks for perceptual voice judgment with regard to interrater reliability. Future perceptual studies may benefit from this finding when determining the relationship between speech task selection and the validity of any given perceptual rating system in terms of sensitivity and specificity.
