Wu Chia-Hsin, Cantor-Cutiva Lady Catherine, Hunter Eric J
Department of Communication Sciences and Disorders, University of Iowa, Iowa City, IA.
Department of Audiology and Speech-Language Pathology, East Tennessee State University, Johnson City, TN.
J Voice. 2025 Sep 3. doi: 10.1016/j.jvoice.2025.08.018.
Strained voice quality, commonly referred to as vocal strain, is a hallmark of functional voice disorders such as muscle tension dysphonia and is often associated with vocal fatigue and laryngeal hyperfunction. Although listeners describe it as excessive vocal effort, strained voice quality frequently overlaps perceptually with breathiness and roughness, complicating reliable assessment. Despite its clinical relevance, no standardized acoustic definition of strained voice quality has been established.
This review aims to identify and summarize the acoustic parameters that have been reported in the literature for quantifying the strain dimension of voice quality.
A scoping review with systematic elements was conducted using four databases (ScienceDirect, PubMed, Virtual Health Library, and Web of Science) covering 1996 to 2024. Of 311 identified records, 13 met the inclusion criteria. Extracted data included definitions of vocal strain, perceptual assessment tools, acoustic metrics, and methodological details.
Strain was consistently treated as a perceptual attribute of voice quality, often described as the impression of excessive effort. Common acoustic metrics included cepstral peak prominence (CPP), spectral slope, low-to-high (L/H) spectral ratio, and relative fundamental frequency (RFF). While several measures showed moderate-to-strong correlations with perceptual ratings of strain, methodological variability across studies limited direct comparisons and interpretability.
Strained voice quality is perceptually complex and acoustically multifaceted. While no single metric reliably captures its full scope, acoustic measures, particularly spectral and cepstral features, can complement perceptual assessments. A multimodal approach that integrates listener-based impressions, acoustic analysis, and, where possible, physiological data is recommended to improve diagnostic consistency and guide future research into strain-sensitive voice assessment tools.