Park Yeonggwang, Baker Brehm Susan, Kelchner Lisa, Weinrich Barbara, McElfresh Kevin, Anand Supraja, Shrivastav Rahul, de Alarcon Alessandro, Eddins David A
Department of Communication Sciences and Disorders, University of Central Florida, Orlando, Florida.
Department of Speech Pathology and Audiology, Miami University, Oxford, Ohio; Division of Speech-Language Pathology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio.
J Voice. 2023 Sep 20. doi: 10.1016/j.jvoice.2023.08.016.
The vibratory source for voicing in children with dysphonia is classified into three categories including a glottal vibratory source (GVS) observed in those with vocal lesions or hyperfunction; supraglottal vibratory sources (SGVS) observed secondary to laryngeal airway injuries, malformations, or reconstruction surgeries; and a combination of both glottal and supraglottal vibratory sources called mixed vibratory source (MVS). This study evaluated the effects of vibratory source on three primary dimensions of voice quality (breathiness, roughness, and strain) in children with GVS, SGVS, and MVS using single-variable matching tasks and computational measures obtained from bio-inspired auditory models.
A total of 44 dysphonic voice samples from children aged 4-11 years were selected. Seven listeners rated breathiness, roughness, and strain of 1000-ms /ɑ/ samples using single-variable matching tasks. Computational estimates of pitch strength, amplitude modulation filterbank output, and sharpness were obtained through custom-designed MATLAB algorithms.
Perceived roughness and strain were significantly higher in children with SGVS and MVS compared to children with GVS. Among the computational measures, only the modulation filterbank output resulted in significant differences among vibratory sources; a posthoc test revealed that children with SGVS had greater amplitude modulation than children with GVS, as expected from their rougher voice quality.
The results indicate that the output of an auditory amplitude modulation filterbank model may capture characteristics of SGVS that are strongly related to the rough voice quality.
发声困难儿童的发声振动源分为三类,包括在有喉部病变或功能亢进的儿童中观察到的声门振动源(GVS);继发于喉气道损伤、畸形或重建手术的声门上振动源(SGVS);以及声门和声门上振动源两者结合的混合振动源(MVS)。本研究使用单变量匹配任务和从仿生听觉模型获得的计算指标,评估了GVS、SGVS和MVS儿童中振动源对嗓音质量三个主要维度(呼吸音、粗糙音和紧张音)的影响。
共选取了44名4 - 11岁发声困难儿童的嗓音样本。7名听众使用单变量匹配任务对1000毫秒/ɑ/样本的呼吸音、粗糙音和紧张音进行评分。通过定制的MATLAB算法获得音高强度、调幅滤波器组输出和锐度的计算估计值。
与GVS儿童相比,SGVS和MVS儿童的粗糙音和紧张音的感知显著更高。在计算指标中,只有调幅滤波器组输出在不同振动源之间产生了显著差异;事后检验显示,正如预期的那样,SGVS儿童的调幅比GVS儿童更大,因为他们的嗓音质量更粗糙。
结果表明,听觉调幅滤波器组模型的输出可能捕捉到与粗糙嗓音质量密切相关的SGVS特征。