Department of Head & Neck Surgery, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, California, United States of America.
Dep. of Otorhinolaryngology, Div. of Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nürnberg, Erlangen, Germany.
PLoS One. 2021 Feb 2;16(2):e0246136. doi: 10.1371/journal.pone.0246136. eCollection 2021.
In voice research, uncovering relations between the oscillating vocal folds, being the sound source of phonation, and the resulting perceived acoustic signal are of great interest. This is especially the case in the context of voice disorders, such as functional dysphonia (FD). We investigated 250 high-speed videoendoscopy (HSV) recordings with simultaneously recorded acoustic signals (124 healthy females, 60 FD females, 44 healthy males, 22 FD males). 35 glottal area waveform (GAW) parameters and 14 acoustic parameters were calculated for each recording. Linear and non-linear relations between GAW and acoustic parameters were investigated using Pearson correlation coefficients (PCC) and distance correlation coefficients (DCC). Further, norm values for parameters obtained from 250 ms long sustained phonation data (vowel /i/) were provided. 26 PCCs in females (5.3%) and 8 in males (1.6%) were found to be statistically significant (|corr.| ≥ 0.3). Only minor differences were found between PCCs and DCCs, indicating presence of weak non-linear dependencies between parameters. Fundamental frequency was involved in the majority of all relevant PCCs between GAW and acoustic parameters (19 in females and 7 in males). The most distinct difference between correlations in females and males was found for the parameter Period Variability Index. The study shows only weak relations between investigated acoustic and GAW-parameters. This indicates that the reduction of the complex 3D glottal dynamics to the 1D-GAW may erase laryngeal dynamic characteristics that are reflected within the acoustic signal. Hence, other GAW parameters, 2D-, 3D-laryngeal dynamics and vocal tract parameters should be further investigated towards potential correlations to the acoustic signal.
在语音研究中,揭示声带的振动(发声的声源)与产生的感知声学信号之间的关系非常重要。在语音障碍(如功能性发声障碍(FD))的背景下尤其如此。我们研究了 250 个高速视频内窥镜(HSV)记录,同时记录了声学信号(124 名健康女性、60 名 FD 女性、44 名健康男性、22 名 FD 男性)。为每个记录计算了 35 个声门面积波形(GAW)参数和 14 个声学参数。使用 Pearson 相关系数(PCC)和距离相关系数(DCC)研究了 GAW 和声学参数之间的线性和非线性关系。此外,还提供了从 250ms 长持续发声数据(元音/i/)获得的参数的标准值。在女性中发现了 26 个 PCC(5.3%)和 8 个男性(1.6%)具有统计学意义(|corr.|≥0.3)。仅在女性和男性中,PCC 和 DCC 之间存在较小差异,表明参数之间存在微弱的非线性依赖关系。基频参与了 GAW 和声学参数之间大多数相关 PCC(女性 19 个,男性 7 个)。在女性和男性之间的相关性中,最明显的区别是参数周期变异性指数。该研究表明,在所研究的声学和 GAW 参数之间存在微弱的关系。这表明,将复杂的 3D 声门动力学简化为 1D-GAW 可能会消除反映在声学信号中的喉动态特征。因此,应进一步研究其他 GAW 参数、2D、3D 喉动力学和声道参数,以寻找与声学信号的潜在相关性。