Saldías O'Hrens Marcelo, Castro Christian, Espinoza Víctor M, Stoney Justin, Quezada Camilo, Laukkanen Anne-Maria
Departamento de Fonoaudiología, Universidad de Chile, Santiago, Chile.
Department Speech and Language Pathology, Universidad de Valparaíso, Valparaíso, Chile.
Logoped Phoniatr Vocol. 2025 Jul;50(2):57-74. doi: 10.1080/14015439.2024.2345373. Epub 2024 Apr 24.
To the best of our knowledge, studies on the relationship between spectral energy distribution and the degree of perceived voices are still sparse. Through an auditory-perceptual test we aimed to explore the spectral features that may relate with the auditory-perception of voices.
Ten judges who were blind to the test's tasks and stimuli rated the amount of twang perceived on seventy-six audio samples. The stimuli consisted of twenty voices recorded from eight CCM singers who sustained the vowel [a:] in different pitches, with and without a voice. Also, forty filtered and sixteen synthesized-manipulated stimuli were included.
Based on the intra-rater reliability scores, four judges were identified as suitable to be included in the analyses. Results showed that the frequency of F and F correlated strongly with the auditory-perception of voices (0.90 and 0.74, respectively), whereas F showed a moderate negative correlation (-0.52). The frequency difference between F and F showed a strong negative correlation (-0.82). The mean energy between 1-2 kHz and 2-3 kHz correlated moderately (0.51 and 0.42, respectively). The frequency of F and F, and the energy above 3 kHz showed weak correlations. Since the spectral changes under 2 kHz have been associated with the jaw, lips, and tongue adjustments (i.e. vowel articulation) and a higher vertical laryngeal position might affect the frequency of all formants (including F and F), our results suggest that vowel articulation and the laryngeal height may be relevant when performing voices.
据我们所知,关于频谱能量分布与感知到的嗓音程度之间关系的研究仍然很少。通过听觉感知测试,我们旨在探索可能与嗓音听觉感知相关的频谱特征。
十名对测试任务和刺激不知情的评判员对76个音频样本中感知到的鼻音程度进行评分。刺激包括从八位持续发元音[a:]的CCM歌手录制的20个嗓音样本,有或无鼻音。此外,还包括40个经过滤波和16个经过合成处理的刺激样本。
根据评分者内信度得分,确定四名评判员适合纳入分析。结果表明,F1和F2的频率与嗓音的听觉感知密切相关(分别为0.90和0.74),而F3呈现中度负相关(-0.52)。F1和F2之间的频率差呈现强负相关(-0.82)。1-2kHz和2-3kHz之间的平均能量呈现中度相关(分别为0.51和0.42)。F1和F2的频率以及3kHz以上的能量呈现弱相关。由于2kHz以下的频谱变化与下颌、嘴唇和舌头的调整(即元音发音)有关,并且较高的喉部垂直位置可能会影响所有共振峰的频率(包括F1和F2),我们的结果表明,在发出鼻音时,元音发音和喉部高度可能是相关的。