Department of Communication Sciences and Disorders, College of Health and Human Services, Bowling Green State University, Bowling Green, Ohio.
J Voice. 2024 Jul;38(4):889-902. doi: 10.1016/j.jvoice.2022.01.013. Epub 2022 Feb 18.
The use of vocal fry is common in running speech and has potential psychosocial and vocal health consequences. Determining the different patterns of vocal fry is relevant to differentiating phonatory function, understanding cultural and linguistic use of vocal fry, and clinical diagnostics and intervention. The purpose of this project was to study and categorize patterns of vocal fry in running speech.
Analyses were performed on recordings of the Rainbow Passage read aloud by five healthy females aged 18-21 years. Praat was used to create audio files with text grids. The recordings were examined visually (the audio signal and its spectrogram) and perceptually to determine the different patterns of vocal fry present in the audio signal. The criteria for detecting vocal fry were (1) the presence of an acoustic transient (a relatively large and fast dip in acoustic pressure, presumably near glottal closure) with a relatively long cycle period compared to normal phonation periods, or (2) a frequency at or lower than approximately an octave below the nearby normal speaking fundamental frequency.
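Criterion (2) can be illustrated with a minimal sketch. It is not the procedure used in the study, which relied on visual and perceptual inspection. It assumes that cycle periods have already been measured and that a reference speaking fundamental frequency for the nearby normal phonation is supplied by the caller. The function name `flag_fry_cycles` and the parameter `octave_ratio` are hypothetical; the default of 0.5 stands in for "approximately an octave below".

```python
from typing import List


def flag_fry_cycles(
    periods_s: List[float],
    reference_f0_hz: float,
    octave_ratio: float = 0.5,  # assumption: one octave below = half the reference F0
) -> List[bool]:
    """Flag cycles whose frequency is at or below octave_ratio * reference F0.

    periods_s: cycle periods in seconds (assumed to be measured beforehand).
    reference_f0_hz: nearby normal speaking fundamental frequency in Hz
        (assumed to be provided; the source does not say how it was estimated).
    """
    if reference_f0_hz <= 0:
        raise ValueError("reference_f0_hz must be positive")
    threshold_hz = reference_f0_hz * octave_ratio
    flags = []
    for period in periods_s:
        if period <= 0:
            raise ValueError("cycle periods must be positive")
        cycle_f0_hz = 1.0 / period  # frequency of this single cycle
        flags.append(cycle_f0_hz <= threshold_hz)
    return flags


# Illustrative values only, not data from the study:
# periods of 5 ms, 12.5 ms, 20 ms, 5 ms against a 200 Hz reference.
example_flags = flag_fry_cycles([0.005, 0.0125, 0.02, 0.005], 200.0)
print(example_flags)
```

Because the criterion is stated as approximate, the threshold is kept as a parameter rather than fixed. Criterion (1), the acoustic transient with a long cycle period, would need waveform-level analysis and is not covered by this sketch.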
A total of 174 vocal fry samples were obtained from the recordings. Six vocal fry patterns were observed: single pulse fry (FRY1), double pulse fry (FRY2), multiple pulse fry (FRY3), period doubling fry (FRY4), inaudible fry (FRY5), and indeterminate vocal fry (FRY6). Single pulse fry was divided into a single (one) pulse fry cycle (FRY1a), a series of even single pulse fry cycles (FRY1b), and a series of uneven single pulse fry cycles (FRY1c). Double pulse fry was divided into a primary then another primary pulse fry cycle (FRY2a), a secondary then a primary pulse fry cycle (FRY2b), and a primary then a secondary pulse fry cycle (FRY2c). Multiple pulse fry, in which a higher frequency was modulated by a lower frequency, was divided into cases where the higher frequency was near the speaking fundamental frequency (FRY3a) and cases where the higher frequency was inconsistent or well below the expected speaking fundamental frequency (FRY3b). Single pulse fry had the most samples, with 76% of the total occurrences, followed by period doubling fry with 13%; the remaining categories together accounted for 11%. With respect to position within the syllable, 36% of the fry patterns occurred at the onset of the syllable, 26% early in the syllable, 25% later in the syllable, and 13% at the end of the syllable. These tallies exclude the sixth category, indeterminate vocal fry (FRY6), which was not included in the study proper but was recognized as indicating complicated patterns that did not fall within the first five categories.
Vocal fry is a complex, multifaceted phenomenon. The results of this study suggest that there are identifiable patterns of vocal fry. These patterns need to be differentiated, especially with regard to the glottal adductory nature and phonatory function of each pattern. Glottal closure appears to be the primary physiological cause of the salient negative pressure dips (the adduction behavior will be reported in a study in progress). Further research is necessary to identify other potential categories of vocal fry, to determine whether there are individual idiosyncratic patterns of vocal fry, to determine possible differences in vocal fry produced by individuals of different ages, gender expressions, and other factors, and to investigate the physiologic, acoustic, aerodynamic, and perceptual reality of each pattern.