Lee Yogaku, Tanaka Masato, Kato Hikari, Nakagawa Takashi, Ishikawa Satoshi, Kaburagi Tokihiko
Graduate School of Design, Kyushu University, 4-9-1 Shiobaru, Minami-ku, Fukuoka 815-8540, Japan.
J Voice. 2025 Jan 8. doi: 10.1016/j.jvoice.2024.11.041.
Vocal distortion, also known as a scream or growl, is used worldwide as an essential technique in singing, especially in rock and metal, and as an ethnic voice in Mongolian singing. However, the production mechanism of vocal distortion is not yet clearly understood owing to limited research on the behavior of the larynx, which is the source of the distorted voice.
This study used high-speed digital imaging (HSDI) to observe the larynx of professional singers with exceptional singing skills and determine the laryngeal dynamics in the voice production of various vocal distortions. In addition, this study aimed to classify vocal distortions based on laryngeal dynamics.
Cross-sectional study.
The dynamic behavior of the larynx of six singers (four males and two females) was recorded using HSDI, with the majority of recordings captured at frame rates exceeding 8000 fps. The audio signals were recorded simultaneously. The data were analyzed as topography, glottal area waveforms, long-term average spectra, and spectrograms.
Our analysis revealed that, based on the laryngeal vibration pattern, vocal distortion can be categorized into two primary groups: "periodic" and "aperiodic." In "periodic" distorted vocalizations, the results suggest that vocal fold vibration generates a glottal sound source containing overtone components, and that the subharmonic components in the voice result from amplitude modulation of the glottal jet by vibrations in the supraglottic region. In "aperiodic" distorted vocalizations, by contrast, the vocal folds themselves were found to be the main noise source, generating strong noise through aperiodic, chaotic oscillation.
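As a rough illustration of how amplitude modulation can introduce subharmonic components, the sketch below multiplies a single sinusoid at f0 by a slowly varying envelope and measures the resulting spectral lines. It is a toy model under stated assumptions, not the analysis used in the study: the source is reduced to one sinusoid with no overtones, and the modulation rate of f0/2, the modulation depth, and the function names `modulated_source` and `tone_amplitude` are all hypothetical choices made for illustration.

```python
import math


def modulated_source(f0, mod_ratio, depth, sample_rate, duration):
    """Sinusoid at f0 whose amplitude is modulated at f0 * mod_ratio.

    Assumption: a single sinusoid stands in for the glottal source, and the
    modulation rate and depth are illustrative values, not measured ones.
    """
    fm = f0 * mod_ratio
    n = int(sample_rate * duration)
    samples = []
    for i in range(n):
        t = i / sample_rate
        carrier = math.sin(2 * math.pi * f0 * t)
        envelope = 1.0 + depth * math.cos(2 * math.pi * fm * t)
        samples.append(envelope * carrier)
    return samples


def tone_amplitude(signal, freq_hz, sample_rate):
    """Amplitude of the sinusoidal component at freq_hz (single-bin DFT)."""
    n = len(signal)
    re = sum(x * math.cos(2 * math.pi * freq_hz * i / sample_rate)
             for i, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq_hz * i / sample_rate)
             for i, x in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n


if __name__ == "__main__":
    fs = 8000   # samples per second (illustrative)
    f0 = 200.0  # fundamental frequency in Hz (illustrative)
    sig = modulated_source(f0, mod_ratio=0.5, depth=0.5,
                           sample_rate=fs, duration=1.0)
    for f in (f0 / 2, f0, 3 * f0 / 2):
        print(f"{f:6.1f} Hz: {tone_amplitude(sig, f, fs):.3f}")
```

In this toy model the product of envelope and carrier expands into the original line at f0 plus two sidebands at f0 ± f0/2, each with amplitude depth/2. The lower sideband falls at f0/2, below the fundamental, which is the sense in which modulation at half the fundamental rate yields a subharmonic.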
The diversity of distorted vocalizations is due to the physiologically and acoustically complex mechanism of voice production in the larynx.