Adachi Seiji, Yu Jason
ATR Human Information Science Laboratories, 2-2-2 Hikaridai, Keihanna Science City, Kyoto 619-0288 Japan.
J Acoust Soc Am. 2005 May;117(5):3213-24. doi: 10.1121/1.1861592.
Voiced sounds were simulated with a computer model of the vocal fold composed of a single mass vibrating both parallel and perpendicular to the airflow. Similarities with the two-mass model are found in the amplitudes of the glottal area and the glottal volume flow velocity, the variation in the volume flow waveform with the vocal tract shape, and the dependence of the oscillation amplitude upon the average opening area of the glottis, among other similar features. A few dissimilarities are also found in the more symmetric glottal and volume flow waveforms in the rising and falling phases. The major improvement of the present model over the two-mass model is that it yields a smooth transition between oscillations with an inductive load and a capacitive load of the vocal tract with no sudden jumps in the vibration frequency. Self-excitation is possible both below and above the first formant frequency of the vocal tract. By taking advantage of the wider continuous frequency range, the two-dimensional model can successfully be applied to the sound synthesis of a high-pitched soprano singing, where the fundamental frequency sometimes exceeds the first formant frequency.
有声声音是用一个由单个质量块组成的声带计算机模型模拟的,该质量块在与气流平行和垂直的方向上振动。在声门面积和声门体积流速的振幅、声门体积流速波形随声道形状的变化以及振荡振幅对声门平均开口面积的依赖性等方面,发现了与双质量模型的相似之处,还有其他一些相似特征。在上升和下降阶段更对称的声门和体积流速波形中也发现了一些不同之处。本模型相对于双质量模型的主要改进在于,它在声道的电感负载和电容负载的振荡之间产生了平滑过渡,振动频率没有突然跳跃。在声道的第一共振峰频率以下和以上都可能发生自激。通过利用更宽的连续频率范围,二维模型可以成功地应用于高音女高音歌唱的声音合成,其中基频有时会超过第一共振峰频率。