Adachi Seiji, Takemoto Hironori, Kitamura Tatsuya, Mokhtari Parham, Honda Kiyoshi
ATR Human Information Science Laboratories, 2-2-2 Hikaridai, Keihanna Science City, Kyoto 619-0288, Japan.
J Acoust Soc Am. 2007 Jun;121(6):3874-85. doi: 10.1121/1.2730743.
This paper presents an alternative and complete derivation of the vocal tract length sensitivity function, an equation that gives the change in formant frequency caused by a perturbation of the vocal tract length [Fant, Quarterly Progress and Status Rep. No. 4, Speech Transmission Laboratory, Kungliga Tekniska Högskolan, Stockholm, 1975, pp. 1-14]. The derivation is based on the adiabatic invariance of the vocal tract as an acoustic resonator and on the radiation pressure on the wall and at the exit of the vocal tract. An algorithm proposed by Story [J. Acoust. Soc. Am. 119, 715-718 (2006)] for tuning the vocal tract shape so that its formant frequencies match target values, such as those of a recorded speech signal, is extended so that the vocal tract length can also be changed. Numerical simulation shows that the extended algorithm can successfully convert between the vocal tract shapes of a male and a female for each of five Japanese vowels.
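As a rough illustration of what a length sensitivity expresses, the sketch below uses the simplest textbook case rather than the derivation in the paper: a uniform, lossless tube closed at the glottis and open at the lips. Its resonances are F_n = (2n - 1)c / (4L), so a small uniform lengthening dL shifts every formant by about -F_n dL / L. The tube model, the sound speed of 350 m/s, the 17.5 cm length and the function names are all assumptions made for this example.

```python
# Illustrative only: uniform lossless tube, closed at the glottis, open at the lips.
# This is NOT the sensitivity function derived in the paper, which handles
# arbitrary area functions; it only shows the first-order idea in the simplest case.

SPEED_OF_SOUND = 350.0  # m/s, assumed value for warm, humid air


def uniform_tube_formants(length_m, n_formants=3, c=SPEED_OF_SOUND):
    """Quarter-wavelength resonances F_n = (2n - 1) * c / (4 * L), in Hz."""
    return [(2 * n - 1) * c / (4.0 * length_m) for n in range(1, n_formants + 1)]


def first_order_shift(formants, length_m, delta_length_m):
    """First-order estimate dF_n = -F_n * dL / L for a uniform length change."""
    return [-f * delta_length_m / length_m for f in formants]


length = 0.175        # m, assumed tract length
delta_length = 0.005  # m, assumed small lengthening

base = uniform_tube_formants(length)
exact = uniform_tube_formants(length + delta_length)
approx = [f + d for f, d in zip(base, first_order_shift(base, length, delta_length))]

for n, (f0, fa, fe) in enumerate(zip(base, approx, exact), start=1):
    print(f"F{n}: base {f0:.1f} Hz, first-order {fa:.1f} Hz, exact {fe:.1f} Hz")
```

With these assumed values the base formants are approximately 500, 1500 and 2500 Hz, and the first-order estimate stays close to the exact result because the length change is small relative to the tract length.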