Lim Marilyn, Lin Emily, Bones Philip
Electrical and Computer Engineering Department, University of Canterbury, Christchurch, New Zealand.
J Voice. 2006 Mar;20(1):46-54. doi: 10.1016/j.jvoice.2004.09.003. Epub 2005 Jun 6.
This study investigated the relationship among the magnitude of jaw opening, intrinsic fundamental frequency (F0), and glottal parameters in natural speech. Acoustic, jaw opening, and electroglottographic (EGG) signals were recorded simultaneously. The subjects were 10 healthy men with New Zealand English as their native language. Subjects were asked to repeat a standard nonemphasized sentence in which one of the target vowels (/a/, /e/, /i/, /o/, and /u/) was embedded in various contexts. The glottal parameters F0, open quotient (OQ), and speed quotient (SQ) were measured from the EGG signal. A series of one-way repeated-measures analyses of variance (ANOVA) showed a significant vowel effect on the magnitude of jaw opening [F(4, 24) = 25.512, P < .001], F0 [F(4, 28) = 45.415, P < .001], and SQ [F(4, 28) = 5.233, P = .003], but not on OQ [F(4, 28) = 0.501, P = .735]. The magnitude of jaw opening was inversely correlated with F0 (r = -0.624, n = 25, P = .0009). These findings indicate that the magnitude of jaw opening is related to F0 and that jaw opening might serve as a control signal for simulating long-term F0 variation to achieve a higher degree of naturalness in artificial voice.
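The reported correlation (r = -0.624, n = 25) can be related to its P value with the standard t test for a Pearson correlation coefficient. The sketch below is an illustration only and assumes a two-tailed test with n - 2 degrees of freedom; the abstract does not state which test the authors used. The function names are hypothetical, and the tail area is obtained by simple numerical integration of the Student t density rather than a statistics library.

```python
import math


def t_statistic_from_r(r: float, n: int) -> float:
    """t statistic for testing a Pearson r against zero (df = n - 2)."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


def student_t_pdf(x: float, df: int) -> float:
    """Density of the Student t distribution with df degrees of freedom."""
    log_norm = (
        math.lgamma((df + 1) / 2)
        - math.lgamma(df / 2)
        - 0.5 * math.log(df * math.pi)
    )
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))


def two_tailed_p(t: float, df: int, steps: int = 2000) -> float:
    """Two-tailed P value via Simpson's rule on [0, |t|] (steps must be even)."""
    upper = abs(t)
    if upper == 0.0:
        return 1.0
    h = upper / steps
    total = student_t_pdf(0.0, df) + student_t_pdf(upper, df)
    for i in range(1, steps):
        weight = 4 if i % 2 else 2
        total += weight * student_t_pdf(i * h, df)
    central_area = total * h / 3  # probability mass between 0 and |t|
    return max(0.0, 1.0 - 2.0 * central_area)


# Values taken from the abstract; the two-tailed test is an assumption.
r, n = -0.624, 25
t = t_statistic_from_r(r, n)
p = two_tailed_p(t, n - 2)
print(f"t = {t:.3f}, df = {n - 2}, two-tailed P = {p:.5f}")
```

Under these assumptions the computed P value falls just below .001, which is consistent with the reported P = .0009.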