Department of Speech, Music & Hearing, School of Computer Science and Communications, KTH, Stockholm, Sweden.
J Voice. 2010 Sep;24(5):574-84. doi: 10.1016/j.jvoice.2009.01.001. Epub 2009 Oct 21.
Whisper productions were produced by a single adult male subject over a wide range of subglottal pressures, glottal areas, and glottal flows. Dimensional measurements were made of these three variables, including glottal perimeter. Subglottal pressure was directly obtained by a pressure transducer in a tracheal catheter, and wide-band flow with a pneumotach mask. Four types of whispers were used-hyperfunctional, hypofunctional, neutral, and postphonation-in addition to three levels of loudness (soft, medium, loud). Sequences of the /pae/ syllable were used. Video recordings of the larynx were made. The glottis was outlined by hand with extrapolation for unseen parts, and area and perimeter were obtained through image analysis software. The whisper tokens resulted in the following wide ranges: subglottal pressure: 1.3-17 cm H2O; glottal flow: 0.9-1.71 L/s; glottal area: 0.065-1.76 m2; and glottal perimeter: 1.09-6.55 cm. Hyperfunctional whisper tended to have higher subglottal pressures and lower areas and flows than hypofunctional whisper, with neutral and postphonation whisper values in between. An important finding is that glottal flow changed more for small changes of area when the area was already small, and did not create much flow change when area was changed for already larger areas; that is, whisper is "more sensitive" to airflow changes for smaller glottal areas. A general equation for whisper aerodynamics was obtained, namely, P (subglottal pressure [cm H2O])=C X F (glottal flow [cm(3)/s]), where C = 0.052 x A(4) - 0.1913 x A(3) + 0.2577 x A(2) - 0.1523 x A+0.0388, where A is the glottal area (cm(2)). Another general equation for nondimensional terms (pressure coefficient vs Reynolds number) also is offered. Implications for whisper flow resistance and aerodynamic power are given. These results give insight into whisper aerodynamics and offer equations relevant to speech synthesis.
耳语产生于单一成年男性,在一系列声门下压力、声门面积和气流下产生。对这三个变量(包括声门周长)进行了量度。声门下压力通过气管导管中的压力传感器直接获得,宽带气流通过气动流量传感器获得。除了三个响度水平(轻声、中声、大声)外,还使用了四种耳语类型:高功能、低功能、中性和后发音。使用了/pae/音节序列。对喉部进行了视频记录。通过手动描绘声门轮廓并对看不见的部分进行外推,使用图像分析软件获得了面积和周长。耳语标记产生了以下宽范围:声门下压力:1.3-17cmH2O;声门气流:0.9-1.71L/s;声门面积:0.065-1.76m2;声门周长:1.09-6.55cm。高功能耳语的声门下压力往往高于低功能耳语,而面积和气流则较低,中性和后发音耳语的值则在两者之间。一个重要的发现是,当面积已经较小时,面积的微小变化会导致声门气流发生更大的变化,而当面积已经较大时,面积的变化不会导致气流发生很大的变化;也就是说,对于较小的声门面积,耳语对气流变化更“敏感”。获得了耳语空气动力学的一般方程,即 P(声门下压力[cmH2O])=C X F(声门气流[cm³/s]),其中 C = 0.052 x A(4) - 0.1913 x A(3) + 0.2577 x A(2) - 0.1523 x A+0.0388,其中 A 是声门面积(cm²)。还提供了另一个无量纲项(压力系数与雷诺数)的一般方程。讨论了耳语流动阻力和空气动力学功率的含义。这些结果深入了解了耳语空气动力学,并提供了与语音合成相关的方程。