Hanson Helen M, Stevens Kenneth N
Sensimetrics Corporation, Somerville, Massachusetts 02144-2500, USA.
J Acoust Soc Am. 2002 Sep;112(3 Pt 1):1158-82. doi: 10.1121/1.1498851.
The HLsyn speech synthesizer uses models of the vocal tract to map higher-level quasiarticulatory parameters to the acoustic parameters of a Klatt-type formant synthesizer. The benefits of this system are several. In addition to requiring a relatively small number of parameters, the HLsyn model includes constraints on source-filter relations that occur naturally during speech production. Such constraints help to prevent combinations of sources and filter that are impossible to achieve with the human vocal tract. Thus, HLsyn could lead to reductions in the complexity of formant synthesis and result in better quality synthesis. HLsyn can also be a useful tool for speech-science education and speech research. This paper focuses on the generation of acoustic sources in HLsyn. Described in detail are the equations and methods used to estimate Klatt-type source parameters from HLsyn parameters. Several examples illustrating the generation of source parameters for obstruents (voiced and voiceless) and sonorants are provided. Future papers will describe the filtering components of HLsyn.
HLsyn语音合成器使用声道模型将更高层次的准发音参数映射到克拉特型共振峰合成器的声学参数上。该系统有诸多优点。除了需要相对较少的参数外,HLsyn模型还包含了语音产生过程中自然出现的源 - 滤波器关系的约束。这些约束有助于防止出现人类声道无法实现的源和滤波器的组合。因此,HLsyn可以降低共振峰合成的复杂度,并产生质量更好的合成效果。HLsyn还可以成为语音科学教育和语音研究的有用工具。本文重点关注HLsyn中声源的生成。详细描述了从HLsyn参数估计克拉特型声源参数所使用的方程和方法。提供了几个说明浊音和清音阻塞音以及响音声源参数生成的示例。后续论文将描述HLsyn的滤波组件。