Hawks J W, Miller J D
School of Speech Pathology & Audiology, Kent State University, Ohio 44242.
J Acoust Soc Am. 1995 Feb;97(2):1343-4. doi: 10.1121/1.412986.
The specification of vowel formant bandwidths for speech synthesis has been inconsistent in the past, perhaps because formant bandwidths are difficult to measure in natural speech and may have little perceptual effect on the intelligibility of synthetic speech. Here, regression equations derived from measurements of natural speech are presented for estimating formant bandwidths; each estimate depends only on the formant's center frequency and is independent of other formant values. Current usage, as well as comparison with another well-known estimation algorithm, suggests that the new procedure should be quite acceptable for some types of speech synthesis.
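As a rough illustration of what estimating a bandwidth from center frequency alone could look like in code, the sketch below evaluates a polynomial in the formant frequency. The polynomial form, the function name `estimate_bandwidth`, and the toy coefficients are all assumptions made for illustration; the abstract does not state the form or the coefficients of the published regression equations, and the values here are not taken from the paper.

```python
from typing import Sequence


def estimate_bandwidth(center_freq_hz: float, coeffs: Sequence[float]) -> float:
    """Evaluate B(F) = c0 + c1*F + c2*F**2 + ... for one formant.

    coeffs are ordered from the constant term upward. They are
    caller-supplied placeholders, not the published regression values.
    """
    if center_freq_hz <= 0:
        raise ValueError("center frequency must be positive")
    bandwidth = 0.0
    # Horner's rule, starting from the highest-order coefficient.
    for c in reversed(coeffs):
        bandwidth = bandwidth * center_freq_hz + c
    return bandwidth


# Toy coefficients (hypothetical) chosen only to make the arithmetic easy to follow.
toy_coeffs = [40.0, 0.01]

# Each formant is handled on its own: the estimate for one formant
# never looks at the frequencies of the others.
bandwidths = [estimate_bandwidth(f, toy_coeffs) for f in (500.0, 1500.0, 2500.0)]
```

Because the function takes a single frequency, the independence from other formant values described in the abstract is reflected directly in its signature.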