Qi Y, Weinberg B, Bi N
Department of Speech and Hearing Sciences, University of Arizona, Tucson 85721, USA.
J Acoust Soc Am. 1995 Nov;98(5 Pt 1):2461-5. doi: 10.1121/1.413279.
Qi [J. Acoust. Soc. Am. 88, 1228-1235 (1990)] demonstrated that (1) linear predictive (LP) methods can separate vocal tract transfer functions from source functions of vowels produced by alaryngeal talkers, and that (2) vowels synthesized with reconstructed transfer functions and totally synthetic voicing excitation sources have better source-related properties than the original vowels. This paper reports an extension of that work, directed toward the general goal of developing systems (devices) that enhance the quality of alaryngeal speech. The specific goal of the present project was to determine whether speech, namely words spoken by female esophageal and tracheoesophageal talkers, could be enhanced by LP-based analysis and synthesis. Words spoken by four female alaryngeal talkers were analyzed and synthesized, and a perceptual evaluation compared the quality of the synthetic and original words. Listeners generally preferred the synthesized words, indicating that alaryngeal speech enhancement was accomplished.
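As a rough illustration of what LP-based analysis and synthesis involves, the sketch below estimates an all-pole vocal tract model from a frame with the autocorrelation method (Levinson-Durbin recursion) and then drives that model with a synthetic impulse-train excitation. It is not the authors' implementation: the abstract does not specify the LP order, the framing, or the form of the synthetic source, so the function names (`lpc_autocorr`, `impulse_train`, `all_pole_filter`), the impulse-train source, and all parameter values are assumptions made here for illustration.

```python
import numpy as np


def lpc_autocorr(frame, order):
    """Estimate LP coefficients a[0..order] (a[0] = 1) for the inverse
    filter A(z) = 1 + a[1] z^-1 + ... using the autocorrelation method
    and the Levinson-Durbin recursion. Returns (a, residual_energy)."""
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[: n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for stage i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err


def impulse_train(n_samples, f0_hz, fs_hz):
    """Synthetic voicing source (an assumption): unit impulses spaced one
    pitch period apart."""
    period = int(round(fs_hz / f0_hz))
    e = np.zeros(n_samples)
    e[::period] = 1.0
    return e


def all_pole_filter(a, excitation):
    """Synthesis filter 1/A(z): y[n] = x[n] - sum_j a[j] * y[n - j]."""
    order = len(a) - 1
    y = np.zeros(len(excitation))
    for n in range(len(excitation)):
        acc = excitation[n]
        for j in range(1, min(order, n) + 1):
            acc -= a[j] * y[n - j]
        y[n] = acc
    return y


if __name__ == "__main__":
    fs = 8000.0                       # assumed sampling rate
    rng = np.random.default_rng(0)
    # Stand-in for a recorded frame: noise shaped by a known resonance.
    frame = all_pole_filter(np.array([1.0, -1.3, 0.8]),
                            rng.standard_normal(2000))
    a, _ = lpc_autocorr(frame, order=10)          # analysis
    source = impulse_train(len(frame), 200.0, fs)  # synthetic excitation
    resynth = all_pole_filter(a, source)           # synthesis
    print(resynth[:5])
```

In this sketch the noisy original excitation is discarded and only the estimated transfer function is kept, which mirrors the idea of pairing a reconstructed transfer function with a fully synthetic source. A practical system would process overlapping windowed frames and use a more natural glottal pulse shape than a bare impulse train.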