Martin D P, Wolfe V I
Auburn University at Montgomery, AL 36117-3596, USA.
Percept Mot Skills. 1996 Dec;83(3 Pt 2):1291-8. doi: 10.2466/pms.1996.83.3f.1291.
28 undergraduate students participated in a perceptual voice experiment to assess the effects of training utilizing synthesized voice signals. An instructional strategy based upon synthesized examples of a three-part classification system: "breathy," "rough," and "hoarse," was employed. Training samples were synthesized with varying amounts of jitter (cycle-to-cycle deviation in pitch period) and harmonic-to-noise ratios to represent these qualities. Before training, listeners categorized 60 pathological voices into "breathy," "rough," and "hoarse," largely on the basis of fundamental frequency. After training, categorizations were influenced by harmonic-to-noise ratios as well as fundamental frequency, suggesting that listeners were more aware of spectral differences in pathological voices associated with commonly occurring laryngeal conditions. 40% of the pathological voice samples remained unclassified following training.
28名本科生参与了一项感知语音实验,以评估利用合成语音信号进行训练的效果。采用了一种基于三部分分类系统合成示例的教学策略:“呼吸声”、“粗糙声”和“嘶哑声”。训练样本通过不同程度的抖动(音高周期的逐周期偏差)和谐波与噪声比进行合成,以体现这些特征。训练前,听众主要根据基频将60个病理性语音分类为“呼吸声”、“粗糙声”和“嘶哑声”。训练后,分类受到谐波与噪声比以及基频的影响,这表明听众对与常见喉部疾病相关的病理性语音的频谱差异更加敏感。40%的病理性语音样本在训练后仍未分类。