Stack Janet W, Strange Winifred, Jenkins James J, Clarke William D, Trent Sonja A
Communication Disorders Program, Curry School of Education, University of Virginia, 2205 Fontaine Avenue, Suite 202, Charlottesville, Virginia 22903, USA.
J Acoust Soc Am. 2006 Apr;119(4):2394-405. doi: 10.1121/1.2171837.
This study examined the perception and acoustics of a large corpus of vowels spoken by an adult female talker in consonant-vowel-consonant syllables, produced in citation form (lists) and in sentences at normal and rapid rates. Listeners correctly categorized the speaking rate of sentence materials as normal or rapid (2% errors) but did not accurately classify the speaking rate of the syllables when they were excised from the sentences (25% errors). In contrast, listeners accurately identified the vowels produced in sentences spoken at both rates, both when presented with the sentences and when presented with the excised syllables, whether blocked by speaking rate or randomized. Acoustical analysis showed that formant frequencies at syllable midpoint for vowels in sentence materials exhibited "target undershoot" relative to citation-form values, but changed little across speaking rates. Syllable durations varied systematically with vowel identity, speaking rate, and voicing of the final consonant. For most vowels, the direction of vowel-inherent spectral change was invariant over rate and context. The temporal location of maximum F1 frequency further differentiated spectrally adjacent lax and tense vowels. It was concluded that listeners were able to use these rate- and context-independent dynamic spectrotemporal parameters to identify coarticulated vowels, even when sentential information about speaking rate was not available.