Lisker L
J Acoust Soc Am. 1985 Mar;77(3):1199-202. doi: 10.1121/1.392185.
The search for the acoustic properties useful to the listener in extracting the linguistic message from a speech signal is often construed as the task of matching invariant physical properties to invariant phonological percepts; the discovery of the former will explain the latter. These phonological percepts are essentially the phonemes of pregenerative phonology, and they are more or less faithfully reflected in standard alphabetic writing. Thus English deep and doom are supposed to be perceptually identical in their initial /d/s; the orthographic similarity is in agreement with the linguist's "representation" of these forms. The partial identity in spelling is only weak evidence for perceptual invariance, however. First, while some phonemes may comprise a single "sound," others are said by linguists to include phonetically distinct ones. Thus English /p/includes both aspirated and unaspirated voiceless labial stops. The view that it is not the phoneme, but rather the phonetic feature, to which an acoustic invariant might be attributed, raises two questions: (a) Since segments sharing a feature are rarely judged to constitute a single sound, the search for a feature-specific invariant, whose function is to explain perceptual constancy, is deprived of its essential motivation, and (2) there is no more reason to expect the acoustic cues to a feature to be context-independent than is the case with the phoneme. What seems more likely is to find that some phonemes, and some features, are more invariantly marked in the speech signal than others.
在从语音信号中提取语言信息时,寻找对听者有用的声学特性,通常被视为将不变的物理特性与不变的音位感知进行匹配的任务;发现前者将解释后者。这些音位感知本质上就是生成音系学前的音素,它们或多或少忠实地反映在标准字母书写中。因此,在英语中,deep和doom开头的/d/在感知上被认为是相同的;拼写上的相似性与语言学家对这些形式的“表征”一致。然而,拼写中的部分相同只是感知不变性的微弱证据。首先,虽然有些音素可能只包含单一的“音”,但语言学家认为其他音素包括语音上不同的音。因此,英语中的/p/既包括送气清双唇塞音,也包括不送气清双唇塞音。认为声学不变性可能归因于语音特征而非音素的观点,引发了两个问题:(a) 由于共享一个特征的音段很少被判断为构成一个单一的音,那么寻找一个特定特征的不变性(其作用是解释感知恒常性)就失去了其基本动机;(2) 与音素的情况相比,没有更多理由期望一个特征的声学线索是上下文无关的。更有可能的是,会发现一些音素和一些特征在语音信号中的标记比其他的更具不变性。