Moore, Brian C. J.
Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, UK.
Philos Trans R Soc Lond B Biol Sci. 2008 Mar 12;363(1493):947-63. doi: 10.1098/rstb.2007.2152.
This paper reviews the basic aspects of auditory processing that play a role in the perception of speech. The frequency selectivity of the auditory system, as measured using masking experiments, is described and used to derive the internal representation of the spectrum (the excitation pattern) of speech sounds. The perception of timbre and distinctions in quality between vowels are related to both static and dynamic aspects of the spectra of sounds. The perception of pitch and its role in speech perception are described. Measures of the temporal resolution of the auditory system are described and a model of temporal resolution based on a sliding temporal integrator is outlined. The combined effects of frequency and temporal resolution can be modelled by calculation of the spectro-temporal excitation pattern, which gives good insight into the internal representation of speech sounds. For speech presented in quiet, the resolution of the auditory system in frequency and time usually markedly exceeds the resolution necessary for the identification or discrimination of speech sounds, which partly accounts for the robust nature of speech perception. However, for people with impaired hearing, speech perception is often much less robust.
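As a rough illustration of what frequency selectivity means in numbers, the sketch below computes the equivalent rectangular bandwidth (ERB) of the auditory filter and the corresponding ERB-number scale. The abstract gives no formulas; the ones used here are the widely cited approximations from Glasberg and Moore (1990) for young normal-hearing listeners at moderate levels, and their use is an assumption of this example, not something stated above. The function names `erb_hz` and `erb_number` are hypothetical.

```python
import math


def erb_hz(freq_hz: float) -> float:
    """Approximate ERB (in Hz) of the auditory filter centred at freq_hz.

    Assumed formula (Glasberg & Moore, 1990): ERB = 24.7 * (4.37 * F + 1),
    with F the centre frequency in kHz.
    """
    f_khz = freq_hz / 1000.0
    return 24.7 * (4.37 * f_khz + 1.0)


def erb_number(freq_hz: float) -> float:
    """Approximate position of freq_hz on the ERB-number scale (in Cams).

    Assumed formula (same source): ERB-number = 21.4 * log10(4.37 * F + 1),
    with F the frequency in kHz.
    """
    f_khz = freq_hz / 1000.0
    return 21.4 * math.log10(4.37 * f_khz + 1.0)


if __name__ == "__main__":
    # Print filter bandwidth and scale position at a few speech-relevant frequencies.
    for f in (250.0, 500.0, 1000.0, 2000.0, 4000.0):
        print(f"{f:6.0f} Hz  ERB = {erb_hz(f):6.1f} Hz  ERB-number = {erb_number(f):5.2f}")
```

Under these assumed formulas the filter bandwidth grows with centre frequency, so a fixed frequency spacing is resolved more finely at low frequencies than at high ones. An excitation pattern can be thought of as the output of such filters plotted against their centre frequency, often on the ERB-number scale.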