Shuai Lan, Gong Tao
Department of Electrical and Computer Engineering, Johns Hopkins University Baltimore, MD, USA.
Department of Linguistics, University of Hong Kong Hong Kong, China.
Front Behav Neurosci. 2014 Mar 25;8:97. doi: 10.3389/fnbeh.2014.00097. eCollection 2014.
Speech perception entails both top-down processing that relies primarily on language experience and bottom-up processing that depends mainly on instant auditory input. Previous models of speech perception often claim that bottom-up processing occurs in an early time window, whereas top-down processing takes place in a late time window after stimulus onset. In this paper, we evaluated the temporal relation of both types of processing in lexical tone perception. We conducted a series of event-related potential (ERP) experiments that recruited Mandarin participants and adopted three experimental paradigms, namely dichotic listening, lexical decision with phonological priming, and semantic violation. By systematically analyzing the lateralization patterns of the early and late ERP components that are observed in these experiments, we discovered that: auditory processing of pitch variations in tones, as a bottom-up effect, elicited greater right hemisphere activation; in contrast, linguistic processing of lexical tones, as a top-down effect, elicited greater left hemisphere activation. We also found that both types of processing co-occurred in both the early (around 200 ms) and late (around 300-500 ms) time windows, which supported a parallel model of lexical tone perception. Unlike the previous view that language processing is special and performed by dedicated neural circuitry, our study have elucidated that language processing can be decomposed into general cognitive functions (e.g., sensory and memory) and share neural resources with these functions.
语音感知既需要主要依赖语言经验的自上而下的加工,也需要主要依赖即时听觉输入的自下而上的加工。以往的语音感知模型常常认为,自下而上的加工发生在早期时间窗口,而自上而下的加工在刺激开始后的晚期时间窗口进行。在本文中,我们评估了两种加工类型在声调感知中的时间关系。我们进行了一系列事件相关电位(ERP)实验,招募了说普通话的参与者,并采用了三种实验范式,即双耳分听、语音启动的词汇判断和语义违反。通过系统分析在这些实验中观察到的早期和晚期ERP成分的偏侧化模式,我们发现:声调音高变化的听觉加工,作为一种自下而上的效应,引起更大的右半球激活;相反,声调的语言加工,作为一种自上而下的效应,引起更大的左半球激活。我们还发现,两种加工类型在早期(约200毫秒)和晚期(约300 - 500毫秒)时间窗口均同时发生,这支持了声调感知的并行模型。与以往认为语言加工是特殊的且由专门的神经回路执行的观点不同,我们的研究阐明了语言加工可以分解为一般认知功能(如感觉和记忆),并与这些功能共享神经资源。