Department of Experimental Psychology, University of Oxford, Oxford, UK; Department of Psychology, Acadia University, Wolfville, Nova Scotia, Canada.
Department of Experimental Psychology, University of Oxford, Oxford, UK.
Curr Biol. 2018 Oct 8;28(19):3106-3113.e2. doi: 10.1016/j.cub.2018.07.030. Epub 2018 Sep 20.
Sensorimotor learning has been studied by altering the sound of the voice in real time as speech is produced. In response to voice alterations, learned changes in production reduce the perceived auditory error and persist for some time after the alteration is removed [1-5]. The results of such experiments have led to the development of prominent models of speech production. This work proposes that the control of speech relies on forward models to predict sensory outcomes of movements, and errors in these predictions drive sensorimotor learning [5-7]. However, sensorimotor learning in speech has only been observed following intensive training on a handful of discrete words or perceptually similar sentences. Stereotyped production does not capture the complex sensorimotor demands of fluid, real-world speech [8-11]. It remains unknown whether talkers predict the sensory consequences of variable sentence production to allow rapid and precise updating of speech motor plans when sensory prediction errors are encountered. Here, we used real-time alterations of speech feedback to test for sensorimotor learning during the production of 50 sentences that varied markedly in length, vocabulary, and grammar. Following baseline production, all vowels were simultaneously altered and played back through headphones in near real time. Robust feedforward changes in sentence production were observed that, on average, precisely countered the direction of the alteration. These changes occurred in every participant and transferred to the production of single words with varying vowel sounds. The results show that to maintain accurate sentence production, the brain actively predicts the auditory consequences of variable sentence-level speech.
感觉运动学习是通过实时改变语音的声音来研究的,而语音是在产生的。作为对声音变化的反应,生产中的习得变化减少了感知到的听觉错误,并在变化消除后持续一段时间[1-5]。这样的实验结果导致了突出的言语产生模型的发展。这项工作提出,言语的控制依赖于前馈模型来预测运动的感觉结果,这些预测中的错误驱动感觉运动学习[5-7]。然而,在对少数离散单词或感知上相似的句子进行密集训练后,才观察到言语感觉运动学习[8-11]。刻板的生产并不能捕捉到流畅的现实世界言语的复杂感觉运动需求。目前尚不清楚说话者是否预测了可变句子产生的感觉后果,以便在遇到感觉预测错误时快速而精确地更新言语运动计划。在这里,我们使用语音反馈的实时改变来测试在生产 50 个句子时的感觉运动学习,这些句子在长度、词汇和语法上有很大的差异。在基线生产之后,所有的元音都同时被改变,并通过耳机近乎实时地播放。在句子生产中观察到了强大的前馈变化,这些变化平均精确地抵消了改变的方向。这些变化发生在每个参与者身上,并转移到了具有不同元音的单个单词的生产中。结果表明,为了保持准确的句子生产,大脑积极预测可变句子层面言语的听觉后果。