Zäske Romi, Volberg Gregor, Kovács Gyula, Schweinberger Stefan Robert
Department for General Psychology and Cognitive Neuroscience, Institute of Psychology,
Department for Psychology, Institute of Psychology, and.
J Neurosci. 2014 Aug 13;34(33):10821-31. doi: 10.1523/JNEUROSCI.0581-14.2014.
Listeners can recognize familiar human voices from variable utterances, suggesting the acquisition of speech-invariant voice representations during familiarization. However, the neurocognitive mechanisms mediating learning and recognition of voices from natural speech are currently unknown. Using electrophysiology, we investigated how representations are formed during intentional learning of initially unfamiliar voices that were later recognized among novel voices. To probe the acquisition of speech-invariant voice representations, we compared a "same sentence" condition, in which speakers repeated the study utterances at test, and a "different sentence" condition. Although recognition performance was higher for same compared with different sentences, substantial voice learning also occurred for different sentences, with recognition performance increasing across consecutive study-test-cycles. During study, event-related potentials elicited by voices subsequently remembered elicited a larger sustained parietal positivity (∼250-1400 ms) compared with subsequently forgotten voices. This difference due to memory was unaffected by test sentence condition and may thus reflect the acquisition of speech-invariant voice representations. At test, voices correctly classified as "old" elicited a larger late positive component (300-700 ms) at Pz than voices correctly classified as "new." This event-related potential OLD/NEW effect was limited to the same sentence condition and may thus reflect speech-dependent retrieval of voices from episodic memory. Importantly, a speech-independent effect for learned compared with novel voices was found in beta band oscillations (16-17 Hz) between 290 and 370 ms at central and right temporal sites. Our results are a first step toward elucidating the electrophysiological correlates of voice learning and recognition.
听众能够从各种不同的话语中识别出熟悉的人声,这表明在熟悉过程中获得了语音不变的声音表征。然而,介导从自然语音中学习和识别声音的神经认知机制目前尚不清楚。我们利用电生理学方法,研究了在有意学习最初不熟悉的声音(这些声音后来在新声音中被识别出来)的过程中,表征是如何形成的。为了探究语音不变声音表征的习得情况,我们比较了“相同句子”条件(即说话者在测试时重复学习时的话语)和“不同句子”条件。尽管相同句子的识别表现高于不同句子,但不同句子也出现了显著的声音学习,识别表现在连续的学习-测试循环中不断提高。在学习过程中,与随后被记住的声音相比,随后被遗忘的声音诱发的事件相关电位在顶叶引发了更大的持续正电位(约250 - 1400毫秒)。这种由记忆引起的差异不受测试句子条件的影响,因此可能反映了语音不变声音表征的习得。在测试时,被正确分类为“旧”的声音在Pz处诱发的晚期正成分(300 - 700毫秒)比被正确分类为“新”的声音更大。这种事件相关电位的旧/新效应仅限于相同句子条件,因此可能反映了从情景记忆中依赖语音的声音检索。重要的是,在中央和右侧颞叶部位,在290至370毫秒之间的β波段振荡(16 - 17赫兹)中发现了与新声音相比,学习声音的语音独立效应。我们的结果是阐明声音学习和识别的电生理相关性的第一步。