人类听觉皮层中的言语表征：它具有特殊性吗？

Representation of speech in human auditory cortex: is it special?

机构信息

Department of Neurology, Rose F. Kennedy Center, Albert Einstein College of Medicine, Room 322, 1300 Morris Park Avenue, Bronx, NY 10461, USA; Department of Neuroscience, Rose F. Kennedy Center, Albert Einstein College of Medicine, Room 322, 1300 Morris Park Avenue, Bronx, NY 10461, USA.

出版信息

Hear Res. 2013 Nov;305:57-73. doi: 10.1016/j.heares.2013.05.013. Epub 2013 Jun 18.

DOI:10.1016/j.heares.2013.05.013

PMID:23792076

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3818517/

Abstract

Successful categorization of phonemes in speech requires that the brain analyze the acoustic signal along both spectral and temporal dimensions. Neural encoding of the stimulus amplitude envelope is critical for parsing the speech stream into syllabic units. Encoding of voice onset time (VOT) and place of articulation (POA), cues necessary for determining phonemic identity, occurs within shorter time frames. An unresolved question is whether the neural representation of speech is based on processing mechanisms that are unique to humans and shaped by learning and experience, or is based on rules governing general auditory processing that are also present in non-human animals. This question was examined by comparing the neural activity elicited by speech and other complex vocalizations in primary auditory cortex of macaques, who are limited vocal learners, with that in Heschl's gyrus, the putative location of primary auditory cortex in humans. Entrainment to the amplitude envelope is neither specific to humans nor to human speech. VOT is represented by responses time-locked to consonant release and voicing onset in both humans and monkeys. Temporal representation of VOT is observed both for isolated syllables and for syllables embedded in the more naturalistic context of running speech. The fundamental frequency of male speakers is represented by more rapid neural activity phase-locked to the glottal pulsation rate in both humans and monkeys. In both species, the differential representation of stop consonants varying in their POA can be predicted by the relationship between the frequency selectivity of neurons and the onset spectra of the speech sounds. These findings indicate that the neurophysiology of primary auditory cortex is similar in monkeys and humans despite their vastly different experience with human speech, and that Heschl's gyrus is engaged in general auditory, and not language-specific, processing. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives".

摘要

成功地对语音中的音素进行分类，要求大脑沿着频谱和时间维度对声学信号进行分析。刺激幅度包络的神经编码对于将语音流解析为音节单位至关重要。语音起始时间 (VOT) 和发音部位 (POA) 的编码，是确定音素身份所必需的线索，发生在更短的时间框架内。一个悬而未决的问题是，语音的神经表示是基于人类特有的处理机制，还是基于也存在于非人类动物中的一般听觉处理规则。这个问题通过比较恒河猴初级听觉皮层中语音和其他复杂发声引起的神经活动与人类初级听觉皮层（假定位于 Heschl 回）的神经活动来检验。幅度包络的同步既不是人类特有的，也不是人类语音特有的。VOT 是通过响应时间锁定在辅音释放和发声开始来表示的，无论是在人类还是猴子中。VOT 的时间表示既可以观察到孤立的音节，也可以观察到更自然的连续语音中的音节。男性说话者的基频通过与声门脉冲率快速锁相的更快的神经活动来表示，无论是在人类还是猴子中。在这两个物种中，根据神经元的频率选择性和语音的起始谱之间的关系，可以预测在 POA 上变化的闭塞辅音的差异表示。这些发现表明，尽管猴子和人类在人类语音方面的经验有很大的不同，但初级听觉皮层的神经生理学在猴子和人类中是相似的，并且 Heschl 回参与的是一般听觉处理，而不是特定于语言的处理。本文是一个特刊的一部分，题为“交流声音与大脑：新方向和视角”。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6275/3818517/2957afb691ba/nihms495990f1.jpg

相似文献

Representation of speech in human auditory cortex: is it special?人类听觉皮层中的言语表征：它具有特殊性吗？

Hear Res. 2013 Nov;305:57-73. doi: 10.1016/j.heares.2013.05.013. Epub 2013 Jun 18.

Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex.通过直接从人类听觉皮层记录的场电位对语音起始时间语音参数进行时间编码。

J Neurophysiol. 1999 Nov;82(5):2346-57. doi: 10.1152/jn.1999.82.5.2346.

Representation of the voice onset time (VOT) speech parameter in population responses within primary auditory cortex of the awake monkey.清醒猴子初级听觉皮层群体反应中语音起始时间（VOT）语音参数的表征。

J Acoust Soc Am. 2003 Jul;114(1):307-21. doi: 10.1121/1.1582449.

Stimulus-dependent activations and attention-related modulations in the auditory cortex: a meta-analysis of fMRI studies.听觉皮层中与刺激相关的激活和与注意相关的调制：功能磁共振成像研究的荟萃分析。

Hear Res. 2014 Jan;307:29-41. doi: 10.1016/j.heares.2013.08.001. Epub 2013 Aug 11.

Specialization of left auditory cortex for speech perception in man depends on temporal coding.人类左听觉皮层对语音感知的特化取决于时间编码。

Cereb Cortex. 1999 Jul-Aug;9(5):484-96. doi: 10.1093/cercor/9.5.484.

How do auditory cortex neurons represent communication sounds?听觉皮层神经元如何表征交流声音？

Hear Res. 2013 Nov;305:102-12. doi: 10.1016/j.heares.2013.03.011. Epub 2013 Apr 17.

Consonance and dissonance of musical chords: neural correlates in auditory cortex of monkeys and humans.音乐和弦的协和与不协和：猴子和人类听觉皮层中的神经关联

J Neurophysiol. 2001 Dec;86(6):2761-88. doi: 10.1152/jn.2001.86.6.2761.

Intracortical responses in human and monkey primary auditory cortex support a temporal processing mechanism for encoding of the voice onset time phonetic parameter.人类和猴子初级听觉皮层的皮质内反应支持一种用于编码语音起始时间语音参数的时间处理机制。

Cereb Cortex. 2005 Feb;15(2):170-86. doi: 10.1093/cercor/bhh120. Epub 2004 Jul 6.

Contribution of spectrotemporal features on auditory event-related potentials elicited by consonant-vowel syllables.协同元音诱发听觉事件相关电位的频谱时域特征贡献。

Ear Hear. 2009 Dec;30(6):704-12. doi: 10.1097/AUD.0b013e3181b1d42d.

Speech-evoked activity in primary auditory cortex: effects of voice onset time.初级听觉皮层中言语诱发的活动：语音起始时间的影响。

Electroencephalogr Clin Neurophysiol. 1994 Jan;92(1):30-43. doi: 10.1016/0168-5597(94)90005-1.

引用本文的文献

The auditory P2 is influenced by pitch changes but not pitch strength and consists of two separate subcomponents.听觉P2受音高变化影响，但不受音强影响，且由两个独立的子成分组成。

Imaging Neurosci (Camb). 2024 May 9;2. doi: 10.1162/imag_a_00160. eCollection 2024.

Voice-Evoked Color Prediction Using Deep Neural Networks in Sound-Color Synesthesia.在声色联觉中使用深度神经网络进行语音诱发的颜色预测。

Brain Sci. 2025 May 19;15(5):520. doi: 10.3390/brainsci15050520.

Neural Dynamics of the Processing of Speech Features: Evidence for a Progression of Features from Acoustic to Sentential Processing.语音特征处理的神经动力学：从声学处理到句子处理的特征递进证据

J Neurosci. 2025 Mar 12;45(11):e1143242025. doi: 10.1523/JNEUROSCI.1143-24.2025.

Cognitive and cortical network alterations in pediatric temporal lobe space-occupying lesions: an fMRI study.小儿颞叶占位性病变的认知和皮质网络改变：一项功能磁共振成像研究

Front Hum Neurosci. 2024 Dec 9;18:1509899. doi: 10.3389/fnhum.2024.1509899. eCollection 2024.

Direct neural coding of speech: Reconsideration of Whalen et al. (2006) (L).直接语音神经编码：重新思考 Whalen 等人（2006 年）的观点（L）。

J Acoust Soc Am. 2024 Mar 1;155(3):1704-1706. doi: 10.1121/10.0025125.

bioRxiv. 2024 Dec 10:2024.02.02.578603. doi: 10.1101/2024.02.02.578603.

Intracranial electrophysiology of spectrally degraded speech in the human cortex.人类大脑皮层中频谱退化语音的颅内电生理学

Front Hum Neurosci. 2024 Jan 22;17:1334742. doi: 10.3389/fnhum.2023.1334742. eCollection 2023.

Neural Fluctuation Contrast as a Code for Complex Sounds: The Role and Control of Peripheral Nonlinearities.神经波动对比作为复杂声音的代码：外围非线性的作用和控制。

Hear Res. 2024 Mar 1;443:108966. doi: 10.1016/j.heares.2024.108966. Epub 2024 Feb 1.

Large-scale single-neuron speech sound encoding across the depth of human cortex.人类大脑皮层深度范围内的大规模单神经元语音编码

Nature. 2024 Feb;626(7999):593-602. doi: 10.1038/s41586-023-06839-2. Epub 2023 Dec 13.

Immediate neural impact and incomplete compensation after semantic hub disconnection.语义中枢连接中断后的即时神经影响和不完全补偿。

Nat Commun. 2023 Oct 7;14(1):6264. doi: 10.1038/s41467-023-42088-7.

本文引用的文献

Processing of communication calls in Guinea pig auditory cortex.豚鼠听觉皮层中通讯呼叫的处理。

PLoS One. 2012;7(12):e51646. doi: 10.1371/journal.pone.0051646. Epub 2012 Dec 12.

Coding of repetitive transients by auditory cortex on posterolateral superior temporal gyrus in humans: an intracranial electrophysiology study.人类后外侧上颞 gyrus 听觉皮层对重复瞬态的编码：一项颅内电生理学研究。

J Neurophysiol. 2013 Mar;109(5):1283-95. doi: 10.1152/jn.00718.2012. Epub 2012 Dec 12.

Dual-pitch processing mechanisms in primate auditory cortex.灵长类听觉皮层的双重音加工机制。

J Neurosci. 2012 Nov 14;32(46):16149-61. doi: 10.1523/JNEUROSCI.2563-12.2012.

Searching for the mismatch negativity in primary auditory cortex of the awake monkey: deviance detection or stimulus specific adaptation?在清醒猴子的初级听觉皮层中搜索失匹配负波：偏差检测还是刺激特异性适应？

J Neurosci. 2012 Nov 7;32(45):15747-58. doi: 10.1523/JNEUROSCI.2835-12.2012.

Towards a new neurobiology of language.迈向语言新神经生物学。

J Neurosci. 2012 Oct 10;32(41):14125-31. doi: 10.1523/JNEUROSCI.3244-12.2012.

Birds, primates, and spoken language origins: behavioral phenotypes and neurobiological substrates.鸟类、灵长类与语言起源：行为表型与神经生物学基础

Front Evol Neurosci. 2012 Aug 16;4:12. doi: 10.3389/fnevo.2012.00012. eCollection 2012.

Emergence of neural encoding of auditory objects while listening to competing speakers.在聆听竞争说话者时听觉对象的神经编码的出现。

Proc Natl Acad Sci U S A. 2012 Jul 17;109(29):11854-9. doi: 10.1073/pnas.1205381109. Epub 2012 Jul 2.

Early experience shapes vocal neural coding and perception in songbirds.早期经验塑造鸣禽的声音神经编码和感知。

Dev Psychobiol. 2012 Sep;54(6):612-31. doi: 10.1002/dev.21014. Epub 2012 Jun 18.

Phase-locked responses to speech in human auditory cortex are enhanced during comprehension.人类听觉皮层对言语的锁相反应在理解过程中增强。

Cereb Cortex. 2013 Jun;23(6):1378-87. doi: 10.1093/cercor/bhs118. Epub 2012 May 17.

Selective cortical representation of attended speaker in multi-talker speech perception.选择性皮层对多说话人语音感知中被注意说话人的代表。

Nature. 2012 May 10;485(7397):233-6. doi: 10.1038/nature11020.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验