A Redundant Cortical Code for Speech Envelope.

Affiliation

Center for Neural Science, New York University, New York, New York 10003

Publication information

J Neurosci. 2023 Jan 4;43(1):93-112. doi: 10.1523/JNEUROSCI.1616-21.2022. Epub 2022 Nov 15.

DOI:10.1523/JNEUROSCI.1616-21.2022

PMID:36379706

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9838705/

Abstract

Animal communication sounds exhibit complex temporal structure because of the amplitude fluctuations that comprise the sound envelope. In human speech, envelope modulations drive synchronized activity in auditory cortex (AC), which correlates strongly with comprehension (Giraud and Poeppel, 2012; Peelle and Davis, 2012; Haegens and Zion Golumbic, 2018). Studies of envelope coding in single neurons, performed in nonhuman animals, have focused on periodic amplitude modulation (AM) stimuli and use response metrics that are not easy to juxtapose with data from humans. In this study, we sought to bridge these fields. Specifically, we looked directly at the temporal relationship between stimulus envelope and spiking, and we assessed whether the apparent diversity across neurons' AM responses contributes to the population representation of speech-like sound envelopes. We gathered responses from single neurons to vocoded speech stimuli and compared them to sinusoidal AM responses in AC of alert, freely moving Mongolian gerbils of both sexes. While AC neurons displayed heterogeneous tuning to AM rate, their temporal dynamics were stereotyped. Preferred response phases accumulated near the onsets of sinusoidal AM periods for slower rates (<8 Hz), and an over-representation of amplitude edges was apparent in population responses to both sinusoidal AM and vocoded speech envelopes. Crucially, this encoding bias imparted a decoding benefit: a classifier could discriminate vocoded speech stimuli using summed population activity, while higher-frequency modulations required a more sophisticated decoder that tracked spiking responses from individual cells. Together, our results imply that the envelope structure relevant to parsing an acoustic stream could be read out from a distributed, redundant population code.

Significance statement: Animal communication sounds have rich temporal structure and are often produced in extended sequences, including the syllabic structure of human speech. Although the auditory cortex (AC) is known to play a crucial role in representing speech syllables, the contribution of individual neurons remains uncertain. Here, we characterized the representations of both simple, amplitude-modulated sounds and complex, speech-like stimuli within a broad population of cortical neurons, and we found an overrepresentation of amplitude edges. Thus, a phasic, redundant code in auditory cortex can provide a mechanistic explanation for segmenting acoustic streams like human speech.

Similar articles

Auditory responsive cortex in the squirrel monkey: neural responses to amplitude-modulated sounds.

Exp Brain Res. 1996 Mar;108(2):273-84. doi: 10.1007/BF00228100.

Temporally precise population coding of dynamic sounds by auditory cortex.

J Neurophysiol. 2021 Jul 1;126(1):148-169. doi: 10.1152/jn.00709.2020. Epub 2021 Jun 2.

Cortical Responses to the Amplitude Envelopes of Sounds Change with Age.

J Neurosci. 2021 Jun 9;41(23):5045-5055. doi: 10.1523/JNEUROSCI.2715-20.2021. Epub 2021 Apr 26.

Effects of aging on the response of single neurons to amplitude-modulated noise in primary auditory cortex of rhesus macaque.

J Neurophysiol. 2016 Jun 1;115(6):2911-23. doi: 10.1152/jn.01098.2015. Epub 2016 Mar 2.

Representation of temporal sound features in the human auditory cortex.

Rev Neurosci. 2011;22(2):187-203. doi: 10.1515/RNS.2011.016.

Magnified Neural Envelope Coding Predicts Deficits in Speech Perception in Noise.

J Neurosci. 2017 Aug 9;37(32):7727-7736. doi: 10.1523/JNEUROSCI.2722-16.2017. Epub 2017 Jul 10.

An Emergent Population Code in Primary Auditory Cortex Supports Selective Attention to Spectral and Temporal Sound Features.

J Neurosci. 2021 Sep 8;41(36):7561-7577. doi: 10.1523/JNEUROSCI.0693-20.2021. Epub 2021 Jul 1.

Processing of fast amplitude modulations in bat auditory cortex matches communication call-specific sound features.

J Neurophysiol. 2019 Apr 1;121(4):1501-1512. doi: 10.1152/jn.00748.2018. Epub 2019 Feb 20.

Human Frequency Following Responses to Vocoded Speech.

Ear Hear. 2017 Sep/Oct;38(5):e256-e267. doi: 10.1097/AUD.0000000000000432.

Cited by

Hierarchical emergence of opponent coding in auditory belt cortex.

J Neurophysiol. 2025 Mar 1;133(3):944-964. doi: 10.1152/jn.00519.2024. Epub 2025 Feb 18.

Unsupervised discovery of family specific vocal usage in the Mongolian gerbil.

Elife. 2024 Dec 16;12:RP89892. doi: 10.7554/eLife.89892.

Unsupervised discovery of family specific vocal usage in the Mongolian gerbil.

bioRxiv. 2024 Sep 4:2023.03.11.532197. doi: 10.1101/2023.03.11.532197.

The human auditory system uses amplitude modulation to distinguish music from speech.

PLoS Biol. 2024 May 28;22(5):e3002631. doi: 10.1371/journal.pbio.3002631. eCollection 2024 May.

Sensory cortex plasticity supports auditory social learning.

Nat Commun. 2023 Sep 20;14(1):5828. doi: 10.1038/s41467-023-41641-8.

Parvalbumin neurons enhance temporal coding and reduce cortical noise in complex auditory scenes.

Commun Biol. 2023 Jul 19;6(1):751. doi: 10.1038/s42003-023-05126-0.

Metamodal Coupling of Vibrotactile and Auditory Speech Processing Systems through Matched Stimulus Representations.

J Neurosci. 2023 Jul 5;43(27):4984-4996. doi: 10.1523/JNEUROSCI.1710-22.2023. Epub 2023 May 17.

References

Distinct neuronal types contribute to hybrid temporal encoding strategies in primate auditory cortex.

PLoS Biol. 2022 May 25;20(5):e3001642. doi: 10.1371/journal.pbio.3001642. eCollection 2022 May.

Long-Range GABAergic Projections of Cortical Origin in Brain Function.

Front Syst Neurosci. 2022 Mar 22;16:841869. doi: 10.3389/fnsys.2022.841869. eCollection 2022.

Temporally precise population coding of dynamic sounds by auditory cortex.

J Neurophysiol. 2021 Jul 1;126(1):148-169. doi: 10.1152/jn.00709.2020. Epub 2021 Jun 2.

Differential Short-Term Plasticity of PV and SST Neurons Accounts for Adaptation and Facilitation of Cortical Neurons to Auditory Tones.

J Neurosci. 2020 Nov 25;40(48):9224-9235. doi: 10.1523/JNEUROSCI.0686-20.2020. Epub 2020 Oct 23.

Language prediction mechanisms in human auditory cortex.

Nat Commun. 2020 Oct 16;11(1):5240. doi: 10.1038/s41467-020-19010-6.

Diversity and function of corticopetal and corticofugal GABAergic projection neurons.

Nat Rev Neurosci. 2020 Sep;21(9):499-515. doi: 10.1038/s41583-020-0344-9. Epub 2020 Aug 3.

Deviance detection in physiologically identified cell types in the rat auditory cortex.

Hear Res. 2021 Jan;399:107997. doi: 10.1016/j.heares.2020.107997. Epub 2020 May 21.

A speech envelope landmark for syllable encoding in human superior temporal gyrus.

Sci Adv. 2019 Nov 20;5(11):eaay6279. doi: 10.1126/sciadv.aay6279. eCollection 2019 Nov.

Preserving Inhibition during Developmental Hearing Loss Rescues Auditory Learning and Perception.

J Neurosci. 2019 Oct 16;39(42):8347-8361. doi: 10.1523/JNEUROSCI.0749-19.2019. Epub 2019 Aug 26.

Response properties of single neurons in higher level auditory cortex of adult songbirds.

J Neurophysiol. 2019 Jan 1;121(1):218-237. doi: 10.1152/jn.00751.2018. Epub 2018 Nov 21.
