• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

连续语音的听觉到语言表示的快速转换。

Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.

机构信息

Institute for Systems Research, University of Maryland, College Park, MD 20742, USA.

Department of Psychiatry, Maryland Psychiatric Research Center, University of Maryland School of Medicine, Baltimore, MD 21201, USA.

出版信息

Curr Biol. 2018 Dec 17;28(24):3976-3983.e5. doi: 10.1016/j.cub.2018.10.042. Epub 2018 Nov 29.

DOI:10.1016/j.cub.2018.10.042
PMID:30503620
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6339854/
Abstract

During speech perception, a central task of the auditory cortex is to analyze complex acoustic patterns to allow detection of the words that encode a linguistic message [1]. It is generally thought that this process includes at least one intermediate, phonetic, level of representations [2-6], localized bilaterally in the superior temporal lobe [7-9]. Phonetic representations reflect a transition from acoustic to linguistic information, classifying acoustic patterns into linguistically meaningful units, which can serve as input to mechanisms that access abstract word representations [10, 11]. While recent research has identified neural signals arising from successful recognition of individual words in continuous speech [12-15], no explicit neurophysiological signal has been found demonstrating the transition from acoustic and/or phonetic to symbolic, lexical representations. Here, we report a response reflecting the incremental integration of phonetic information for word identification, dominantly localized to the left temporal lobe. The short response latency, approximately 114 ms relative to phoneme onset, suggests that phonetic information is used for lexical processing as soon as it becomes available. Responses also tracked word boundaries, confirming previous reports of immediate lexical segmentation [16, 17]. These new results were further investigated using a cocktail-party paradigm [18, 19] in which participants listened to a mix of two talkers, attending to one and ignoring the other. Analysis indicates neural lexical processing of only the attended, but not the unattended, speech stream. Thus, while responses to acoustic features reflect attention through selective amplification of attended speech, responses consistent with a lexical processing model reveal categorically selective processing.

摘要

在言语感知过程中,听觉皮层的一项主要任务是分析复杂的声学模式,以检测编码语言信息的单词[1]。人们普遍认为,这个过程至少包括一个中间的、语音的表示层次[2-6],定位于颞叶的双侧[7-9]。语音表示反映了从声学到语言信息的转变,将声学模式分类为具有语言意义的单元,这些单元可以作为访问抽象单词表示的机制的输入[10,11]。虽然最近的研究已经确定了在连续语音中识别单个单词时产生的神经信号[12-15],但尚未发现明确的神经生理信号表明从声学和/或语音到符号、词汇表示的转变。在这里,我们报告了一个反映用于单词识别的语音信息逐步整合的反应,主要定位于左颞叶。大约 114 毫秒相对于音素起始的短反应潜伏期表明,一旦语音信息可用,它就被用于词汇处理。反应也跟踪单词边界,证实了之前关于立即词汇分割的报告[16,17]。使用鸡尾酒会范式[18,19]进一步研究了这些新结果,参与者在其中听两个说话者的混合音,只关注一个说话者而忽略另一个说话者。分析表明,只有被关注的,而不是未被关注的,语音流进行神经词汇处理。因此,虽然对声学特征的反应通过选择性放大被关注的语音来反映注意力,但与词汇处理模型一致的反应则揭示了分类选择性处理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/4c90f41688a6/nihms-1510583-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/d2571b8e5273/nihms-1510583-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/fe99cca2f4b5/nihms-1510583-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/6fdd095ffd09/nihms-1510583-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/4c90f41688a6/nihms-1510583-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/d2571b8e5273/nihms-1510583-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/fe99cca2f4b5/nihms-1510583-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/6fdd095ffd09/nihms-1510583-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/45a3/6339854/4c90f41688a6/nihms-1510583-f0004.jpg

相似文献

1
Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.连续语音的听觉到语言表示的快速转换。
Curr Biol. 2018 Dec 17;28(24):3976-3983.e5. doi: 10.1016/j.cub.2018.10.042. Epub 2018 Nov 29.
2
Sublexical properties of spoken words modulate activity in Broca's area but not superior temporal cortex: implications for models of speech recognition.语音词的亚词汇性质调节布罗卡区的活动,但不调节颞上皮质的活动:对语音识别模型的启示。
J Cogn Neurosci. 2011 Oct;23(10):2665-74. doi: 10.1162/jocn.2011.21620. Epub 2011 Jan 24.
3
Left Superior Temporal Gyrus Is Coupled to Attended Speech in a Cocktail-Party Auditory Scene.左颞上回在鸡尾酒会听觉场景中与被关注的语音相关联。
J Neurosci. 2016 Feb 3;36(5):1596-606. doi: 10.1523/JNEUROSCI.1730-15.2016.
4
The influence of lexical statistics on temporal lobe cortical dynamics during spoken word listening.词汇统计对口语单词聆听过程中颞叶皮层动力学的影响。
Brain Lang. 2015 Aug;147:66-75. doi: 10.1016/j.bandl.2015.05.005. Epub 2015 Jun 11.
5
In Spoken Word Recognition, the Future Predicts the Past.在口语识别中,未来预示着过去。
J Neurosci. 2018 Aug 29;38(35):7585-7599. doi: 10.1523/JNEUROSCI.0065-18.2018. Epub 2018 Jul 16.
6
Attention Differentially Affects Acoustic and Phonetic Feature Encoding in a Multispeaker Environment.注意在多说话人环境中对声学和语音特征编码的影响不同。
J Neurosci. 2022 Jan 26;42(4):682-691. doi: 10.1523/JNEUROSCI.1455-20.2021. Epub 2021 Dec 10.
7
Dynamic Encoding of Acoustic Features in Neural Responses to Continuous Speech.对连续语音的神经反应中声学特征的动态编码
J Neurosci. 2017 Feb 22;37(8):2176-2185. doi: 10.1523/JNEUROSCI.2383-16.2017. Epub 2017 Jan 24.
8
Neural Tuning to Low-Level Features of Speech throughout the Perisylvian Cortex.整个外侧裂周皮层对语音低层次特征的神经调谐。
J Neurosci. 2017 Aug 16;37(33):7906-7920. doi: 10.1523/JNEUROSCI.0238-17.2017. Epub 2017 Jul 17.
9
Latent neural dynamics encode temporal context in speech.潜伏的神经动力学编码了语音中的时间上下文。
Hear Res. 2023 Sep 15;437:108838. doi: 10.1016/j.heares.2023.108838. Epub 2023 Jul 4.
10
Functionally integrated neural processing of linguistic and talker information: An event-related fMRI and ERP study.语言和说话者信息的功能整合神经处理:一项事件相关功能磁共振成像和事件相关电位研究。
Neuroimage. 2016 Jan 1;124(Pt A):536-549. doi: 10.1016/j.neuroimage.2015.08.064. Epub 2015 Sep 4.

引用本文的文献

1
Temporal integration in human auditory cortex is predominantly yoked to absolute time.人类听觉皮层中的时间整合主要与绝对时间相关联。
Nat Neurosci. 2025 Sep 18. doi: 10.1038/s41593-025-02060-8.
2
Are you talking to me? How the choice of speech register impacts listeners' hierarchical encoding of speech.你在跟我说话吗?言语语域的选择如何影响听众对言语的层次编码。
Imaging Neurosci (Camb). 2025 Apr 17;3. doi: 10.1162/imag_a_00539. eCollection 2025.
3
Application of machine learning and temporal response function modeling of EEG data for differential diagnosis in primary progressive aphasia.

本文引用的文献

1
A Spatial Map of Onset and Sustained Responses to Speech in the Human Superior Temporal Gyrus.人类上颞 gyrus 中言语起始和持续反应的空间图谱
Curr Biol. 2018 Jun 18;28(12):1860-1871.e4. doi: 10.1016/j.cub.2018.04.033. Epub 2018 May 31.
2
Electrophysiological Correlates of Semantic Dissimilarity Reflect the Comprehension of Natural, Narrative Speech.语义相似度的电生理相关性反映了对自然、叙事性言语的理解。
Curr Biol. 2018 Mar 5;28(5):803-809.e3. doi: 10.1016/j.cub.2018.01.080. Epub 2018 Feb 22.
3
Neural source dynamics of brain responses to continuous stimuli: Speech processing from acoustics to comprehension.
机器学习及脑电图数据的时间响应函数建模在原发性进行性失语鉴别诊断中的应用
Sci Rep. 2025 Aug 12;15(1):29539. doi: 10.1038/s41598-025-13000-8.
4
Reduced Neural Distinctiveness of Speech Representations in the Middle-Aged Brain.中年大脑中语音表征的神经特异性降低。
Neurobiol Lang (Camb). 2025 Jun 18;6. doi: 10.1162/nol_a_00169. eCollection 2025.
5
Recurrent neural networks as neuro-computational models of human speech recognition.作为人类语音识别神经计算模型的循环神经网络。
PLoS Comput Biol. 2025 Jul 28;21(7):e1013244. doi: 10.1371/journal.pcbi.1013244. eCollection 2025 Jul.
6
Neural encoding of linguistic features during natural sentence reading.自然句子阅读过程中语言特征的神经编码
iScience. 2025 May 30;28(7):112798. doi: 10.1016/j.isci.2025.112798. eCollection 2025 Jul 18.
7
Optimized feature gains explain and predict successes and failures of human selective listening.优化后的特征增益能够解释并预测人类选择性听力的成败。
bioRxiv. 2025 May 28:2025.05.28.656682. doi: 10.1101/2025.05.28.656682.
8
Neural Speech Tracking during Selective Attention: A Spatially Realistic Audiovisual Study.选择性注意期间的神经语音追踪:一项空间逼真的视听研究。
eNeuro. 2025 Jun 24;12(6). doi: 10.1523/ENEURO.0132-24.2025. Print 2025 Jun.
9
Le Petit Prince (LPP) multi-talker: Naturalistic 7 T fMRI and EEG dataset.《小王子》多说话者:自然主义7T功能磁共振成像和脑电图数据集
Sci Data. 2025 May 20;12(1):829. doi: 10.1038/s41597-025-05158-7.
10
Dynamic modeling of EEG responses to natural speech reveals earlier processing of predictable words.脑电图对自然语音反应的动态建模揭示了可预测单词的早期处理过程。
PLoS Comput Biol. 2025 Apr 28;21(4):e1013006. doi: 10.1371/journal.pcbi.1013006. eCollection 2025 Apr.
连续刺激下大脑反应的神经源动力学:从声学处理到理解的言语加工。
Neuroimage. 2018 May 15;172:162-174. doi: 10.1016/j.neuroimage.2018.01.042. Epub 2018 Feb 3.
4
Attention Is Required for Knowledge-Based Sequential Grouping: Insights from the Integration of Syllables into Words.注意:基于知识的序列分组需要注意:从音节到单词的整合中得到的启示。
J Neurosci. 2018 Jan 31;38(5):1178-1188. doi: 10.1523/JNEUROSCI.2606-17.2017. Epub 2017 Dec 18.
5
Phonemes: Lexical access and beyond.音位:词汇通达与超越
Psychon Bull Rev. 2018 Apr;25(2):560-585. doi: 10.3758/s13423-017-1362-0.
6
Cortical Representations of Speech in a Multitalker Auditory Scene.多说话者听觉场景中语音的皮质表征
J Neurosci. 2017 Sep 20;37(38):9189-9196. doi: 10.1523/JNEUROSCI.0938-17.2017. Epub 2017 Aug 18.
7
Sensitivity to change in perception of speech.对言语感知变化的敏感性。
Speech Commun. 2003 Aug;41(1):59-69. doi: 10.1016/S0167-6393(02)00093-6.
8
Cortical tracking of hierarchical linguistic structures in connected speech.连贯言语中层次语言结构的皮层追踪。
Nat Neurosci. 2016 Jan;19(1):158-64. doi: 10.1038/nn.4186. Epub 2015 Dec 7.
9
Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing.对语音的低频皮层夹带反映音素水平的加工。
Curr Biol. 2015 Oct 5;25(19):2457-65. doi: 10.1016/j.cub.2015.08.030. Epub 2015 Sep 24.
10
Non-linear processing of a linear speech stream: The influence of morphological structure on the recognition of spoken Arabic words.线性语音流的非线性处理:形态结构对阿拉伯语口语单词识别的影响。
Brain Lang. 2015 Aug;147:1-13. doi: 10.1016/j.bandl.2015.04.006. Epub 2015 May 18.