• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于大型语言模型估计的词汇可预测性与噪声中语音声学的神经追踪相关联。

Neural Tracking of Speech Acoustics in Noise Is Coupled with Lexical Predictability as Estimated by Large Language Models.

机构信息

Department of Speech, Hearing and Phonetic Sciences, University College London, London WC1N 1PF, United Kingdom

School of Digital Humanities and Computational Social Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea.

出版信息

eNeuro. 2024 Aug 20;11(8). doi: 10.1523/ENEURO.0507-23.2024. Print 2024 Aug.

DOI:10.1523/ENEURO.0507-23.2024
PMID:39095091
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11335968/
Abstract

Adults heard recordings of two spatially separated speakers reading newspaper and magazine articles. They were asked to listen to one of them and ignore the other, and EEG was recorded to assess their neural processing. Machine learning extracted neural sources that tracked the target and distractor speakers at three levels: the acoustic envelope of speech (delta- and theta-band modulations), lexical frequency for individual words, and the contextual predictability of individual words estimated by GPT-4 and earlier lexical models. To provide a broader view of speech perception, half of the subjects completed a simultaneous visual task, and the listeners included both native and non-native English speakers. Distinct neural components were extracted for these levels of auditory and lexical processing, demonstrating that native English speakers had greater target-distractor separation compared with non-native English speakers on most measures, and that lexical processing was reduced by the visual task. Moreover, there was a novel interaction of lexical predictability and frequency with auditory processing; acoustic tracking was stronger for lexically harder words, suggesting that people listened harder to the acoustics when needed for lexical selection. This demonstrates that speech perception is not simply a feedforward process from acoustic processing to the lexicon. Rather, the adaptable context-sensitive processing long known to occur at a lexical level has broader consequences for perception, coupling with the acoustic tracking of individual speakers in noise.

摘要

成年人听了两段来自两个空间分离的说话者的朗读的报纸和杂志文章的录音,他们被要求只听其中一个说话者的内容而忽略另一个。研究人员记录他们的脑电活动以评估他们的神经处理过程。机器学习从三个层面提取了跟踪目标和干扰说话者的神经源:言语的声谱包络(delta 和 theta 波段调制)、单个单词的词汇频率,以及由 GPT-4 和早期词汇模型估计的单个单词的上下文可预测性。为了更全面地了解言语感知,一半的受试者完成了同时的视觉任务,参与者包括英语母语者和非母语者。在这些听觉和词汇处理层面上,都提取到了不同的神经成分,这表明在大多数指标上,英语母语者比非英语母语者具有更强的目标-干扰者分离能力,而视觉任务则降低了词汇处理能力。此外,词汇可预测性和频率与听觉处理之间还存在一种新的相互作用;对于词汇较难的单词,声学跟踪更强,这表明当需要进行词汇选择时,人们会更努力地倾听声学信息。这表明言语感知不仅仅是从声学处理到词汇的简单前馈过程。相反,在词汇层面上长期存在的适应性、上下文敏感处理对感知具有更广泛的影响,与噪声中单个说话者的声学跟踪相结合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/d53e029dbbca/eneuro-11-ENEURO.0507-23.2024-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/e06ac42ae5dd/eneuro-11-ENEURO.0507-23.2024-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/17c3995dbec0/eneuro-11-ENEURO.0507-23.2024-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/608bf2567cf9/eneuro-11-ENEURO.0507-23.2024-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/b12ad1cfe025/eneuro-11-ENEURO.0507-23.2024-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/fc3bde73f086/eneuro-11-ENEURO.0507-23.2024-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/d53e029dbbca/eneuro-11-ENEURO.0507-23.2024-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/e06ac42ae5dd/eneuro-11-ENEURO.0507-23.2024-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/17c3995dbec0/eneuro-11-ENEURO.0507-23.2024-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/608bf2567cf9/eneuro-11-ENEURO.0507-23.2024-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/b12ad1cfe025/eneuro-11-ENEURO.0507-23.2024-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/fc3bde73f086/eneuro-11-ENEURO.0507-23.2024-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b0b5/11335968/d53e029dbbca/eneuro-11-ENEURO.0507-23.2024-g006.jpg

相似文献

1
Neural Tracking of Speech Acoustics in Noise Is Coupled with Lexical Predictability as Estimated by Large Language Models.基于大型语言模型估计的词汇可预测性与噪声中语音声学的神经追踪相关联。
eNeuro. 2024 Aug 20;11(8). doi: 10.1523/ENEURO.0507-23.2024. Print 2024 Aug.
2
Auditory neural tracking and lexical processing of speech in noise: Masker type, spatial location, and language experience.噪声环境下语音的听觉神经追踪与词汇加工:掩蔽类型、空间位置及语言经验
J Acoust Soc Am. 2020 Jul;148(1):253. doi: 10.1121/10.0001477.
3
Neural Speech Tracking in the Theta and in the Delta Frequency Band Differentially Encode Clarity and Comprehension of Speech in Noise.在θ和δ频段的神经语音跟踪对语音清晰度和噪声中理解的编码有差异。
J Neurosci. 2019 Jul 17;39(29):5750-5759. doi: 10.1523/JNEUROSCI.1828-18.2019. Epub 2019 May 20.
4
Listening effort during speech perception enhances auditory and lexical processing for non-native listeners and accents.在言语感知过程中集中注意力可以增强非母语者和非本地口音者的听觉和词汇处理能力。
Cognition. 2018 Oct;179:163-170. doi: 10.1016/j.cognition.2018.06.001. Epub 2018 Jun 26.
5
The differential roles of lexical and sublexical processing during spoken-word recognition in clear and in noise.在清晰和嘈杂的语音识别中词汇和次词汇处理的差异作用。
Cortex. 2022 Jun;151:70-88. doi: 10.1016/j.cortex.2022.02.011. Epub 2022 Mar 16.
6
Effects of language experience and stimulus context on the neural organization and categorical perception of speech.语言经验和刺激情境对言语神经组织及范畴知觉的影响。
Neuroimage. 2015 Oct 15;120:191-200. doi: 10.1016/j.neuroimage.2015.06.087. Epub 2015 Jul 3.
7
Thai lexical tone perception in native speakers of Thai, English and Mandarin Chinese: an event-related potentials training study.泰语母语者、英语母语者和汉语普通话母语者对泰语词汇声调的感知:一项事件相关电位训练研究。
BMC Neurosci. 2008 Jun 23;9:53. doi: 10.1186/1471-2202-9-53.
8
Listening Effort by Native and Nonnative Listeners Due to Noise, Reverberation, and Talker Foreign Accent During English Speech Perception.母语和非母语听者在英语语音感知中因噪声、混响和说话者外国口音而产生的听力努力。
J Speech Lang Hear Res. 2019 Apr 15;62(4):1068-1081. doi: 10.1044/2018_JSLHR-H-17-0423.
9
Distinct roles of delta- and theta-band neural tracking for sharpening and predictive coding of multi-level speech features during spoken language processing.在口语处理过程中,Delta 和 theta 波段神经追踪对多级语音特征的锐化和预测编码具有不同的作用。
Hum Brain Mapp. 2023 Dec 1;44(17):6149-6172. doi: 10.1002/hbm.26503. Epub 2023 Oct 11.
10
Effects of Task Demands on Neural Correlates of Acoustic and Semantic Processing in Challenging Listening Conditions.任务需求对在挑战性聆听条件下的声学和语义处理的神经相关物的影响。
J Speech Lang Hear Res. 2021 Sep 14;64(9):3697-3706. doi: 10.1044/2021_JSLHR-21-00006. Epub 2021 Aug 17.

本文引用的文献

1
Time Scales and Moments of Listening Effort Revealed in Pupillometry.瞳孔测量法揭示的听力努力的时间尺度和瞬间
Semin Hear. 2023 Apr 4;44(2):106-123. doi: 10.1055/s-0043-1767741. eCollection 2023 May.
2
Distracting linguistic information impairs neural tracking of attended speech.分散注意力的语言信息会损害对所关注语音的神经追踪。
Curr Res Neurobiol. 2022 May 28;3:100043. doi: 10.1016/j.crneur.2022.100043. eCollection 2022.
3
A hierarchy of linguistic predictions during natural language comprehension.自然语言理解过程中的语言预测层次。
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2201968119. doi: 10.1073/pnas.2201968119. Epub 2022 Aug 3.
4
Linear Modeling of Neurophysiological Responses to Speech and Other Continuous Stimuli: Methodological Considerations for Applied Research.对语音和其他连续刺激的神经生理反应的线性建模:应用研究的方法学考量
Front Neurosci. 2021 Nov 22;15:705621. doi: 10.3389/fnins.2021.705621. eCollection 2021.
5
Neural Markers of Speech Comprehension: Measuring EEG Tracking of Linguistic Speech Representations, Controlling the Speech Acoustics.言语理解的神经标记物:测量 EEG 追踪语言言语表征,控制言语声学。
J Neurosci. 2021 Dec 15;41(50):10316-10329. doi: 10.1523/JNEUROSCI.0812-21.2021. Epub 2021 Nov 3.
6
An electrophysiological megastudy of spoken word recognition.一项关于口语单词识别的电生理大型研究。
Lang Cogn Neurosci. 2018;33(8):1063-1082. doi: 10.1080/23273798.2018.1455985. Epub 2018 Mar 27.
7
Dissociable electrophysiological measures of natural language processing reveal differences in speech comprehension strategy in healthy ageing.分离的自然语言处理的电生理测量揭示了健康老化中言语理解策略的差异。
Sci Rep. 2021 Mar 2;11(1):4963. doi: 10.1038/s41598-021-84597-9.
8
Continuous speech processing.连续语音处理
Curr Opin Physiol. 2020 Dec;18:25-31. doi: 10.1016/j.cophys.2020.07.014. Epub 2020 Jul 28.
9
Auditory neural tracking and lexical processing of speech in noise: Masker type, spatial location, and language experience.噪声环境下语音的听觉神经追踪与词汇加工:掩蔽类型、空间位置及语言经验
J Acoust Soc Am. 2020 Jul;148(1):253. doi: 10.1121/10.0001477.
10
Degradation levels of continuous speech affect neural speech tracking and alpha power differently.连续语音的降解水平对神经语音跟踪和 alpha 功率的影响不同。
Eur J Neurosci. 2022 Jun;55(11-12):3288-3302. doi: 10.1111/ejn.14912. Epub 2020 Aug 7.