• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

适当的数据分割可改善语音编码模型:电生理记录的分析与模拟

Appropriate data segmentation improves speech encoding models: Analysis and simulation of electrophysiological recordings.

作者信息

Bialas Ole, Lalor Edmund C

机构信息

Department of Biomedical Engineering, University of Rochester, Rochester, New York, United States of America.

Del Monte Institute for Neuroscience , University of Rochester, Rochester, New York, United States of America.

出版信息

PLoS One. 2025 May 23;20(5):e0323276. doi: 10.1371/journal.pone.0323276. eCollection 2025.

DOI:10.1371/journal.pone.0323276
PMID:40408314
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12101633/
Abstract

BACKGROUND

In recent decades, studies modeling the neural processing of continuous, naturalistic, speech provided new insights into how speech and language are represented in the brain. However, the linear encoder models commonly used in such studies assume that the underlying data are stationary, varying to a fixed degree around a constant mean. Long, continuous, neural recordings may violate this assumption leading to impaired model performance. We aimed to examine the effect of non-stationary trends in continuous neural recordings on the performance of linear speech encoding models.

METHODS

We used temporal response functions (TRFs) to predict continuous neural responses to speech while splitting the data into segments of varying length, prior to model fitting. Our Hypothesis was that if the data were non-stationary, segmentation should improve model performance by making individual segments approximately stationary. We simulated and predicted stationary and non-stationary recordings to test our hypothesis under a known ground truth and predicted the brain activity of participants who listened to a narrated story, to test our hypothesis on actual neural recordings.

RESULTS

Simulations showed that, for stationary data, increasing segmentation steadily decreased model performance. For non-stationary data however, segmentation initially improved model performance. Modeling of neural recordings yielded similar results: segments of intermediate length (5-15 s) led to improved model performance compared to very short (1-2 s) and very long (30-120 s) segments.

CONCLUSIONS

We showed that data segmentation improves the performance of encoding models for both simulated and real neural data and that this can be explained by the fact that shorter segments approximate stationarity more closely. Thus, the common practice of applying encoding models to long continuous segments of data is suboptimal and recordings should be segmented prior to modeling.

摘要

背景

近几十年来,对连续、自然语音的神经处理进行建模的研究为语音和语言在大脑中的表征方式提供了新的见解。然而,此类研究中常用的线性编码器模型假设基础数据是平稳的,即在恒定均值周围以固定程度变化。长时间的连续神经记录可能会违反这一假设,导致模型性能受损。我们旨在研究连续神经记录中的非平稳趋势对线性语音编码模型性能的影响。

方法

在进行模型拟合之前,我们使用时间响应函数(TRF)来预测对语音的连续神经反应,同时将数据分割成不同长度的片段。我们的假设是,如果数据是非平稳的,分割应通过使各个片段近似平稳来提高模型性能。我们模拟并预测了平稳和非平稳记录,以在已知的真实情况下检验我们的假设,并预测了听叙述故事的参与者的大脑活动,以在实际神经记录上检验我们的假设。

结果

模拟表明,对于平稳数据,增加分割会稳步降低模型性能。然而,对于非平稳数据,分割最初会提高模型性能。神经记录的建模产生了类似的结果:与非常短(1 - 2秒)和非常长(30 - 120秒)的片段相比,中等长度(5 - 15秒)的片段导致模型性能提高。

结论

我们表明,数据分割提高了模拟和真实神经数据编码模型的性能,这可以通过较短片段更接近平稳性这一事实来解释。因此,将编码模型应用于长连续数据段的常见做法是次优的,记录应在建模之前进行分割。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/f001e8345803/pone.0323276.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/d70ffd60e94a/pone.0323276.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/bc98922c48b6/pone.0323276.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/f001e8345803/pone.0323276.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/d70ffd60e94a/pone.0323276.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/bc98922c48b6/pone.0323276.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/771a/12101633/f001e8345803/pone.0323276.g003.jpg

相似文献

1
Appropriate data segmentation improves speech encoding models: Analysis and simulation of electrophysiological recordings.适当的数据分割可改善语音编码模型:电生理记录的分析与模拟
PLoS One. 2025 May 23;20(5):e0323276. doi: 10.1371/journal.pone.0323276. eCollection 2025.
2
Robust cortical encoding of slow temporal modulations of speech.语音缓慢时变调制的稳健皮质编码。
Adv Exp Med Biol. 2013;787:373-81. doi: 10.1007/978-1-4614-1590-9_41.
3
Neuro-current response functions: A unified approach to MEG source analysis under the continuous stimuli paradigm.神经电流响应函数:连续刺激范式下脑磁图源分析的统一方法。
Neuroimage. 2020 May 1;211:116528. doi: 10.1016/j.neuroimage.2020.116528. Epub 2020 Jan 13.
4
Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.具有自然视听刺激的可泛化 EEG 编码模型。
J Neurosci. 2021 Oct 27;41(43):8946-8962. doi: 10.1523/JNEUROSCI.2891-20.2021. Epub 2021 Sep 9.
5
Neural source dynamics of brain responses to continuous stimuli: Speech processing from acoustics to comprehension.连续刺激下大脑反应的神经源动力学:从声学处理到理解的言语加工。
Neuroimage. 2018 May 15;172:162-174. doi: 10.1016/j.neuroimage.2018.01.042. Epub 2018 Feb 3.
6
Dynamic modeling of EEG responses to natural speech reveals earlier processing of predictable words.脑电图对自然语音反应的动态建模揭示了可预测单词的早期处理过程。
PLoS Comput Biol. 2025 Apr 28;21(4):e1013006. doi: 10.1371/journal.pcbi.1013006. eCollection 2025 Apr.
7
Deep-learning models reveal how context and listener attention shape electrophysiological correlates of speech-to-language transformation.深度学习模型揭示了语境和听众注意力如何塑造言语到语言转换的电生理相关性。
PLoS Comput Biol. 2024 Nov 11;20(11):e1012537. doi: 10.1371/journal.pcbi.1012537. eCollection 2024 Nov.
8
Adaptive temporal encoding leads to a background-insensitive cortical representation of speech.自适应时编码导致语音的背景不敏感的皮质表示。
J Neurosci. 2013 Mar 27;33(13):5728-35. doi: 10.1523/JNEUROSCI.5297-12.2013.
9
Using Coherence-based spectro-spatial filters for stimulus features prediction from electro-corticographic recordings.使用基于相干性的谱-空间滤波器从脑电记录中预测刺激特征。
Sci Rep. 2020 May 6;10(1):7637. doi: 10.1038/s41598-020-63303-1.
10
Semantic Context Enhances the Early Auditory Encoding of Natural Speech.语义语境增强了对自然语音的早期听觉编码。
J Neurosci. 2019 Sep 18;39(38):7564-7575. doi: 10.1523/JNEUROSCI.0584-19.2019. Epub 2019 Aug 1.

本文引用的文献

1
Neural dynamics of phoneme sequences reveal position-invariant code for content and order.音素序列的神经动力学揭示了内容和顺序的位置不变代码。
Nat Commun. 2022 Nov 3;13(1):6606. doi: 10.1038/s41467-022-34326-1.
2
The revolution will not be controlled: natural stimuli in speech neuroscience.这场革命无法被控制:言语神经科学中的自然刺激
Lang Cogn Neurosci. 2018 Jul 22;35(5):573-582. doi: 10.1080/23273798.2018.1499946. eCollection 2020.
3
Source-Informed Segmentation: A Data-Driven Approach for the Temporal Segmentation of EEG.
基于源信息的分段:一种 EEG 时间分段的数 据驱动方法。
IEEE Trans Biomed Eng. 2019 May;66(5):1429-1446. doi: 10.1109/TBME.2018.2874167. Epub 2018 Oct 4.
4
Electrophysiological Correlates of Semantic Dissimilarity Reflect the Comprehension of Natural, Narrative Speech.语义相似度的电生理相关性反映了对自然、叙事性言语的理解。
Curr Biol. 2018 Mar 5;28(5):803-809.e3. doi: 10.1016/j.cub.2018.01.080. Epub 2018 Feb 22.
5
Decline of long-range temporal correlations in the human brain during sustained wakefulness.人类大脑在持续清醒过程中长程时间相关性的下降。
Sci Rep. 2017 Sep 19;7(1):11825. doi: 10.1038/s41598-017-12140-w.
6
The Multivariate Temporal Response Function (mTRF) Toolbox: A MATLAB Toolbox for Relating Neural Signals to Continuous Stimuli.多元时间响应函数(mTRF)工具箱:一个用于将神经信号与连续刺激相关联的MATLAB工具箱。
Front Hum Neurosci. 2016 Nov 30;10:604. doi: 10.3389/fnhum.2016.00604. eCollection 2016.
7
The auditory representation of speech sounds in human motor cortex.人类运动皮层中语音的听觉表征。
Elife. 2016 Mar 4;5:e12577. doi: 10.7554/eLife.12577.
8
An intelligent approach for variable size segmentation of non-stationary signals.一种用于非平稳信号变粒度分割的智能方法。
J Adv Res. 2015 Sep;6(5):687-98. doi: 10.1016/j.jare.2014.03.004. Epub 2014 Mar 19.
9
Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing.对语音的低频皮层夹带反映音素水平的加工。
Curr Biol. 2015 Oct 5;25(19):2457-65. doi: 10.1016/j.cub.2015.08.030. Epub 2015 Sep 24.
10
Age-Related Changes in 1/f Neural Electrophysiological Noise.1/f 神经电生理噪声的年龄相关变化
J Neurosci. 2015 Sep 23;35(38):13257-65. doi: 10.1523/JNEUROSCI.2332-14.2015.