• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

脑电图中用于稳健声学和语音编码模型的数据集规模考量

Dataset size considerations for robust acoustic and phonetic speech encoding models in EEG.

作者信息

Desai Maansi, Field Alyssa M, Hamilton Liberty S

机构信息

Department of Speech, Language, and Hearing Sciences, Moody College of Communication, The University of Texas at Austin, Austin, TX, United States.

Department of Neurology, Dell Medical School, The University of Texas at Austin, Austin, TX, United States.

出版信息

Front Hum Neurosci. 2023 Jan 20;16:1001171. doi: 10.3389/fnhum.2022.1001171. eCollection 2022.

DOI:10.3389/fnhum.2022.1001171
PMID:36741776
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9895838/
Abstract

In many experiments that investigate auditory and speech processing in the brain using electroencephalography (EEG), the experimental paradigm is often lengthy and tedious. Typically, the experimenter errs on the side of including more data, more trials, and therefore conducting a longer task to ensure that the data are robust and effects are measurable. Recent studies used naturalistic stimuli to investigate the brain's response to individual or a combination of multiple speech features using system identification techniques, such as multivariate temporal receptive field (mTRF) analyses. The neural data collected from such experiments must be divided into a training set and a test set to fit and validate the mTRF weights. While a good strategy is clearly to collect as much data as is feasible, it is unclear how much data are needed to achieve stable results. Furthermore, it is unclear whether the specific stimulus used for mTRF fitting and the choice of feature representation affects how much data would be required for robust and generalizable results. Here, we used previously collected EEG data from our lab using sentence stimuli and movie stimuli as well as EEG data from an open-source dataset using audiobook stimuli to better understand how much data needs to be collected for naturalistic speech experiments measuring acoustic and phonetic tuning. We found that the EEG receptive field structure tested here stabilizes after collecting a training dataset of approximately 200 s of TIMIT sentences, around 600 s of movie trailers training set data, and approximately 460 s of audiobook training set data. Thus, we provide suggestions on the minimum amount of data that would be necessary for fitting mTRFs from naturalistic listening data. Our findings are motivated by highly practical concerns when working with children, patient populations, or others who may not tolerate long study sessions. These findings will aid future researchers who wish to study naturalistic speech processing in healthy and clinical populations while minimizing participant fatigue and retaining signal quality.

摘要

在许多使用脑电图(EEG)研究大脑听觉和言语处理的实验中,实验范式往往冗长乏味。通常,实验者倾向于纳入更多数据、更多试验,从而进行更长时间的任务,以确保数据可靠且效应可测量。最近的研究使用自然主义刺激,通过系统识别技术,如多变量时间感受野(mTRF)分析,来研究大脑对单个或多个言语特征组合的反应。从此类实验收集的神经数据必须分为训练集和测试集,以拟合和验证mTRF权重。虽然显然一个好的策略是尽可能多地收集可行的数据,但尚不清楚需要多少数据才能获得稳定的结果。此外,尚不清楚用于mTRF拟合的特定刺激以及特征表示的选择是否会影响获得可靠且可推广结果所需的数据量。在这里,我们使用了我们实验室之前收集的使用句子刺激和电影刺激的EEG数据,以及来自一个开源数据集的使用有声读物刺激的EEG数据,以更好地了解在测量声学和语音调谐的自然主义言语实验中需要收集多少数据。我们发现,在这里测试的EEG感受野结构在收集大约200秒的TIMIT句子训练数据集、大约600秒的电影预告片训练集数据和大约460秒的有声读物训练集数据后趋于稳定。因此,我们就从自然主义听觉数据拟合mTRF所需的最小数据量提供了建议。我们的研究结果是出于与儿童、患者群体或其他可能无法耐受长时间研究的人群合作时的高度实际考虑。这些发现将有助于未来希望在健康和临床人群中研究自然主义言语处理,同时将参与者疲劳降至最低并保持信号质量的研究人员。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/8bb0b856c188/fnhum-16-1001171-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/90c17a9e4b85/fnhum-16-1001171-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/07157133b988/fnhum-16-1001171-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/e8c79329f699/fnhum-16-1001171-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/8bb0b856c188/fnhum-16-1001171-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/90c17a9e4b85/fnhum-16-1001171-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/07157133b988/fnhum-16-1001171-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/e8c79329f699/fnhum-16-1001171-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab38/9895838/8bb0b856c188/fnhum-16-1001171-g0004.jpg

相似文献

1
Dataset size considerations for robust acoustic and phonetic speech encoding models in EEG.脑电图中用于稳健声学和语音编码模型的数据集规模考量
Front Hum Neurosci. 2023 Jan 20;16:1001171. doi: 10.3389/fnhum.2022.1001171. eCollection 2022.
2
Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.具有自然视听刺激的可泛化 EEG 编码模型。
J Neurosci. 2021 Oct 27;41(43):8946-8962. doi: 10.1523/JNEUROSCI.2891-20.2021. Epub 2021 Sep 9.
3
Errors on a Speech-in-Babble Sentence Recognition Test Reveal Individual Differences in Acoustic Phonetic Perception and Babble Misallocations.嘈杂语音句子识别测试中的错误揭示了声学语音感知和嘈杂语音误分配方面的个体差异。
Ear Hear. 2021 May/Jun;42(3):673-690. doi: 10.1097/AUD.0000000000001020.
4
Effects of directional sound processing and listener's motivation on EEG responses to continuous noisy speech: Do normal-hearing and aided hearing-impaired listeners differ?方向性声音处理和听者动机对连续噪声语音的 EEG 反应的影响:正常听力和助听听力障碍者是否不同?
Hear Res. 2019 Jun;377:260-270. doi: 10.1016/j.heares.2019.04.005. Epub 2019 Apr 11.
5
Linear Modeling of Neurophysiological Responses to Speech and Other Continuous Stimuli: Methodological Considerations for Applied Research.对语音和其他连续刺激的神经生理反应的线性建模:应用研究的方法学考量
Front Neurosci. 2021 Nov 22;15:705621. doi: 10.3389/fnins.2021.705621. eCollection 2021.
6
Isolating Neural Indices of Continuous Speech Processing at the Phonetic Level.在语音层面分离连续语音处理的神经指标。
Adv Exp Med Biol. 2016;894:337-345. doi: 10.1007/978-3-319-25474-6_35.
7
The effects of attention and visual input on the representation of natural speech in EEG.注意力和视觉输入对脑电图中自然语音表征的影响。
Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:2800-3. doi: 10.1109/EMBC.2013.6610122.
8
Automatic auditory processing of english words as indexed by the mismatch negativity, using a multiple deviant paradigm.使用多偏差范式,通过失配负波来索引英语单词的自动听觉处理。
Ear Hear. 2004 Jun;25(3):284-301. doi: 10.1097/01.aud.0000130800.88987.03.
9
Eelbrain, a Python toolkit for time-continuous analysis with temporal response functions.Eelbrain,一个用于使用时间响应函数进行时间连续分析的 Python 工具包。
Elife. 2023 Nov 29;12:e85012. doi: 10.7554/eLife.85012.
10
Robust decoding of the speech envelope from EEG recordings through deep neural networks.通过深度神经网络从 EEG 记录中稳健地解码语音包络。
J Neural Eng. 2022 Jul 6;19(4). doi: 10.1088/1741-2552/ac7976.

引用本文的文献

1
Neural tracking of natural speech: an effective marker for post-stroke aphasia.自然言语的神经追踪:中风后失语症的有效标志物。
Brain Commun. 2025 Mar 10;7(2):fcaf095. doi: 10.1093/braincomms/fcaf095. eCollection 2025.
2
A comparison of EEG encoding models using audiovisual stimuli and their unimodal counterparts.使用视听刺激与单模态刺激的 EEG 编码模型比较。
PLoS Comput Biol. 2024 Sep 9;20(9):e1012433. doi: 10.1371/journal.pcbi.1012433. eCollection 2024 Sep.

本文引用的文献

1
The effects of data quantity on performance of temporal response function analyses of natural speech processing.数据量对自然语音处理时间响应函数分析性能的影响。
Front Neurosci. 2023 Jan 12;16:963629. doi: 10.3389/fnins.2022.963629. eCollection 2022.
2
The neural response at the fundamental frequency of speech is modulated by word-level acoustic and linguistic information.语音基频的神经反应受到词汇层面的声学和语言信息的调制。
Front Neurosci. 2022 Jul 22;16:915744. doi: 10.3389/fnins.2022.915744. eCollection 2022.
3
EEG variability: Task-driven or subject-driven signal of interest?
脑电图变异性:任务驱动还是受试者驱动的感兴趣信号?
Neuroimage. 2022 May 15;252:119034. doi: 10.1016/j.neuroimage.2022.119034. Epub 2022 Mar 1.
4
Beyond t test and ANOVA: applications of mixed-effects models for more rigorous statistical analysis in neuroscience research.超越 t 检验和 ANOVA:混合效应模型在神经科学研究中更严格的统计分析中的应用。
Neuron. 2022 Jan 5;110(1):21-35. doi: 10.1016/j.neuron.2021.10.030. Epub 2021 Nov 15.
5
Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.具有自然视听刺激的可泛化 EEG 编码模型。
J Neurosci. 2021 Oct 27;41(43):8946-8962. doi: 10.1523/JNEUROSCI.2891-20.2021. Epub 2021 Sep 9.
6
Basis profile curve identification to understand electrical stimulation effects in human brain networks.基于轮廓曲线识别以了解人脑网络中的电刺激效应。
PLoS Comput Biol. 2021 Sep 2;17(9):e1008710. doi: 10.1371/journal.pcbi.1008710. eCollection 2021 Sep.
7
Semantic Context Enhances the Early Auditory Encoding of Natural Speech.语义语境增强了对自然语音的早期听觉编码。
J Neurosci. 2019 Sep 18;39(38):7564-7575. doi: 10.1523/JNEUROSCI.0584-19.2019. Epub 2019 Aug 1.
8
Prosodic pitch processing is represented in delta-band EEG and is dissociable from the cortical tracking of other acoustic and phonetic features.韵律音高加工在 delta 频段的 EEG 中得到体现,并且与皮质对其他声学和语音特征的跟踪是可分离的。
Eur J Neurosci. 2019 Dec;50(11):3831-3842. doi: 10.1111/ejn.14510. Epub 2019 Aug 1.
9
Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.连续语音的听觉到语言表示的快速转换。
Curr Biol. 2018 Dec 17;28(24):3976-3983.e5. doi: 10.1016/j.cub.2018.10.042. Epub 2018 Nov 29.
10
A Spatial Map of Onset and Sustained Responses to Speech in the Human Superior Temporal Gyrus.人类上颞 gyrus 中言语起始和持续反应的空间图谱
Curr Biol. 2018 Jun 18;28(12):1860-1871.e4. doi: 10.1016/j.cub.2018.04.033. Epub 2018 May 31.