• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

个体差异、非平稳性的影响,以及 EEG 跨参与者模型的训练和测试中数据分区决策的重要性。

The Effects of Individual Differences, Non-Stationarity, and the Importance of Data Partitioning Decisions for Training and Testing of EEG Cross-Participant Models.

机构信息

Department of Electrical and Computer Engineering, Air Force Institute of Technology, Wright-Patterson AFB, OH 45433, USA.

出版信息

Sensors (Basel). 2021 May 6;21(9):3225. doi: 10.3390/s21093225.

DOI:10.3390/s21093225
PMID:34066595
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8125354/
Abstract

EEG-based deep learning models have trended toward models that are designed to perform classification on any individual (cross-participant models). However, because EEG varies across participants due to non-stationarity and individual differences, certain guidelines must be followed for partitioning data into training, validation, and testing sets, in order for cross-participant models to avoid overestimation of model accuracy. Despite this necessity, the majority of EEG-based cross-participant models have not adopted such guidelines. Furthermore, some data repositories may unwittingly contribute to the problem by providing partitioned test and non-test datasets for reasons such as competition support. In this study, we demonstrate how improper dataset partitioning and the resulting improper training, validation, and testing of a cross-participant model leads to overestimated model accuracy. We demonstrate this mathematically, and empirically, using five publicly available datasets. To build the cross-participant models for these datasets, we replicate published results and demonstrate how the model accuracies are significantly reduced when proper EEG cross-participant model guidelines are followed. Our empirical results show that by not following these guidelines, error rates of cross-participant models can be underestimated between 35% and 3900%. This misrepresentation of model performance for the general population potentially slows scientific progress toward truly high-performing classification models.

摘要

基于 EEG 的深度学习模型倾向于设计为对任何个体进行分类的模型(跨参与者模型)。然而,由于 EEG 因非平稳性和个体差异而在参与者之间发生变化,因此必须遵循某些准则将数据划分为训练集、验证集和测试集,以便跨参与者模型避免过度估计模型准确性。尽管有此必要,但大多数基于 EEG 的跨参与者模型并未采用此类准则。此外,一些数据存储库可能由于竞争支持等原因,无意中通过提供分区的测试和非测试数据集来促成该问题。在这项研究中,我们展示了不当的数据分区以及由此导致的跨参与者模型的不当训练、验证和测试如何导致模型准确性被高估。我们通过数学和实证方法证明了这一点,使用了五个公开可用的数据集。为了为这些数据集构建跨参与者模型,我们复制了已发布的结果,并展示了在遵循适当的 EEG 跨参与者模型准则时,模型准确性如何显著降低。我们的实证结果表明,如果不遵循这些准则,跨参与者模型的错误率可能会低估 35%至 3900%。这种对模型性能在一般人群中的代表性不足,可能会减缓朝着真正高性能分类模型的科学进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/947b96c646b0/sensors-21-03225-g008a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/226165119a2d/sensors-21-03225-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/382bd47e0151/sensors-21-03225-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/d0412c80b5ae/sensors-21-03225-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/b55aaeab7972/sensors-21-03225-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/c9c642fac194/sensors-21-03225-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/70fff1689eb9/sensors-21-03225-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/a8183d389bf6/sensors-21-03225-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/947b96c646b0/sensors-21-03225-g008a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/226165119a2d/sensors-21-03225-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/382bd47e0151/sensors-21-03225-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/d0412c80b5ae/sensors-21-03225-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/b55aaeab7972/sensors-21-03225-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/c9c642fac194/sensors-21-03225-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/70fff1689eb9/sensors-21-03225-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/a8183d389bf6/sensors-21-03225-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/28ba/8125354/947b96c646b0/sensors-21-03225-g008a.jpg

相似文献

1
The Effects of Individual Differences, Non-Stationarity, and the Importance of Data Partitioning Decisions for Training and Testing of EEG Cross-Participant Models.个体差异、非平稳性的影响,以及 EEG 跨参与者模型的训练和测试中数据分区决策的重要性。
Sensors (Basel). 2021 May 6;21(9):3225. doi: 10.3390/s21093225.
2
Covariate shift estimation based adaptive ensemble learning for handling non-stationarity in motor imagery related EEG-based brain-computer interface.基于协变量偏移估计的自适应集成学习,用于处理基于运动想象脑电图的脑机接口中的非平稳性。
Neurocomputing (Amst). 2019 May 28;343:154-166. doi: 10.1016/j.neucom.2018.04.087.
3
Generalized Deep Learning EEG Models for Cross-Participant and Cross-Task Detection of the Vigilance Decrement in Sustained Attention Tasks.基于广义深度学习的脑电模型在持续性注意任务警觉下降的跨被试和跨任务检测。
Sensors (Basel). 2021 Aug 20;21(16):5617. doi: 10.3390/s21165617.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Deep learning-based electroencephalography analysis: a systematic review.基于深度学习的脑电图分析:系统评价。
J Neural Eng. 2019 Aug 14;16(5):051001. doi: 10.1088/1741-2552/ab260c.
6
EEG-based emotion charting for Parkinson's disease patients using Convolutional Recurrent Neural Networks and cross dataset learning.基于 EEG 的帕金森病患者情绪图表分析,使用卷积循环神经网络和跨数据集学习。
Comput Biol Med. 2022 May;144:105327. doi: 10.1016/j.compbiomed.2022.105327. Epub 2022 Mar 11.
7
MaDeNet: Disentangling Individuality of EEG Signals through Feature Space Mapping and Detachment.
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:260-263. doi: 10.1109/EMBC44109.2020.9176301.
8
Data augmentation for enhancing EEG-based emotion recognition with deep generative models.基于深度生成模型的数据增强以增强基于 EEG 的情绪识别。
J Neural Eng. 2020 Oct 14;17(5):056021. doi: 10.1088/1741-2552/abb580.
9
Multi-class motor imagery EEG classification method with high accuracy and low individual differences based on hybrid neural network.基于混合神经网络的高精度、低个体差异多类运动想象 EEG 分类方法。
J Neural Eng. 2021 Aug 31;18(4). doi: 10.1088/1741-2552/ac1ed0.
10
Constructing a Personalized Cross-Day EEG-Based Emotion-Classification Model Using Transfer Learning.利用迁移学习构建基于个人跨日 EEG 的情绪分类模型。
IEEE J Biomed Health Inform. 2020 May;24(5):1255-1264. doi: 10.1109/JBHI.2019.2934172. Epub 2019 Aug 9.

引用本文的文献

1
Home Robot Interaction Based on EEG Motor Imagery and Visual Perception Fusion.基于脑电运动想象与视觉感知融合的家庭机器人交互
Sensors (Basel). 2025 Sep 6;25(17):5568. doi: 10.3390/s25175568.
2
Domain-generalized Deep Learning for Improved Subject-independent Emotion Recognition Based on Electroencephalography.基于脑电图的领域泛化深度学习用于改进独立于个体的情绪识别
Exp Neurobiol. 2025 Jun 30;34(3):119-130. doi: 10.5607/en25011. Epub 2025 May 14.
3
Exploring Task-Related EEG for Cross-Subject Early Alzheimer's Disease Susceptibility Prediction in Middle-Aged Adults Using Multitaper Spectral Analysis.

本文引用的文献

1
Standardization of Neurotechnology for Brain-Machine Interfacing: State of the Art and Recommendations.用于脑机接口的神经技术标准化:现状与建议
IEEE Open J Eng Med Biol. 2021 Feb 23;2:71-73. doi: 10.1109/OJEMB.2021.3061328. eCollection 2021.
2
The Perils and Pitfalls of Block Design for EEG Classification Experiments.脑电图分类实验中分组设计的风险与陷阱
IEEE Trans Pattern Anal Mach Intell. 2020 Nov 19;PP. doi: 10.1109/TPAMI.2020.2973153.
3
Covariate shift estimation based adaptive ensemble learning for handling non-stationarity in motor imagery related EEG-based brain-computer interface.
利用多 taper 谱分析探索与任务相关的脑电图用于中年成年人跨受试者早期阿尔茨海默病易感性预测
Sensors (Basel). 2024 Dec 25;25(1):52. doi: 10.3390/s25010052.
4
EEG-based emotional valence and emotion regulation classification: a data-centric and explainable approach.基于脑电图的情绪效价和情绪调节分类:一种以数据为中心且可解释的方法。
Sci Rep. 2024 Oct 14;14(1):24046. doi: 10.1038/s41598-024-75263-x.
5
Power spectral density-based resting-state EEG classification of first-episode psychosis.基于功率谱密度的首发精神病静息态 EEG 分类。
Sci Rep. 2024 Jul 2;14(1):15154. doi: 10.1038/s41598-024-66110-0.
6
EEG-based investigation of effects of mindfulness meditation training on state and trait by deep learning and traditional machine learning.基于脑电图,通过深度学习和传统机器学习研究正念冥想训练对状态和特质的影响。
Front Hum Neurosci. 2023 Aug 31;17:1033420. doi: 10.3389/fnhum.2023.1033420. eCollection 2023.
7
EEG Signal Multichannel Frequency-Domain Ratio Indices for Drowsiness Detection Based on Multicriteria Optimization.基于多准则优化的脑电信号多通道频域比指数用于瞌睡检测。
Sensors (Basel). 2021 Oct 19;21(20):6932. doi: 10.3390/s21206932.
8
Generalized Deep Learning EEG Models for Cross-Participant and Cross-Task Detection of the Vigilance Decrement in Sustained Attention Tasks.基于广义深度学习的脑电模型在持续性注意任务警觉下降的跨被试和跨任务检测。
Sensors (Basel). 2021 Aug 20;21(16):5617. doi: 10.3390/s21165617.
9
Intelligent Biosignal Analysis Methods.智能生物信号分析方法。
Sensors (Basel). 2021 Jul 12;21(14):4743. doi: 10.3390/s21144743.
10
A Review of EEG Signal Features and their Application in Driver Drowsiness Detection Systems.脑电信号特征及其在驾驶员困倦检测系统中的应用综述。
Sensors (Basel). 2021 May 30;21(11):3786. doi: 10.3390/s21113786.
基于协变量偏移估计的自适应集成学习,用于处理基于运动想象脑电图的脑机接口中的非平稳性。
Neurocomputing (Amst). 2019 May 28;343:154-166. doi: 10.1016/j.neucom.2018.04.087.
4
Combining Inter-Subject Modeling with a Subject-Based Data Transformation to Improve Affect Recognition from EEG Signals.结合跨被试建模与基于被试的数据转换来提高 EEG 信号的情感识别。
Sensors (Basel). 2019 Jul 8;19(13):2999. doi: 10.3390/s19132999.
5
Deep learning-based electroencephalography analysis: a systematic review.基于深度学习的脑电图分析:系统评价。
J Neural Eng. 2019 Aug 14;16(5):051001. doi: 10.1088/1741-2552/ab260c.
6
Removing Confounding Factors Associated Weights in Deep Neural Networks Improves the Prediction Accuracy for Healthcare Applications.消除深度神经网络中与权重相关的混杂因素可提高医疗保健应用的预测准确性。
Pac Symp Biocomput. 2019;24:54-65.
7
Dynamical Hurst analysis identifies EEG channel differences between PTSD and healthy controls.动力 Hurst 分析确定 PTSD 患者和健康对照者之间 EEG 通道的差异。
PLoS One. 2018 Jul 3;13(7):e0199144. doi: 10.1371/journal.pone.0199144. eCollection 2018.
8
Cross-Participant EEG-Based Assessment of Cognitive Workload Using Multi-Path Convolutional Recurrent Neural Networks.基于多路径卷积循环神经网络的跨参与者 EEG 认知负荷评估。
Sensors (Basel). 2018 Apr 26;18(5):1339. doi: 10.3390/s18051339.
9
Individual differences in EEG correlates of recognition memory due to DAT polymorphisms.由于 DAT 多态性导致的 EEG 相关性识别记忆的个体差异。
Brain Behav. 2017 Nov 10;7(12):e00870. doi: 10.1002/brb3.870. eCollection 2017 Dec.
10
Driver fatigue detection through multiple entropy fusion analysis in an EEG-based system.基于脑电的多熵融合分析驾驶员疲劳检测系统。
PLoS One. 2017 Dec 8;12(12):e0188756. doi: 10.1371/journal.pone.0188756. eCollection 2017.