Department of Electrical and Computer Engineering, Air Force Institute of Technology, Wright-Patterson AFB, OH 45433, USA.
Sensors (Basel). 2021 May 6;21(9):3225. doi: 10.3390/s21093225.
EEG-based deep learning models have trended toward models that are designed to perform classification on any individual (cross-participant models). However, because EEG varies across participants due to non-stationarity and individual differences, certain guidelines must be followed for partitioning data into training, validation, and testing sets, in order for cross-participant models to avoid overestimation of model accuracy. Despite this necessity, the majority of EEG-based cross-participant models have not adopted such guidelines. Furthermore, some data repositories may unwittingly contribute to the problem by providing partitioned test and non-test datasets for reasons such as competition support. In this study, we demonstrate how improper dataset partitioning and the resulting improper training, validation, and testing of a cross-participant model leads to overestimated model accuracy. We demonstrate this mathematically, and empirically, using five publicly available datasets. To build the cross-participant models for these datasets, we replicate published results and demonstrate how the model accuracies are significantly reduced when proper EEG cross-participant model guidelines are followed. Our empirical results show that by not following these guidelines, error rates of cross-participant models can be underestimated between 35% and 3900%. This misrepresentation of model performance for the general population potentially slows scientific progress toward truly high-performing classification models.
基于 EEG 的深度学习模型倾向于设计为对任何个体进行分类的模型(跨参与者模型)。然而,由于 EEG 因非平稳性和个体差异而在参与者之间发生变化,因此必须遵循某些准则将数据划分为训练集、验证集和测试集,以便跨参与者模型避免过度估计模型准确性。尽管有此必要,但大多数基于 EEG 的跨参与者模型并未采用此类准则。此外,一些数据存储库可能由于竞争支持等原因,无意中通过提供分区的测试和非测试数据集来促成该问题。在这项研究中,我们展示了不当的数据分区以及由此导致的跨参与者模型的不当训练、验证和测试如何导致模型准确性被高估。我们通过数学和实证方法证明了这一点,使用了五个公开可用的数据集。为了为这些数据集构建跨参与者模型,我们复制了已发布的结果,并展示了在遵循适当的 EEG 跨参与者模型准则时,模型准确性如何显著降低。我们的实证结果表明,如果不遵循这些准则,跨参与者模型的错误率可能会低估 35%至 3900%。这种对模型性能在一般人群中的代表性不足,可能会减缓朝着真正高性能分类模型的科学进展。