IEEE J Biomed Health Inform. 2020 May;24(5):1255-1264. doi: 10.1109/JBHI.2019.2934172. Epub 2019 Aug 9.
State-of-the-art electroencephalogram (EEG)-based emotion-classification works indicate that a personalized model may not be well exploited until sufficient labeled data are available, given a substantial EEG non-stationarity over days. However, it is impractical to impose a labor-intensive, time-consuming multiple-day data collection. This study proposes a robust principal component analysis (RPCA)-embedded transfer learning (TL) to generate a personalized cross-day model with less labeled data, while obviating intra- and inter-individual differences. Upon the add-session-in validation on two datasets MDME (five-day data of 12 subjects) and SDMN (single-day data of 26 subjects), the experimental results showed that TL enabled the classifier of an MDME individual (using his/her 1st-day session only) to improve progressively in valence and arousal classification by adding similar source sessions (SSs) via the within-dataset TL (wdTL) and cross-dataset TL (cdTL) manners. When recruiting three SSs to test on the 5th-day session, the wdTL improvement (valence: 11.19%, arousal: 5.82%) marginally outperformed the subject-dependent (SD) counterpart (valence: 9.75%, arousal: 3.77%) that was obtained using their own 2nd-4th-day sessions only. The cdTL returned a similar trend in valence (8.35%), yet it was less effective in arousal (0.81%). Most importantly, such cross-day enhancements did not occur in either SD or TL scenarios until RPCA processing. This work sheds light on how to construct a personalized model by leveraging ever-growing EEG repositories.
基于脑电图(EEG)的情绪分类的最新研究表明,在 EEG 存在显著非平稳性的情况下,每天需要大量的标注数据才能充分利用个性化模型。然而,进行多日的劳动密集型、耗时的数据采集是不切实际的。本研究提出了一种稳健的主成分分析(RPCA)嵌入的迁移学习(TL)方法,以利用较少的标注数据生成个性化的跨日模型,同时消除个体内和个体间的差异。在两个数据集 MDME(12 个被试者的五日数据)和 SDMN(26 个被试者的单日数据)的添加会话验证中,实验结果表明,TL 使 MDME 个体的分类器(仅使用他/她的第一天会话)能够通过添加相似源会话(SS)通过内部数据集 TL(wdTL)和跨数据集 TL(cdTL)方式逐步提高效价和唤醒分类。当招募三个 SS 来测试第 5 天的会话时,wdTL 改进(效价:11.19%,唤醒:5.82%)略优于仅使用他们自己的第 2-4 天会话的基于个体的(SD)对应物(效价:9.75%,唤醒:3.77%)。cdTL 在效价方面呈现出类似的趋势(8.35%),但在唤醒方面的效果较弱(0.81%)。最重要的是,这种跨日增强在 RPCA 处理之前不会在 SD 或 TL 场景中发生。这项工作阐明了如何通过利用不断增长的 EEG 存储库构建个性化模型。