Swartz Center for Computational Neuroscience, Institute for Neural Computation, University of California San Diego, La Jolla, CA, USA; Center for Advanced Neurological Engineering, Institute of Engineering in Medicine, University of California San Diego, La Jolla, CA, USA.
Music and Audio Computing Lab, Research Center for IT Innovation, Academia Sinica, Taipei, Taiwan.
Front Neurosci. 2014 May 1;8:94. doi: 10.3389/fnins.2014.00094. eCollection 2014.
Electroencephalography (EEG)-based emotion classification during music listening has gained increasing attention owing to its promise for applications such as musical affective brain-computer interfaces (ABCI), neuromarketing, music therapy, and implicit multimedia tagging and triggering. However, music is an ecologically valid and complex stimulus that conveys emotions to listeners through compositions of musical elements, and distinguishing emotions from EEG signals alone remains challenging. This study assessed the applicability of a multimodal approach that leverages EEG dynamics together with the acoustic characteristics of the musical content for the classification of emotional valence and arousal. To this end, machine-learning methods were adopted to systematically elucidate the roles of the EEG and music modalities in emotion modeling. The empirical results suggested that when whole-head EEG signals were available, the inclusion of musical content did not improve classification performance: the accuracy of 74-76% obtained using the EEG modality alone was statistically comparable to that of the multimodal approach. However, when EEG dynamics were available only from a small set of electrodes (likely the case in real-life applications), the music modality played a complementary role, augmenting the EEG results from around 61% to around 67% in valence classification and from around 58% to around 67% in arousal classification. Musical timbre appeared to replace less-discriminative EEG features and led to improvements in both valence and arousal classification, whereas musical loudness contributed specifically to arousal classification. The present study not only provided principles for constructing an EEG-based multimodal approach, but also revealed fundamental insights into the interplay of brain activity and musical content in emotion modeling.
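To make the multimodal idea concrete, the sketch below illustrates feature-level fusion as described in the abstract: EEG-derived features (e.g., band power from a small set of electrodes) are concatenated with acoustic descriptors of the music (e.g., timbre- and loudness-related features), and a standard classifier predicts binary valence or arousal. This is a minimal illustration under assumed choices (random placeholder features, a linear SVM, 10-fold cross-validation); it is not the authors' exact pipeline or feature set.

```python
# Hypothetical sketch of EEG + music feature-level fusion for binary emotion
# classification. Feature dimensions, the linear SVM, and the synthetic data
# are illustrative assumptions, not the paper's actual implementation.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n_trials = 200                                      # hypothetical music-listening trials

# Placeholder features standing in for real extractions:
eeg_features = rng.normal(size=(n_trials, 20))      # e.g., band power from a few electrodes
music_features = rng.normal(size=(n_trials, 12))    # e.g., timbre/loudness descriptors
valence_labels = rng.integers(0, 2, size=n_trials)  # binary high/low valence

def evaluate(features, labels):
    """10-fold cross-validated accuracy of a linear SVM on z-scored features."""
    clf = make_pipeline(StandardScaler(), SVC(kernel="linear"))
    return cross_val_score(clf, features, labels, cv=10).mean()

# Compare the EEG-only model against the fused (EEG + music) model.
acc_eeg_only = evaluate(eeg_features, valence_labels)
acc_fused = evaluate(np.hstack([eeg_features, music_features]), valence_labels)
print(f"EEG only : {acc_eeg_only:.2f}")
print(f"EEG+music: {acc_fused:.2f}")
```

With real data, the comparison of the two cross-validated accuracies is what distinguishes the whole-head case (fusion adds little) from the few-electrode case (fusion helps), as reported in the abstract.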