Department of Electronic and Computer Engineering, National Taiwan University of Science and Technology, Taipei 106, Taiwan.
Department of Humanities and Social Sciences, National Taiwan University of Science and Technology, Taipei 106, Taiwan.
Sensors (Basel). 2022 Jan 20;22(3):777. doi: 10.3390/s22030777.
Music can generate a positive effect in runners' performance and motivation. However, the practical implementation of music intervention during exercise is mostly absent from the literature. Therefore, this paper designs a playback sequence system for joggers by considering music emotion and physiological signals. This playback sequence is implemented by a music selection module that combines artificial intelligence techniques with physiological data and emotional music. In order to make the system operate for a long time, this paper improves the model and selection music module to achieve lower energy consumption. The proposed model obtains fewer FLOPs and parameters by using logarithm scaled Mel-spectrogram as input features. The accuracy, computational complexity, trainable parameters, and inference time are evaluated on the Bi-modal, 4Q emotion, and Soundtrack datasets. The experimental results show that the proposed model is better than that of Sarkar et al. and achieves competitive performance on Bi-modal (84.91%), 4Q emotion (92.04%), and Soundtrack (87.24%) datasets. More specifically, the proposed model reduces the computational complexity and inference time while maintaining the classification accuracy, compared to other models. Moreover, the size of the proposed model for network training is small, which can be applied to mobiles and other devices with limited computing resources. This study designed the overall playback sequence system by considering the relationship between music emotion and physiological situation during exercise. The playback sequence system can be adopted directly during exercise to improve users' exercise efficiency.
音乐可以对跑步者的表现和动力产生积极影响。然而,运动中音乐干预的实际实施在文献中大多是缺失的。因此,本文通过考虑音乐情感和生理信号,为慢跑者设计了一个播放序列系统。这个播放序列是通过一个音乐选择模块来实现的,该模块结合了人工智能技术、生理数据和情感音乐。为了使系统能够长时间运行,本文对模型和选择音乐模块进行了改进,以实现更低的能耗。所提出的模型通过使用对数标度梅尔频谱图作为输入特征,获得更少的 FLOPs 和参数。在 Bi-modal、4Q 情感和 Soundtrack 数据集上评估了准确性、计算复杂度、可训练参数和推理时间。实验结果表明,与 Sarkar 等人的模型相比,所提出的模型在 Bi-modal(84.91%)、4Q 情感(92.04%)和 Soundtrack(87.24%)数据集上具有更好的性能。更具体地说,与其他模型相比,所提出的模型在保持分类准确性的同时,降低了计算复杂度和推理时间。此外,所提出的模型用于网络训练的大小较小,可应用于计算资源有限的移动设备和其他设备。本研究通过考虑运动中音乐情感和生理状态之间的关系,设计了整体播放序列系统。该播放序列系统可以在运动中直接采用,以提高用户的运动效率。