School of Culture and Education, Shaanxi University of Science & Technology, 710021, Xi'an, Shaanxi, China.
BMC Psychol. 2024 Mar 4;12(1):121. doi: 10.1186/s40359-024-01585-0.
Psychology and English teaching are closely linked: applying psychological principles both guides specific English instruction and improves overall teaching quality. This paper takes a multimodal approach, combining image, acoustic, and text information to build a joint analysis model of English teaching interaction and psychological characteristics. An attention mechanism is introduced into the multimodal fusion process to build a model that recognizes psychological characteristics in English teaching. The first step balances the proportion of each emotion, after which the modalities are aligned. In the cross-modal stage, image, acoustic, and text features interact through a cross-modal attention mechanism. The multi-attention mechanism both strengthens the network's representational capacity and reduces model complexity. Experimental results show that the model accurately identifies five psychological characteristics. The proposed method achieves a classification accuracy of 90.40% for psychological features and 78.47% in multimodal classification. Adding the attention mechanism to feature fusion also improves the fusion effect.
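The cross-modal stage can be illustrated with a minimal sketch. The abstract does not specify the architecture, so the code below assumes generic scaled dot-product attention in which one modality (here text) supplies the queries and another (here acoustics) supplies the keys and values. The function names `softmax` and `cross_modal_attention` and the toy feature vectors are hypothetical, and the learned projection matrices a trained model would use are omitted. Plain Python is used to avoid dependencies; a real model would likely use a tensor library.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_modal_attention(queries, keys, values):
    """Scaled dot-product attention from one modality onto another.

    queries: list of T_q vectors (dimension d), e.g. text features
    keys:    list of T_k vectors (dimension d), e.g. acoustic features
    values:  list of T_k vectors (dimension d_v), same modality as keys
    Returns (attended, weights): T_q output vectors of dimension d_v and
    the T_q x T_k attention weights. Assumption: no learned projections.
    """
    scale = 1.0 / math.sqrt(len(queries[0]))
    weights = []
    for q in queries:
        # Similarity of this query step to every key step.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        weights.append(softmax(scores))
    d_v = len(values[0])
    # Each output vector is a weighted average of the value vectors.
    attended = [
        [sum(w * v[j] for w, v in zip(row, values)) for j in range(d_v)]
        for row in weights
    ]
    return attended, weights


# Toy example (made-up numbers): 2 text steps attend over 3 acoustic steps.
text_feats = [[1.0, 0.0], [0.0, 1.0]]
audio_feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused, attn = cross_modal_attention(text_feats, audio_feats, audio_feats)
```

Each row of `attn` sums to 1, so every fused text step is a convex combination of the acoustic steps. Repeating the call with other query and key/value pairings (for example image onto text) would give the pairwise interactions that a fusion layer could then combine.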