Quan Jingyu, Miyake Yoshihiro, Nozawa Takayuki
Department of Computer Science, Institute of Science Tokyo, Yokohama 226-8502, Japan.
Faculty of Engineering, University of Toyama, Toyama 930-8555, Japan.
Sensors (Basel). 2025 Jan 13;25(2):434. doi: 10.3390/s25020434.
This study investigates how interpersonal (speaker-partner) synchrony contributes to empathetic response generation in communication scenarios. To this end, we propose a model that incorporates multimodal directional (positive and negative) interpersonal synchrony, operationalized using cosine similarity, into empathetic response generation. We evaluate how incorporating each type of synchrony affects the generated responses at the language and empathy levels. In comparison experiments, models with multimodal synchrony generate responses that are closer to the ground-truth responses and more diverse than those of models without synchrony. This demonstrates that the synchrony features are successfully integrated into the models. Additionally, we find that positive synchrony is linked to enhanced emotional reactions, reduced exploration, and improved interpretation, whereas negative synchrony is associated with reduced exploration and increased interpretation. These findings shed light on the connections between multimodal directional interpersonal synchrony and the emotional and cognitive aspects of empathy in artificial intelligence applications.
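As a rough illustration of what operationalizing directional synchrony with cosine similarity could look like, the sketch below scores one speaker feature vector against one partner feature vector and splits the result into a positive and a negative component. The abstract does not specify the feature extraction, the windowing, or how the two directions are separated, so the function names (`cosine_similarity`, `directional_synchrony`) and the sign-based split are assumptions made for illustration, not the authors' implementation.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    if len(u) != len(v):
        raise ValueError("feature vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # Assumption: treat an all-zero vector as carrying no synchrony signal.
        return 0.0
    return dot / (norm_u * norm_v)


def directional_synchrony(speaker, partner):
    """Split the similarity into (positive, negative) synchrony scores.

    Assumption: positive synchrony is the similarity when it is above zero,
    and negative synchrony is its magnitude when it is below zero.
    """
    score = cosine_similarity(speaker, partner)
    return max(score, 0.0), max(-score, 0.0)


# Hypothetical per-utterance feature vectors for one modality.
positive, negative = directional_synchrony([1.0, 0.0], [-1.0, 0.0])
```

In a multimodal setting, a sketch like this would be applied once per modality, and the resulting positive and negative scores would be passed to the response generator as additional input features.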