Zhu Xianxun, Liu Zhaozhao, Cambria Erik, Yu Xiaohan, Fan Xuhui, Chen Hui, Wang Rui
School of Communication and Information Engineering, Shanghai University, 200444, Shanghai, China.
College of Computing and Data Science, Nanyang Technological University, 639798, Singapore.
Comput Methods Programs Biomed. 2025 Mar;260:108564. doi: 10.1016/j.cmpb.2024.108564. Epub 2024 Dec 24.
In the current global health landscape, there is an increasing demand for rapid and accurate assessment of mental states. Traditional assessment methods typically rely on face-to-face interactions, which are not only time-consuming but also highly subjective. Addressing this issue, this study aims to develop a client-server-based, non-contact multimodal emotion and behavior recognition system to enhance the efficiency and accuracy of mental state assessments.
This study designed and implemented a multimodal assessment system integrating voice, text, facial expressions, and body movements. Utilizing a client-server architecture, the system optimizes diagnostic efficiency and decision-making accuracy through an intuitive visual interface. The system's effectiveness was validated and tested in actual hospital settings.
The system demonstrated exceptional performance in multimodal emotion and behavior recognition, achieving a voice recognition accuracy of 92.01%, facial expression recognition accuracy of 91.3%, and an overall multimodal assessment accuracy of 77.9%. Moreover, it reached a behavior analysis accuracy of 94.5%.
The multimodal assessment system developed in this study significantly enhances the accuracy and efficiency of mental state assessments, meeting the needs of clinicians for precise and rapid diagnostics in real-world settings.
在当前全球卫生格局下,对精神状态进行快速准确评估的需求日益增长。传统评估方法通常依赖面对面交流,这不仅耗时,而且主观性很强。为解决这一问题,本研究旨在开发一种基于客户端-服务器的非接触式多模态情感与行为识别系统,以提高精神状态评估的效率和准确性。
本研究设计并实现了一个集成语音、文本、面部表情和身体动作的多模态评估系统。该系统采用客户端-服务器架构,通过直观的视觉界面优化诊断效率和决策准确性。该系统的有效性在实际医院环境中得到了验证和测试。
该系统在多模态情感与行为识别方面表现出色,语音识别准确率达到92.01%,面部表情识别准确率达到91.3%,整体多模态评估准确率达到77.9%。此外,其行为分析准确率达到94.5%。
本研究开发的多模态评估系统显著提高了精神状态评估的准确性和效率,满足了临床医生在实际环境中进行精确快速诊断的需求。