Esposito Anna, Esposito Antonietta M
Department of Psychology, Second University of Naples, Caserta, Italy.
Cogn Process. 2012 Oct;13 Suppl 2:541-50. doi: 10.1007/s10339-012-0516-2. Epub 2012 Aug 8.
Human beings seem to be able to recognize emotions from speech very well and information communication technology aims to implement machines and agents that can do the same. However, to be able to automatically recognize affective states from speech signals, it is necessary to solve two main technological problems. The former concerns the identification of effective and efficient processing algorithms capable of capturing emotional acoustic features from speech sentences. The latter focuses on finding computational models able to classify, with an approximation as good as human listeners, a given set of emotional states. This paper will survey these topics and provide some insights for a holistic approach to the automatic analysis, recognition and synthesis of affective states.
人类似乎能够很好地从语音中识别情感,而信息通信技术旨在实现能够做到同样事情的机器和智能体。然而,要能够从语音信号中自动识别情感状态,有两个主要技术问题需要解决。前者涉及识别能够从语音句子中捕捉情感声学特征的有效且高效的处理算法。后者则专注于找到能够以与人类听众相当的近似度对给定的一组情感状态进行分类的计算模型。本文将对这些主题进行综述,并为情感状态的自动分析、识别和合成提供一种整体方法的一些见解。