Aguiar Alexandra Christine de, Constantini Ana Carolina, Moraes Ronei Marcos de, Almeida Anna Alice
Programa de Pós-graduação em Modelos de Decisão e Saúde, Universidade Federal da Paraíba - UFPB - João Pessoa (PB), Brasil.
Departamento de Desenvolvimento Humano e Reabilitação, Universidade Estadual de Campinas - UNICAMP - Campinas (SP), Brasil.
Codas. 2025 Aug 4;37(4):e20240116. doi: 10.1590/2317-1782/e20240116pt. eCollection 2025.
To verify whether acoustic-prosodic measures differ across emotional states in speakers of Brazilian Portuguese (BP).
The data sample consisted of 182 audio signals produced by actors (professionals or students) in the semi-spontaneous speech task "Look at the blue plane", spoken with six emotions (joy, sadness, fear, anger, surprise, disgust) and in a neutral rendition. Acoustic-prosodic measures of duration, fundamental frequency, and intensity were extracted for each emotion. The Friedman test was used to verify whether these measures can discriminate among the emotions.
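As an illustration of the statistical step described above, the following minimal sketch applies the Friedman test to one acoustic measure arranged as a speakers-by-emotions table. It is not the authors' analysis script. The data are synthetic, the helper name `friedman_by_emotion` is hypothetical, and the layout of 26 speakers by 7 conditions is only an assumption (a balanced design that would yield 182 signals); the abstract does not report the number of speakers.

```python
import numpy as np
from scipy.stats import friedmanchisquare

# Conditions named in the abstract: six emotions plus the neutral rendition.
EMOTIONS = ["neutral", "joy", "sadness", "fear", "anger", "surprise", "disgust"]


def friedman_by_emotion(measure):
    """Run the Friedman test on a (n_speakers, n_conditions) table.

    Each row holds one speaker's values of a single acoustic measure
    (for example mean fundamental frequency), one column per condition.
    Returns the test statistic and the p-value.
    """
    measure = np.asarray(measure, dtype=float)
    if measure.ndim != 2 or measure.shape[1] < 3:
        raise ValueError("expected a 2-D table with at least 3 conditions")
    # friedmanchisquare takes one sequence per condition (repeated measures).
    columns = [measure[:, j] for j in range(measure.shape[1])]
    statistic, p_value = friedmanchisquare(*columns)
    return float(statistic), float(p_value)


# Synthetic data only: ASSUMED 26 speakers x 7 conditions, values in Hz.
rng = np.random.default_rng(0)
n_speakers = 26
offsets = np.linspace(-10.0, 10.0, len(EMOTIONS))  # made-up condition effects
synthetic_f0 = 180.0 + offsets + rng.normal(
    0.0, 15.0, size=(n_speakers, len(EMOTIONS))
)

stat, p = friedman_by_emotion(synthetic_f0)
print(f"Friedman chi-square = {stat:.2f}, p = {p:.4f}")
```

The Friedman test suits this design because every speaker contributes one value per condition (repeated measures) and the test ranks values within each speaker, so it does not assume normally distributed measures. A significant result indicates only that at least one condition differs; identifying which emotions differ would require post hoc pairwise comparisons, which the abstract does not describe.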
The acoustic-prosodic analysis revealed significant variations between emotions. Disgust stood out for having the highest utterance rate values, with higher duration values. In contrast, joy exhibited faster speech, with lower duration values and greater intensity. Sadness and fear were marked by lower intensity and lower frequencies, and fear presented the lowest positive asymmetry values for the z-score and z-smoothed measures, with less lengthening of the segments. Anger was distinguished by higher vocal intensity, while surprise recorded the highest fundamental frequency values.
The acoustic-prosodic measures proved to be effective tools for differentiating emotions in BP speakers. These parameters have great potential to discern different emotional states, broaden knowledge about vocal expressiveness, and open possibilities for emotion recognition technologies with applications in artificial intelligence and mental health.