Suppr超能文献

使用源自声音的声压变化加速度评估情绪唤醒水平和抑郁严重程度。

Evaluation of emotional arousal level and depression severity using voice-derived sound pressure change acceleration.

机构信息

Department of Bioengineering, Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.

Department of Psychiatry, National Defense Medical College, 3-2 Namiki, Tokorozawa, Saitama, 359-8513, Japan.

出版信息

Sci Rep. 2021 Jun 30;11(1):13615. doi: 10.1038/s41598-021-92982-7.

Abstract

In this research, we propose a new index of emotional arousal level using sound pressure change acceleration, called the emotional arousal level voice index (EALVI), and investigate the relationship between this index and depression severity. First, EALVI values were calculated from various speech recordings in the interactive emotional dyadic motion capture database, and the correlation with the emotional arousal level of each voice was examined. The resulting correlation coefficient was 0.52 (n = 10,039, p < 2.2 × 10). We collected a total of 178 datasets comprising 10 speech phrases and the Hamilton Rating Scale for Depression (HAM-D) score of outpatients with major depression at the Ginza Taimei Clinic (GTC) and the National Defense Medical College (NDMC) Hospital. The correlation coefficients between the EALVI and HAM-D scores were - 0.33 (n = 88, p = 1.8 × 10) and - 0.43 (n = 90, p = 2.2 × 10) at the GTC and NDMC, respectively. Next, the dataset was divided into "no depression" (HAM-D < 8) and "depression" groups (HAM-D ≥ 8) according to the HAM-D score. The number of patients in the "no depression" and "depression" groups were 10 and 78 in the GTC data, and 65 and 25 in the NDMC data, respectively. There was a significant difference in the mean EALVI values between the two groups in both the GTC and NDMC data (p = 8.9 × 10, Cliff's delta = 0.51 and p = 1.6 × 10; Cliff's delta = 0.43, respectively). The area under the curve of the receiver operating characteristic curve when discriminating both groups by EALVI was 0.76 in GTC data and 0.72 in NDMC data. Indirectly, the data suggest that there is some relationship between emotional arousal level and depression severity.

摘要

在这项研究中,我们提出了一种新的基于声压变化加速度的情感唤醒水平指数,称为情感唤醒水平语音指数(EALVI),并研究了该指数与抑郁严重程度之间的关系。首先,从交互情感对偶运动捕捉数据库中的各种语音记录中计算出 EALVI 值,并检查了与每个语音的情感唤醒水平的相关性。得到的相关系数为 0.52(n=10039,p<2.2×10)。我们总共收集了 178 个数据集,其中包括银座大明诊所(GTC)和国防医科大学医院(NDMC)的 10 个语音短语和门诊重度抑郁症患者的汉密尔顿抑郁量表(HAM-D)评分。EALVI 与 HAM-D 评分之间的相关系数分别为 -0.33(n=88,p=1.8×10)和 -0.43(n=90,p=2.2×10)在 GTC 和 NDMC。接下来,根据 HAM-D 评分,将数据集分为“无抑郁”(HAM-D<8)和“抑郁”(HAM-D≥8)组。在 GTC 数据中,“无抑郁”和“抑郁”组的患者人数分别为 10 人和 78 人,NDMC 数据分别为 65 人和 25 人。在 GTC 和 NDMC 数据中,两组之间的平均 EALVI 值存在显著差异(p=8.9×10,Cliff's delta=0.51 和 p=1.6×10;Cliff's delta=0.43,分别)。EALVI 区分两组时,ROC 曲线下面积在 GTC 数据中为 0.76,在 NDMC 数据中为 0.72。间接地,这些数据表明情感唤醒水平和抑郁严重程度之间存在一定的关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c86b/8245525/aa3ac38a1e44/41598_2021_92982_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验