Institute of Psychology, Chinese Academy of Sciences, Beijing, China.
University of Chinese Academy of Sciences, Beijing, China.
PLoS One. 2019 Jun 20;14(6):e0218172. doi: 10.1371/journal.pone.0218172. eCollection 2019.
A large proportion of Depression Disorder patients do not receive an effective diagnosis, which makes it necessary to find a more objective assessment to facilitate a more rapid and accurate diagnosis of depression. Speech data is easy to acquire clinically, its association with depression has been studied, although the actual predictive effect of voice features has not been examined. Thus, we do not have a general understanding of the extent to which voice features contribute to the identification of depression. In this study, we investigated the significance of the association between voice features and depression using binary logistic regression, and the actual classification effect of voice features on depression was re-examined through classification modeling. Nearly 1000 Chinese females participated in this study. Several different datasets was included as test set. We found that 4 voice features (PC1, PC6, PC17, PC24, P<0.05, corrected) made significant contribution to depression, and that the contribution effect of the voice features alone reached 35.65% (Nagelkerke's R2). In classification modeling, voice data based model has consistently higher predicting accuracy(F-measure) than the baseline model of demographic data when tested on different datasets, even across different emotion context. F-measure of voice features alone reached 81%, consistent with existing data. These results demonstrate that voice features are effective in predicting depression and indicate that more sophisticated models based on voice features can be built to help in clinical diagnosis.
大量的抑郁症患者未得到有效诊断,这使得我们有必要寻找更客观的评估方法,以实现更快速、准确的抑郁诊断。语音数据在临床上易于获取,其与抑郁的相关性已经过研究,尽管语音特征的实际预测效果尚未得到检验。因此,我们并不完全了解语音特征在多大程度上有助于识别抑郁症。在这项研究中,我们使用二元逻辑回归研究了语音特征与抑郁症之间的关联的重要性,并通过分类建模重新检验了语音特征对抑郁症的实际分类效果。近 1000 名中国女性参与了这项研究。该研究包含了多个不同的数据集作为测试集。我们发现 4 个语音特征(PC1、PC6、PC17、PC24,P<0.05,校正后)对抑郁症有显著贡献,并且语音特征的单独贡献效果达到 35.65%(Nagelkerke 的 R2)。在分类建模中,基于语音数据的模型在不同数据集上进行测试时,其预测准确率(F 值)始终高于基于人口统计学数据的基线模型,即使在不同的情绪背景下也是如此。仅语音特征的 F 值就达到了 81%,与现有数据一致。这些结果表明,语音特征在预测抑郁症方面是有效的,并表明可以构建基于语音特征的更复杂模型,以帮助临床诊断。