• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用心理健康热线咨询师声音识别同理心的机器学习方法:算法开发与验证

Machine Learning Approach to Identifying Empathy Using the Vocals of Mental Health Helpline Counselors: Algorithm Development and Validation.

作者信息

Sanjeewa Ruvini, Iyer Ravi, Apputhurai Pragalathan, Wickramasinghe Nilmini, Meyer Denny

机构信息

School of Health Sciences, Swinburne University of Technology, Hawthorn, PO Box 218, John Street, Melbourne, 3122, Australia, 61 422587030.

School of Computing, Engineering & Mathematical Sciences, La Trobe University, Melbourne, Australia.

出版信息

JMIR Form Res. 2025 Apr 16;9:e67835. doi: 10.2196/67835.

DOI:10.2196/67835
PMID:40239193
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12017608/
Abstract

BACKGROUND

This research study aimed to detect the vocal features immersed in empathic counselor speech using samples of calls to a mental health helpline service.

OBJECTIVE

This study aimed to produce an algorithm for the identification of empathy from these features, which could act as a training guide for counselors and conversational agents who need to transmit empathy in their vocals.

METHODS

Two annotators with a psychology background and English heritage provided empathy ratings for 57 calls involving female counselors, as well as multiple short call segments within each of these calls. These ratings were found to be well-correlated between the 2 raters in a sample of 6 common calls. Using vocal feature extraction from call segments and statistical variable selection methods, such as L1 penalized LASSO (Least Absolute Shrinkage and Selection Operator) and forward selection, a total of 14 significant vocal features were associated with empathic speech. Generalized additive mixed models (GAMM), binary logistics regression with splines, and random forest models were used to obtain an algorithm that differentiated between high- and low-empathy call segments.

RESULTS

The binary logistics regression model reported higher predictive accuracies of empathy (area under the curve [AUC]=0.617, 95% CI 0.613-0.622) compared to the GAMM (AUC=0.605, 95% CI 0.601-0.609) and the random forest model (AUC=0.600, 95% CI 0.595-0.604). This difference was statistically significant, as evidenced by the nonoverlapping 95% CIs obtained for AUC. The DeLong test further validated these results, showing a significant difference in the binary logistic model compared to the random forest (D=6.443, df=186283, P<.001) and GAMM (Z=5.846, P<.001). These findings confirm that the binary logistic regression model outperforms the other 2 models concerning predictive accuracy for empathy classification.

CONCLUSIONS

This study suggests that the identification of empathy from vocal features alone is challenging, and further research involving multimodal models (eg, models incorporating facial expression, words used, and vocal features) are encouraged for detecting empathy in the future. This study has several limitations, including a relatively small sample of calls and only 2 empathy raters. Future research should focus on accommodating multiple raters with varied backgrounds to explore these effects on perceptions of empathy. Additionally, considering counselor vocals from larger, more heterogeneous populations, including mixed-gender samples, will allow an exploration of the factors influencing the level of empathy projected in counselor voices more generally.

摘要

背景

本研究旨在利用心理健康热线服务的通话样本,检测共情咨询师言语中所蕴含的语音特征。

目的

本研究旨在生成一种从这些特征中识别共情的算法,该算法可作为咨询师和对话代理在语音中传递共情时的训练指南,这些咨询师和对话代理需要在语音中传递共情。

方法

两名具有心理学背景和英语文化背景的注释者对57个涉及女性咨询师的通话以及每个通话中的多个短通话片段进行了共情评分。在6个常见通话样本中,发现这两名评分者的评分具有良好的相关性。通过从通话片段中提取语音特征以及采用统计变量选择方法,如L1惩罚最小绝对收缩和选择算子(LASSO)和向前选择,共确定了14个与共情言语相关的重要语音特征。使用广义相加混合模型(GAMM)、带样条的二元逻辑回归和随机森林模型来获得一种能够区分高共情和低共情通话片段的算法。

结果

与GAMM(曲线下面积[AUC]=0.605,95%可信区间0.601 - 0.609)和随机森林模型(AUC=0.600,95%可信区间0.595 - 0.604)相比,二元逻辑回归模型在共情预测准确性方面表现更高(AUC=0.617,95%可信区间0.613 - 0.622)。这种差异具有统计学意义,AUC的95%可信区间不重叠证明了这一点。DeLong检验进一步验证了这些结果,显示二元逻辑模型与随机森林模型(D = 6.443,自由度 = 186283,P <.001)和GAMM(Z = 5.846,P <.001)相比存在显著差异。这些发现证实,在共情分类的预测准确性方面,二元逻辑回归模型优于其他两个模型。

结论

本研究表明,仅从语音特征识别共情具有挑战性,未来鼓励开展涉及多模态模型(例如,结合面部表情、用词和语音特征的模型)的进一步研究以检测共情。本研究存在若干局限性,包括通话样本相对较小且仅有两名共情评分者。未来的研究应集中于纳入背景各异的多名评分者,以探索这些因素对共情感知的影响。此外,考虑来自更大、更具异质性群体(包括不同性别样本)的咨询师语音,将更全面地探索影响咨询师语音中所投射的共情水平的因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fd5/12017608/e3d4a9ef8c70/formative-v9-e67835-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fd5/12017608/d4bbb6bb816e/formative-v9-e67835-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fd5/12017608/e3d4a9ef8c70/formative-v9-e67835-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fd5/12017608/d4bbb6bb816e/formative-v9-e67835-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7fd5/12017608/e3d4a9ef8c70/formative-v9-e67835-g002.jpg

相似文献

1
Machine Learning Approach to Identifying Empathy Using the Vocals of Mental Health Helpline Counselors: Algorithm Development and Validation.使用心理健康热线咨询师声音识别同理心的机器学习方法:算法开发与验证
JMIR Form Res. 2025 Apr 16;9:e67835. doi: 10.2196/67835.
2
Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study.利用声音特征对成人求助热线来电者的心理困扰进行分类:回顾性观察研究。
JMIR Form Res. 2022 Dec 19;6(12):e42249. doi: 10.2196/42249.
3
Empathic Conversational Agent Platform Designs and Their Evaluation in the Context of Mental Health: Systematic Review.共情式对话代理平台设计及其在心理健康领域的评估:系统评价。
JMIR Ment Health. 2024 Sep 9;11:e58974. doi: 10.2196/58974.
4
Acoustic Features for Identifying Suicide Risk in Crisis Hotline Callers: Machine Learning Approach.通过声学特征识别危机热线来电者自杀风险:机器学习方法
J Med Internet Res. 2025 Apr 14;27:e67772. doi: 10.2196/67772.
5
Perception of Empathy in Mental Health Care Through Voice-Based Conversational Agent Prototypes: Experimental Study.通过基于语音的对话代理原型感知心理健康护理中的同理心:实验研究
JMIR Form Res. 2025 May 7;9:e69329. doi: 10.2196/69329.
6
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
7
[Construction of recognition models for subthreshold depression based on multiple machine learning algorithms and vocal emotional characteristics].基于多种机器学习算法和嗓音情感特征的阈下抑郁症识别模型构建
Nan Fang Yi Ke Da Xue Xue Bao. 2025 Apr 20;45(4):711-717. doi: 10.12122/j.issn.1673-4254.2025.04.05.
8
Development and Validation of a Machine Learning Method Using Vocal Biomarkers for Identifying Frailty in Community-Dwelling Older Adults: Cross-Sectional Study.使用声音生物标志物识别社区老年人虚弱状态的机器学习方法的开发与验证:横断面研究
JMIR Med Inform. 2025 Jan 16;13:e57298. doi: 10.2196/57298.
9
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
10
Automated Behavioral Coding to Enhance the Effectiveness of Motivational Interviewing in a Chat-Based Suicide Prevention Helpline: Secondary Analysis of a Clinical Trial.自动行为编码以提高基于聊天的自杀预防热线中动机性访谈的有效性:一项临床试验的二次分析
J Med Internet Res. 2024 Aug 1;26:e53562. doi: 10.2196/53562.

本文引用的文献

1
Empathic Conversational Agent Platform Designs and Their Evaluation in the Context of Mental Health: Systematic Review.共情式对话代理平台设计及其在心理健康领域的评估:系统评价。
JMIR Ment Health. 2024 Sep 9;11:e58974. doi: 10.2196/58974.
2
Extending the DeLong algorithm for comparing areas under correlated receiver operating characteristic curves with missing data.扩展 DeLong 算法以比较具有缺失数据的相关接受者操作特征曲线下的面积。
Stat Med. 2024 Sep 20;43(21):4148-4162. doi: 10.1002/sim.10172. Epub 2024 Jul 16.
3
The telephone nursing dialogue process: an integrative review.
电话护理对话过程:一项综合综述。
BMC Nurs. 2023 Sep 28;22(1):345. doi: 10.1186/s12912-023-01509-0.
4
Class imbalance should not throw you off balance: Choosing the right classifiers and performance metrics for brain decoding with imbalanced data.不要被类别不平衡问题困扰:选择合适的分类器和性能指标,对不平衡数据进行脑解码。
Neuroimage. 2023 Aug 15;277:120253. doi: 10.1016/j.neuroimage.2023.120253. Epub 2023 Jun 28.
5
Using Vocal Characteristics To Classify Psychological Distress in Adult Helpline Callers: Retrospective Observational Study.利用声音特征对成人求助热线来电者的心理困扰进行分类:回顾性观察研究。
JMIR Form Res. 2022 Dec 19;6(12):e42249. doi: 10.2196/42249.
6
COVID-19 related innovation in Aotearoa/New Zealand mental health helplines and telehealth providers - mapping solutions and discussing sustainability from the perspective of service providers.新西兰心理健康求助热线和远程医疗服务提供商在新冠疫情期间的创新举措——从服务提供商的角度梳理解决方案并探讨可持续性。
Front Psychiatry. 2022 Aug 30;13:973261. doi: 10.3389/fpsyt.2022.973261. eCollection 2022.
7
Using Voice Biomarkers to Classify Suicide Risk in Adult Telehealth Callers: Retrospective Observational Study.利用语音生物标志物对成人远程医疗呼叫者的自杀风险进行分类:回顾性观察研究。
JMIR Ment Health. 2022 Aug 15;9(8):e39807. doi: 10.2196/39807.
8
Defining clinical empathy: a grounded theory approach from the perspective of healthcare workers and patients in a multicultural setting.从多元文化背景下医护人员和患者的角度出发,用扎根理论定义临床同理心。
BMJ Open. 2021 Sep 14;11(9):e045224. doi: 10.1136/bmjopen-2020-045224.
9
The Impact of COVID-19 on Telemedicine Utilization Across Multiple Service Lines in the United States.2019年冠状病毒病对美国多个服务领域远程医疗利用情况的影响
Healthcare (Basel). 2020 Oct 1;8(4):380. doi: 10.3390/healthcare8040380.
10
From economic crisis to the COVID-19 pandemic crisis: evidence from a mental health helpline in Greece.从经济危机到新冠疫情危机:来自希腊一条心理健康求助热线的证据。
Eur Arch Psychiatry Clin Neurosci. 2021 Mar;271(2):407-409. doi: 10.1007/s00406-020-01165-4. Epub 2020 Jul 14.