Suppr超能文献

利用韵律和词汇信息学习心理治疗中的话语层面行为。

Using Prosodic and Lexical Information for Learning Utterance-level Behaviors in Psychotherapy.

作者信息

Singla Karan, Chen Zhuohao, Flemotomos Nikolaos, Gibson James, Can Dogan, Atkins David C, Narayanan Shrikanth

机构信息

Signal Analysis and Interpretation Lab, University of Southern California, Los Angeles, CA, USA.

Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA.

出版信息

Interspeech. 2018 Sep;2018:3413-3417. doi: 10.21437/interspeech.2018-2551.

Abstract

In this paper, we present an approach for predicting utterance level behaviors in psychotherapy sessions using both speech and lexical features. We train long short term memory (LSTM) networks with an attention mechanism using words, both manually and automatically transcribed, and prosodic features, at the word level, to predict the annotated behaviors. We demonstrate that prosodic features provide discriminative information relevant to the behavior task and show that they improve prediction when fused with automatically derived lexical features. Additionally, we investigate the weights of the attention mechanism to determine words and prosodic patterns which are of importance to the behavior prediction task.

摘要

在本文中,我们提出了一种利用语音和词汇特征预测心理治疗会话中话语级行为的方法。我们使用带有注意力机制的长短期记忆(LSTM)网络,通过人工转录和自动转录的单词以及单词级的韵律特征来训练,以预测带注释的行为。我们证明韵律特征提供了与行为任务相关的判别信息,并表明当与自动提取的词汇特征融合时,它们能提高预测效果。此外,我们研究了注意力机制的权重,以确定对行为预测任务重要的单词和韵律模式。

相似文献

1
Using Prosodic and Lexical Information for Learning Utterance-level Behaviors in Psychotherapy.
Interspeech. 2018 Sep;2018:3413-3417. doi: 10.21437/interspeech.2018-2551.
2
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence.
IEEE Trans Audio Speech Lang Process. 2008 Jan;16(1):216-228. doi: 10.1109/TASL.2007.907570.
4
Can prosody aid the automatic classification of dialog acts in conversational speech?
Lang Speech. 1998 Jul-Dec;41 ( Pt 3-4):443-92. doi: 10.1177/002383099804100410.
7
Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions.
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:1836-1839. doi: 10.1109/EMBC46164.2021.9629694.
9
Prosodic and lexical aspects of maternal linguistic input to late-talking toddlers.
Int J Lang Commun Disord. 2006 May-Jun;41(3):293-311. doi: 10.1080/13682820500342976.

引用本文的文献

1
Association of machine-learning-rated supportive counseling skills with psychotherapy outcome.
J Consult Clin Psychol. 2025 Feb;93(2):110-119. doi: 10.1037/ccp0000935.
3
Natural language processing for mental health interventions: a systematic review and research framework.
Transl Psychiatry. 2023 Oct 6;13(1):309. doi: 10.1038/s41398-023-02592-2.
4
Towards End-2-end Learning for Predicting Behavior Codes from Spoken Utterances in Psychotherapy Conversations.
Proc Conf Assoc Comput Linguist Meet. 2020 Jul;2020:3797-3803. doi: 10.18653/v1/2020.acl-main.351.
5
Multi-label Multi-task Deep Learning for Behavioral Coding.
IEEE Trans Affect Comput. 2022 Jan-Mar;13(1):508-518. doi: 10.1109/taffc.2019.2952113. Epub 2019 Nov 8.
6
IMPROVING THE PREDICTION OF THERAPIST BEHAVIORS IN ADDICTION COUNSELING BY EXPLOITING CLASS CONFUSIONS.
Proc IEEE Int Conf Acoust Speech Signal Process. 2019 May;2019:6605-6609. doi: 10.1109/icassp.2019.8682885. Epub 2019 Apr 17.
7
An Automated Quality Evaluation Framework of Psychotherapy Conversations with Local Quality Estimates.
Comput Speech Lang. 2022 Sep;75. doi: 10.1016/j.csl.2022.101380. Epub 2022 Mar 28.
8
Automated quality assessment of cognitive behavioral therapy sessions through highly contextualized language representations.
PLoS One. 2021 Oct 22;16(10):e0258639. doi: 10.1371/journal.pone.0258639. eCollection 2021.
9
Automated evaluation of psychotherapy skills using speech and language technologies.
Behav Res Methods. 2022 Apr;54(2):690-711. doi: 10.3758/s13428-021-01623-4. Epub 2021 Aug 3.
10
Multimodal Automatic Coding of Client Behavior in Motivational Interviewing.
Proc ACM Int Conf Multimodal Interact. 2020 Oct;2020:406-413. doi: 10.1145/3382507.3418853.

本文引用的文献

1
A technology prototype system for rating therapist empathy from audio recordings in addiction counseling.
PeerJ Comput Sci. 2016 Apr;2. doi: 10.7717/peerj-cs.59. Epub 2016 Apr 20.
2
A Comparison of Natural Language Processing Methods for Automated Coding of Motivational Interviewing.
J Subst Abuse Treat. 2016 Jun;65:43-50. doi: 10.1016/j.jsat.2016.01.006. Epub 2016 Jan 28.
3
pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis.
PLoS One. 2015 Dec 11;10(12):e0144610. doi: 10.1371/journal.pone.0144610. eCollection 2015.
4
"Rate My Therapist": Automated Detection of Empathy in Drug and Alcohol Counseling via Speech and Language Processing.
PLoS One. 2015 Dec 2;10(12):e0143055. doi: 10.1371/journal.pone.0143055. eCollection 2015.
7
8
Long short-term memory.
Neural Comput. 1997 Nov 15;9(8):1735-80. doi: 10.1162/neco.1997.9.8.1735.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验