使用随机森林的监督式说话人分割：一种心理治疗过程研究工具。

Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research.

作者信息

Fürer Lukas, Schenk Nathalie, Roth Volker, Steppan Martin, Schmeck Klaus, Zimmermann Ronan

机构信息

Clinic for Children and Adolescents, University Psychiatric Clinic, Basel, Switzerland.

Department of Mathematics and Computer Science, University of Basel, Basel, Switzerland.

出版信息

Front Psychol. 2020 Jul 28;11:1726. doi: 10.3389/fpsyg.2020.01726. eCollection 2020.

DOI:10.3389/fpsyg.2020.01726

PMID:32849033

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7399377/

Abstract

Speaker diarization is the practice of determining who speaks when in audio recordings. Psychotherapy research often relies on labor intensive manual diarization. Unsupervised methods are available but yield higher error rates. We present a method for supervised speaker diarization based on random forests. It can be considered a compromise between commonly used labor-intensive manual coding and fully automated procedures. The method is validated using the EMRAI synthetic speech corpus and is made publicly available. It yields low diarization error rates (M: 5.61%, STD: 2.19). Supervised speaker diarization is a promising method for psychotherapy research and similar fields.

摘要

说话人分割是指在音频记录中确定谁在何时说话的实践。心理治疗研究通常依赖于劳动强度大的人工分割。虽然有非监督方法可用，但错误率较高。我们提出了一种基于随机森林的监督说话人分割方法。它可以被视为常用的劳动密集型手动编码和全自动程序之间的一种折衷。该方法使用EMRAI合成语音语料库进行了验证，并已公开提供。它产生的分割错误率较低（平均值：5.61%，标准差：2.19）。监督说话人分割是心理治疗研究和类似领域一种很有前景的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c79/7399377/20a022206a5f/fpsyg-11-01726-g001.jpg

相似文献

Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research.使用随机森林的监督式说话人分割：一种心理治疗过程研究工具。

Front Psychol. 2020 Jul 28;11:1726. doi: 10.3389/fpsyg.2020.01726. eCollection 2020.

Multimodal Speaker Diarization Using a Pre-Trained Audio-Visual Synchronization Model.基于预训练的视听同步模型的多模态说话人分割。

Sensors (Basel). 2019 Nov 25;19(23):5163. doi: 10.3390/s19235163.

Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library.基于 PyAnnote 音频处理库的监督式说话人标注系统的开发。

Sensors (Basel). 2023 Feb 13;23(4):2082. doi: 10.3390/s23042082.

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion.基于时空贝叶斯融合的视听说话人定界

IEEE Trans Pattern Anal Mach Intell. 2018 May;40(5):1086-1099. doi: 10.1109/TPAMI.2017.2648793. Epub 2017 Jan 5.

Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation.基于Whisper分割的实时多语言语音识别与说话人识别系统。

PeerJ Comput Sci. 2024 Mar 29;10:e1973. doi: 10.7717/peerj-cs.1973. eCollection 2024.

Speaker-turn aware diarization for speech-based cognitive assessments.用于基于语音的认知评估的说话轮次感知语音分离

Front Neurosci. 2024 Jan 16;17:1351848. doi: 10.3389/fnins.2023.1351848. eCollection 2023.

End-to-end neural speaker diarization with an iterative adaptive attractor estimation.基于迭代自适应吸引子估计的端到端神经说话人聚类

Neural Netw. 2023 Sep;166:566-578. doi: 10.1016/j.neunet.2023.07.043. Epub 2023 Aug 1.

The Impact of Speaker Diarization on DNN-based Autism Severity Estimation.说话人分段对基于 DNN 的自闭症严重程度估计的影响。

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3414-3417. doi: 10.1109/EMBC48229.2022.9871523.

Evaluation of Deep Clustering for Diarization of Aphasic Speech.用于失语症语音分离的深度聚类评估

Stud Health Technol Inform. 2019;260:81-88.

Multimodal Speaker Diarization.多模态说话人分割。

IEEE Trans Pattern Anal Mach Intell. 2012 Jan;34(1):79-93. doi: 10.1109/TPAMI.2011.47. Epub 2011 Mar 10.

引用本文的文献

Development of Supervised Speaker Diarization System Based on the PyAnnote Audio Processing Library.基于 PyAnnote 音频处理库的监督式说话人标注系统的开发。

Sensors (Basel). 2023 Feb 13;23(4):2082. doi: 10.3390/s23042082.

Withdrawal ruptures in adolescents with borderline personality disorder psychotherapy are marked by increased speech pauses-can minimal responses be automatically detected?边缘型人格障碍青少年的断乳破裂，其心理治疗表现为言语停顿增加——能否自动检测到最小反应？

PLoS One. 2023 Jan 17;18(1):e0280329. doi: 10.1371/journal.pone.0280329. eCollection 2023.

The Influence of Cognitive Biases and Financial Factors on Forecast Accuracy of Analysts.认知偏差和财务因素对分析师预测准确性的影响

Front Psychol. 2022 Jan 4;12:773894. doi: 10.3389/fpsyg.2021.773894. eCollection 2021.

While the Chatbot's Away, the Mice Will Play.主人不在家，耗子成精啦。

Front Digit Health. 2021 Feb 2;3:617013. doi: 10.3389/fdgth.2021.617013. eCollection 2021.

Alliance Ruptures and Resolutions in Personality Disorders.人格障碍中的联盟破裂与解决

Curr Psychiatry Rep. 2020 Dec 11;23(1):1. doi: 10.1007/s11920-020-01212-w.

本文引用的文献

Motion energy analysis (MEA): A primer on the assessment of motion from video.运动能量分析（MEA）：视频中运动评估的入门指南。

J Couns Psychol. 2020 Jul;67(4):536-549. doi: 10.1037/cou0000407.

Coregulation of therapist and client emotion during psychotherapy.心理治疗过程中治疗师和来访者情绪的共同调节。

Psychother Res. 2020 Jun;30(5):591-603. doi: 10.1080/10503307.2019.1661541. Epub 2019 Sep 4.

Silence in the psychotherapy of adolescents with borderline personality pathology.青少年边缘型人格障碍心理治疗中的沉默。

Personal Disord. 2021 Mar;12(2):160-170. doi: 10.1037/per0000402. Epub 2020 Apr 23.

Machine Learning in Psychometrics and Psychological Research.心理测量学与心理学研究中的机器学习

Front Psychol. 2020 Jan 10;10:2970. doi: 10.3389/fpsyg.2019.02970. eCollection 2019.

Interpersonal synchrony feels good but impedes self-regulation of affect.人际同步感觉良好，但会阻碍情感的自我调节。

Sci Rep. 2019 Oct 11;9(1):14691. doi: 10.1038/s41598-019-50960-0.

Predicting personalized process-outcome associations in psychotherapy using machine learning approaches-A demonstration.使用机器学习方法预测心理治疗中的个性化过程-结果关联——演示。

Psychother Res. 2020 Mar;30(3):300-309. doi: 10.1080/10503307.2019.1597994. Epub 2019 Mar 26.

A design for process-outcome psychotherapy research in adolescents with Borderline Personality Pathology.针对患有边缘型人格障碍的青少年进行过程-结果心理治疗研究的一种设计。

Contemp Clin Trials Commun. 2018 Oct 31;12:182-191. doi: 10.1016/j.conctc.2018.10.007. eCollection 2018 Dec.

Predicting Adherence to Internet-Delivered Psychotherapy for Symptoms of Depression and Anxiety After Myocardial Infarction: Machine Learning Insights From the U-CARE Heart Randomized Controlled Trial.预测心肌梗死后抑郁症和焦虑症症状的互联网心理治疗依从性：来自U-CARE心脏随机对照试验的机器学习见解

J Med Internet Res. 2018 Oct 10;20(10):e10754. doi: 10.2196/10754.

Major developments in methods addressing for whom psychotherapy may work and why.方法上的主要发展，旨在解决哪些人可能从心理治疗中获益，以及为什么。

Psychother Res. 2019 Aug;29(6):693-708. doi: 10.1080/10503307.2018.1429691. Epub 2018 Feb 7.

State of the Art of Interpersonal Physiology in Psychotherapy: A Systematic Review.心理治疗中人际生理学的现状：一项系统综述。

Front Psychol. 2017 Nov 24;8:2053. doi: 10.3389/fpsyg.2017.02053. eCollection 2017.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用随机森林的监督式说话人分割：一种心理治疗过程研究工具。

Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献