Supervised Speaker Diarization Using Random Forests: A Tool for Psychotherapy Process Research.

Author information

Fürer Lukas, Schenk Nathalie, Roth Volker, Steppan Martin, Schmeck Klaus, Zimmermann Ronan

Affiliations

Clinic for Children and Adolescents, University Psychiatric Clinic, Basel, Switzerland.

Department of Mathematics and Computer Science, University of Basel, Basel, Switzerland.

Publication information

Front Psychol. 2020 Jul 28;11:1726. doi: 10.3389/fpsyg.2020.01726. eCollection 2020.

Abstract

Speaker diarization is the task of determining who speaks when in an audio recording. Psychotherapy research often relies on labor-intensive manual diarization. Unsupervised methods are available but yield higher error rates. We present a method for supervised speaker diarization based on random forests. It can be considered a compromise between the commonly used labor-intensive manual coding and fully automated procedures. The method is validated using the EMRAI synthetic speech corpus and is made publicly available. It yields low diarization error rates (M: 5.61%, STD: 2.19). Supervised speaker diarization is a promising method for psychotherapy research and similar fields.
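The diarization error rate reported above is commonly defined as the share of reference speech time that is missed, falsely detected, or attributed to the wrong speaker. The following is a minimal frame-level sketch of that definition, not the authors' evaluation code. It assumes one label per fixed-length frame, `None` for non-speech, no overlapping speech, no forgiveness collar, and speaker labels that already match between reference and hypothesis (plausible in a supervised setting where speakers are known in advance). The function name and the toy labels `"T"` (therapist) and `"P"` (patient) are hypothetical.

```python
from typing import Optional, Sequence


def frame_level_der(
    reference: Sequence[Optional[str]],
    hypothesis: Sequence[Optional[str]],
) -> float:
    """Simplified frame-level diarization error rate.

    Each element is the speaker label for one frame, or None for non-speech.
    Assumes labels are already aligned between reference and hypothesis.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same length")

    missed = false_alarm = confusion = 0
    for ref, hyp in zip(reference, hypothesis):
        if ref is not None and hyp is None:
            missed += 1          # speech in reference, silence predicted
        elif ref is None and hyp is not None:
            false_alarm += 1     # silence in reference, speech predicted
        elif ref is not None and ref != hyp:
            confusion += 1       # speech attributed to the wrong speaker

    speech_frames = sum(label is not None for label in reference)
    if speech_frames == 0:
        raise ValueError("reference contains no speech frames")

    return (missed + false_alarm + confusion) / speech_frames


# Toy example: 10 frames, 8 of which contain speech in the reference.
reference = ["T", "T", "T", None, "P", "P", "P", "P", None, "T"]
hypothesis = ["T", "T", "P", None, "P", "P", None, "P", "T", "T"]

der = frame_level_der(reference, hypothesis)
print(f"{der:.1%}")  # one confusion, one miss, one false alarm over 8 speech frames: 37.5%
```

Real evaluations typically work on timed segments rather than fixed frames and may apply a collar around speaker changes, so values from this sketch are not directly comparable to the figures in the abstract.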

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c79/7399377/20a022206a5f/fpsyg-11-01726-g001.jpg
