文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

数字听写员的自动语音识别性能:针对患者-临床医生对话进行调整的通用和专用模型之间的性能比较。

Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations.

机构信息

University of California Irvine, Irvine, CA, USA.

University of California San Diego, La Jolla, USA.

出版信息

AMIA Annu Symp Proc. 2023 Apr 29;2022:1072-1080. eCollection 2022.


DOI:
PMID:37128439
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10148344/
Abstract

One promising solution to address physician data entry needs is through the development of so-called "digital scribes," or tools which aim to automate clinical documentation via automatic speech recognition (ASR) of patient-clinician conversations. Evaluation of specialized ASR models in this domain, useful for understanding feasibility and development opportunities, has been difficult because most models have been under development. Following the commercial release of such models, we report an independent evaluation of four models, two general-purpose, and two for medical conversation with a corpus of 36 primary care conversations. We identify word error rates (WER) of 8.8%-10.5% and word-level diarization error rates (WDER) ranging from 1.8%-13.9%, which are generally lower than previous reports. The findings indicate that, while there is room for improvement, the performance of these specialized models, at least under ideal recording conditions, may be amenable to the development of downstream applications which rely on ASR of patient-clinician conversations.

摘要

解决医师数据录入需求的一个有前景的解决方案是通过开发所谓的“数字抄写员”,或者通过自动语音识别(ASR)自动记录医患对话的工具来实现。由于大多数模型仍在开发中,因此评估专门针对该领域的 ASR 模型(这对于理解可行性和开发机会很有用)一直很困难。在这些模型商业化发布之后,我们报告了对四个模型(两个通用模型和两个用于医疗对话的模型)的独立评估,该评估使用了 36 个初级保健对话的语料库。我们确定了 8.8%-10.5%的单词错误率(WER)和 1.8%-13.9%的单词级对话分割错误率(WDER),这些结果通常低于之前的报告。这些发现表明,虽然仍有改进的空间,但这些专门模型的性能,至少在理想的记录条件下,可能适合开发依赖于医患对话的 ASR 的下游应用程序。

相似文献

[1]
Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations.

AMIA Annu Symp Proc. 2022

[2]
"Mm-hm," "Uh-uh": are non-lexical conversational sounds deal breakers for the ambient clinical documentation technology?

J Am Med Inform Assoc. 2023-3-16

[3]
Complete and Resilient Documentation for Operational Medical Environments Leveraging Mobile Hands-free Technology in a Systems Approach: Experimental Study.

JMIR Mhealth Uhealth. 2021-10-12

[4]
A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech.

AMIA Annu Symp Proc. 2018-12-5

[5]
How does medical scribes' work inform development of speech-based clinical documentation technologies? A systematic review.

J Am Med Inform Assoc. 2020-5-1

[6]
Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy.

Sensors (Basel). 2022-4-15

[7]
Development and benchmarking of a Korean audio speech recognition model for Clinician-Patient conversations in radiation oncology clinics.

Int J Med Inform. 2023-8

[8]
The development of an automatic speech recognition model using interview data from long-term care for older adults.

J Am Med Inform Assoc. 2023-2-16

[9]
The digital scribe in clinical practice: a scoping review and research agenda.

NPJ Digit Med. 2021-3-26

[10]
Combining automatic speech recognition with semantic natural language processing in schizophrenia.

Psychiatry Res. 2023-7

引用本文的文献

[1]
Evaluating the Usability, Technical Performance, and Accuracy of Artificial Intelligence Scribes for Primary Care: Competitive Analysis.

JMIR Hum Factors. 2025-7-23

[2]
Inspired Spine Smart Universal Resource Identifier (SURI): An Adaptive AI Framework for Transforming Multilingual Speech Into Structured Medical Reports.

Cureus. 2025-3-26

[3]
The Utility and Implications of Ambient Scribes in Primary Care.

JMIR AI. 2024-10-4

本文引用的文献

[1]
The digital scribe in clinical practice: a scoping review and research agenda.

NPJ Digit Med. 2021-3-26

[2]
Automated rating of patient and physician emotion in primary care visits.

Patient Educ Couns. 2021-8

[3]
Assessing the accuracy of automatic speech recognition for psychotherapy.

NPJ Digit Med. 2020-6-3

[4]
How does medical scribes' work inform development of speech-based clinical documentation technologies? A systematic review.

J Am Med Inform Assoc. 2020-5-1

[5]
Challenges of developing a digital scribe to reduce clinical documentation burden.

NPJ Digit Med. 2019-11-22

[6]
Detecting conversation topics in primary care office visits from transcripts of patient-provider interactions.

J Am Med Inform Assoc. 2019-12-1

[7]
A network model of activities in primary care consultations.

J Am Med Inform Assoc. 2019-10-1

[8]
The digital scribe.

NPJ Digit Med. 2018-10-16

[9]
Automatically Charting Symptoms From Patient-Physician Conversations Using Machine Learning.

JAMA Intern Med. 2019-6-1

[10]
A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech.

AMIA Annu Symp Proc. 2018-12-5

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索