• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

当代用于对话式临床语音的自动语音识别引擎的系统比较。

A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech.

作者信息

Kodish-Wachs Jodi, Agassi Emin, Kenny Patrick, Overhage J Marc

机构信息

Cerner Corporation, Malvern, PA.

出版信息

AMIA Annu Symp Proc. 2018 Dec 5;2018:683-689. eCollection 2018.

PMID:30815110
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6371385/
Abstract

Conversations especially between a clinician and a patient are important sources of data to support clinical care. To date, clinicians act as the sensor to capture these data and record them in the medical record. Automatic speech recognition (ASR) engines have advanced to support continuous speech, to work independently of speaker and deliver continuously improving performance. Near human levels of performance have been reported for several ASR engines. We undertook a systematic comparison of selected ASRs for clinical conversational speech. Using audio recorded from unscripted clinical scenarios using two microphones, we evaluated eight ASR engines using word error rate (WER) and the precision, recall and F1 scores for concept extraction. We found a wide range of word errors across the ASR engines, with values ranging from 65% to 34%, all falling short of the rates achieved for other conversational speech. Recall for health concepts also ranged from 22% to 74%. Concept recall rates match or exceed expectations given measured word error rates suggesting that vocabulary is not the dominant issue.

摘要

对话,尤其是临床医生与患者之间的对话,是支持临床护理的数据的重要来源。迄今为止,临床医生充当捕捉这些数据并将其记录在病历中的传感器。自动语音识别(ASR)引擎已经取得进展,以支持连续语音,独立于说话者工作并不断提高性能。已有报道称,几款ASR引擎的性能接近人类水平。我们对用于临床对话语音的选定ASR进行了系统比较。我们使用两个麦克风从无脚本临床场景中录制的音频,通过单词错误率(WER)以及概念提取的精确率、召回率和F1分数对八个ASR引擎进行了评估。我们发现,各ASR引擎的单词错误率差异很大,数值范围从65%到34%,均低于其他对话语音所达到的比率。健康概念的召回率也在22%到74%之间。考虑到测得的单词错误率,概念召回率符合或超过预期,这表明词汇不是主要问题。

相似文献

1
A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech.当代用于对话式临床语音的自动语音识别引擎的系统比较。
AMIA Annu Symp Proc. 2018 Dec 5;2018:683-689. eCollection 2018.
2
"Mm-hm," "Uh-uh": are non-lexical conversational sounds deal breakers for the ambient clinical documentation technology?“嗯”“呃”:非词汇性会话声音是否是环境临床文档技术的障碍?
J Am Med Inform Assoc. 2023 Mar 16;30(4):703-711. doi: 10.1093/jamia/ocad001.
3
Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations.数字听写员的自动语音识别性能:针对患者-临床医生对话进行调整的通用和专用模型之间的性能比较。
AMIA Annu Symp Proc. 2023 Apr 29;2022:1072-1080. eCollection 2022.
4
The development of an automatic speech recognition model using interview data from long-term care for older adults.利用老年人长期护理访谈数据开发自动语音识别模型。
J Am Med Inform Assoc. 2023 Feb 16;30(3):411-417. doi: 10.1093/jamia/ocac241.
5
A simple error classification system for understanding sources of error in automatic speech recognition and human transcription.一个用于理解自动语音识别和人工转录中错误来源的简单错误分类系统。
Int J Med Inform. 2004 Sep;73(9-10):719-30. doi: 10.1016/j.ijmedinf.2004.05.008.
6
Automatic Speech Recognition Performance Improvement for Mandarin Based on Optimizing Gain Control Strategy.基于优化增益控制策略的普通话自动语音识别性能提升
Sensors (Basel). 2022 Apr 15;22(8):3027. doi: 10.3390/s22083027.
7
A comparison of automatic and human speech recognition in null grammar.自动语音识别与零语法下的人工语音识别比较。
J Acoust Soc Am. 2012 Mar;131(3):EL256-61. doi: 10.1121/1.3684744.
8
Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models.回顾性分析爱沙尼亚语音识别系统在放射学中的临床性能:不同声学和语言模型的影响。
J Digit Imaging. 2018 Oct;31(5):615-621. doi: 10.1007/s10278-018-0085-8.
9
Assessing the Effectiveness of Automatic Speech Recognition Technology in Emergency Medicine Settings: A Comparative Study of Four AI-powered Engines.评估自动语音识别技术在急诊医学环境中的有效性:四种人工智能驱动引擎的比较研究。
Res Sq. 2024 Aug 17:rs.3.rs-4727659. doi: 10.21203/rs.3.rs-4727659/v1.
10
Machine learning based sample extraction for automatic speech recognition using dialectal Assamese speech.基于机器学习的方言阿萨姆语语音自动识别样本提取。
Neural Netw. 2016 Jun;78:97-111. doi: 10.1016/j.neunet.2015.12.010. Epub 2015 Dec 30.

引用本文的文献

1
The impact of using AI-powered voice-to-text technology for clinical documentation on quality of care in primary care and outpatient settings: a systematic review.在初级保健和门诊环境中,使用人工智能语音转文本技术进行临床文档记录对医疗质量的影响:一项系统综述。
EBioMedicine. 2025 Jul 21;118:105861. doi: 10.1016/j.ebiom.2025.105861.
2
Evaluating the performance of artificial intelligence-based speech recognition for clinical documentation: a systematic review.评估基于人工智能的语音识别在临床文档记录中的性能:一项系统综述。
BMC Med Inform Decis Mak. 2025 Jul 1;25(1):236. doi: 10.1186/s12911-025-03061-0.
3
Association of machine-learning-rated supportive counseling skills with psychotherapy outcome.机器学习评估的支持性咨询技能与心理治疗结果的关联。
J Consult Clin Psychol. 2025 Feb;93(2):110-119. doi: 10.1037/ccp0000935.
4
Cognitive Computing-Based CDSS in Medical Practice.医学实践中基于认知计算的临床决策支持系统
Health Data Sci. 2021 Jul 22;2021:9819851. doi: 10.34133/2021/9819851. eCollection 2021.
5
Feedback From Automatic Speech Recognition to Elicit Clear Speech in Healthy Speakers.从自动语音识别中获取反馈,以促使健康说话者说出清晰的语音。
Am J Speech Lang Pathol. 2023 Nov 6;32(6):2940-2959. doi: 10.1044/2023_AJSLP-23-00030. Epub 2023 Oct 12.
6
Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations.数字听写员的自动语音识别性能:针对患者-临床医生对话进行调整的通用和专用模型之间的性能比较。
AMIA Annu Symp Proc. 2023 Apr 29;2022:1072-1080. eCollection 2022.
7
"Mm-hm," "Uh-uh": are non-lexical conversational sounds deal breakers for the ambient clinical documentation technology?“嗯”“呃”:非词汇性会话声音是否是环境临床文档技术的障碍?
J Am Med Inform Assoc. 2023 Mar 16;30(4):703-711. doi: 10.1093/jamia/ocad001.
8
Automatic Assessment of Intelligibility in Noise in Parkinson Disease: Validation Study.帕金森病噪声环境下言语可懂度的自动评估:验证研究。
J Med Internet Res. 2022 Oct 20;24(10):e40567. doi: 10.2196/40567.
9
Expectations for Artificial Intelligence (AI) in Psychiatry.对精神病学人工智能的期望。
Curr Psychiatry Rep. 2022 Nov;24(11):709-721. doi: 10.1007/s11920-022-01378-5. Epub 2022 Oct 10.
10
A dataset of simulated patient-physician medical interviews with a focus on respiratory cases.一个以呼吸病例为重点的模拟医患医疗访谈数据集。
Sci Data. 2022 Jun 16;9(1):313. doi: 10.1038/s41597-022-01423-1.

本文引用的文献

1
Benchmarking clinical speech recognition and information extraction: new data, methods, and evaluations.基准临床语音识别和信息提取:新数据、方法和评估。
JMIR Med Inform. 2015 Apr 27;3(2):e19. doi: 10.2196/medinform.4321.
2
Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions.迈向口语临床问答:评估和调整自动语音识别系统以应对口语临床问题。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):625-30. doi: 10.1136/amiajnl-2010-000071. Epub 2011 Jun 24.
3
Continuous speech recognition for clinicians.面向临床医生的连续语音识别
J Am Med Inform Assoc. 1999 May-Jun;6(3):195-204. doi: 10.1136/jamia.1999.0060195.
4
The validity of the medical record.病历的有效性。
Med Care. 1981 Mar;19(3):310-5. doi: 10.1097/00005650-198103000-00006.
5
Validating the content of pediatric outpatient medical records by means of tape-recording doctor-patient encounters.通过录音医患诊疗过程来验证儿科门诊病历的内容。
Pediatrics. 1975 Sep;56(3):407-11.