• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个用于理解自动语音识别和人工转录中错误来源的简单错误分类系统。

A simple error classification system for understanding sources of error in automatic speech recognition and human transcription.

作者信息

Zafar Atif, Mamlin Burke, Perkins Susan, Belsito Anne M, Overhage J Marc, McDonald Clement J

机构信息

School of Medicine, Regenstrief Institute, Indiana University, 1001 West 10th Street, RG5 Indianapolis, IN 46202, USA.

出版信息

Int J Med Inform. 2004 Sep;73(9-10):719-30. doi: 10.1016/j.ijmedinf.2004.05.008.

DOI:10.1016/j.ijmedinf.2004.05.008
PMID:15325329
Abstract

OBJECTIVES

To (1) discover the types of errors most commonly found in clinical notes that are generated either using automatic speech recognition (ASR) or via human transcription and (2) to develop efficient rules for classifying these errors based on the categories found in (1). The purpose of classifying errors into categories is to understand the underlying processes that generate these errors, so that measures can be taken to improve these processes.

METHODS

We integrated the Dragon NaturallySpeaking v4.0 speech recognition engine into the Regenstrief Medical Record System. We captured the text output of the speech engine prior to error correction by the speaker. We also acquired a set of human transcribed but uncorrected notes for comparison. We then attempted to error correct these notes based on looking at the context alone. Initially, three domain experts independently examined 104 ASR notes (containing 29,144 words) generated by a single speaker and 44 human transcribed notes (containing 14,199 words) generated by multiple speakers for errors. Collaborative group sessions were subsequently held where error categorizes were determined and rules developed and incrementally refined for systematically examining the notes and classifying errors.

RESULTS

We found that the errors could be classified into nine categories: (1) announciation errors occurring due to speaker mispronounciation, (2) dictionary errors resulting from missing terms, (3) suffix errors caused by misrecognition of appropriate tenses of a word, (4) added words, (5) deleted words, (6) homonym errors resulting from substitution of a phonetically identical word, (7) spelling errors, (8) nonsense errors, words/phrases whose meaning could not be appreciated by examining just the context, and (9) critical errors, words/phrases where a reader of a note could potentially misunderstand the concept that was related by the speaker.

CONCLUSIONS

A simple method is presented for examining errors in transcribed documents and classifying these errors into meaningful and useful categories. Such a classification can potentially help pinpoint sources of such errors so that measures (such as better training of the speaker and improved dictionary and language modeling) can be taken to optimize the error rates.

摘要

目的

(1)找出在使用自动语音识别(ASR)或人工转录生成的临床记录中最常见的错误类型;(2)根据(1)中发现的类别制定有效的错误分类规则。将错误分类的目的是了解产生这些错误的潜在过程,以便采取措施改进这些过程。

方法

我们将Dragon NaturallySpeaking v4.0语音识别引擎集成到Regenstrief医疗记录系统中。在说话者进行纠错之前,我们捕获了语音引擎的文本输出。我们还获取了一组人工转录但未校正的记录用于比较。然后,我们仅根据上下文尝试对这些记录进行纠错。最初,三位领域专家独立检查了由一位说话者生成的104份ASR记录(包含29,144个单词)和由多位说话者生成的44份人工转录记录(包含14,199个单词)中的错误。随后举行了协作小组会议,确定错误类别,制定并逐步完善规则,以便系统地检查记录并对错误进行分类。

结果

我们发现错误可分为九类:(1)由于说话者发音错误导致的发音错误;(2)因缺少术语而产生的词典错误;(3)由单词适当时态误识别导致的后缀错误;(4)添加的单词;(5)删除的单词;(6)由同音词替换导致的同音异形词错误;(7)拼写错误;(8)无意义错误,即仅通过检查上下文无法理解其含义的单词/短语;(9)关键错误,即记录的读者可能会误解说话者所传达概念的单词/短语。

结论

提出了一种简单的方法来检查转录文档中的错误,并将这些错误分类为有意义且有用的类别。这种分类可能有助于确定此类错误的来源,以便采取措施(如对说话者进行更好的培训以及改进词典和语言建模)来优化错误率。

相似文献

1
A simple error classification system for understanding sources of error in automatic speech recognition and human transcription.一个用于理解自动语音识别和人工转录中错误来源的简单错误分类系统。
Int J Med Inform. 2004 Sep;73(9-10):719-30. doi: 10.1016/j.ijmedinf.2004.05.008.
2
The benefit obtained from visually displayed text from an automatic speech recognizer during listening to speech presented in noise.在收听有噪声干扰的语音时,从自动语音识别器的可视文本显示中获得的益处。
Ear Hear. 2008 Dec;29(6):838-52. doi: 10.1097/AUD.0b013e31818005bd.
3
Error rates in breast imaging reports: comparison of automatic speech recognition and dictation transcription.乳腺影像报告中的错误率:自动语音识别与听写转录的比较。
AJR Am J Roentgenol. 2011 Oct;197(4):923-7. doi: 10.2214/AJR.11.6691.
4
Incidence of speech recognition errors in the emergency department.急诊科语音识别错误的发生率。
Int J Med Inform. 2016 Sep;93:70-3. doi: 10.1016/j.ijmedinf.2016.05.005. Epub 2016 May 26.
5
Analysis of Errors in Dictated Clinical Documents Assisted by Speech Recognition Software and Professional Transcriptionists.语音识别软件辅助下的口述临床文档与专业转录员的错误分析。
JAMA Netw Open. 2018 Jul;1(3):e180530. doi: 10.1001/jamanetworkopen.2018.0530. Epub 2018 Jul 6.
6
Free-text data entry by speech recognition software and its impact on clinical routine.通过语音识别软件进行自由文本数据录入及其对临床常规的影响。
Ear Nose Throat J. 2006 Aug;85(8):523-7.
7
Web System Prototype based on speech recognition to construct medical reports in Brazilian Portuguese.基于语音识别的巴西葡萄牙语医疗报告书写的网络系统原型。
Int J Med Inform. 2019 Jan;121:39-52. doi: 10.1016/j.ijmedinf.2018.10.010. Epub 2018 Oct 26.
8
A cognitive taxonomy of medical errors.医学错误的认知分类法。
J Biomed Inform. 2004 Jun;37(3):193-204. doi: 10.1016/j.jbi.2004.04.004.
9
[Automatic report documentation in cardiology using a speech recognition system].[使用语音识别系统进行心脏病学的自动报告记录]
Z Kardiol. 1996 Sep;85(9):684-8.
10
Improving language models for radiology speech recognition.
J Biomed Inform. 2009 Feb;42(1):53-8. doi: 10.1016/j.jbi.2008.08.001. Epub 2008 Aug 12.

引用本文的文献

1
Evaluating the Usability, Technical Performance, and Accuracy of Artificial Intelligence Scribes for Primary Care: Competitive Analysis.评估用于初级保健的人工智能抄写员的可用性、技术性能和准确性:竞争分析
JMIR Hum Factors. 2025 Jul 23;12:e71434. doi: 10.2196/71434.
2
Speech recognition for clinical documentation from 1990 to 2018: a systematic review.1990 年至 2018 年临床文档的语音识别:系统评价。
J Am Med Inform Assoc. 2019 Apr 1;26(4):324-338. doi: 10.1093/jamia/ocy179.
3
Incidence of speech recognition errors in the emergency department.
急诊科语音识别错误的发生率。
Int J Med Inform. 2016 Sep;93:70-3. doi: 10.1016/j.ijmedinf.2016.05.005. Epub 2016 May 26.
4
Hands-free image capture, data tagging and transfer using Google Glass: a pilot study for improved wound care management.使用谷歌眼镜进行免提图像采集、数据标记和传输:改善伤口护理管理的初步研究
PLoS One. 2015 Apr 22;10(4):e0121179. doi: 10.1371/journal.pone.0121179. eCollection 2015.
5
Towards spoken clinical-question answering: evaluating and adapting automatic speech-recognition systems for spoken clinical questions.迈向口语临床问答:评估和调整自动语音识别系统以应对口语临床问题。
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):625-30. doi: 10.1136/amiajnl-2010-000071. Epub 2011 Jun 24.
6
[Speech recognition: impact on workflow and report availability].[语音识别:对工作流程和报告可用性的影响]
Radiologe. 2005 Aug;45(8):735-42. doi: 10.1007/s00117-005-1253-7.