• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于云的语音识别开放式应用程序编程接口对韩语医学术语的准确性。

Accuracy of Cloud-Based Speech Recognition Open Application Programming Interface for Medical Terms of Korean.

机构信息

Rehabilitation and Prevention Center, Heart Vascular Stroke Institute, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea.

Department of Biomedical Engineering, Seoul National University College of Medicine, Seoul, Korea.

出版信息

J Korean Med Sci. 2022 May 9;37(18):e144. doi: 10.3346/jkms.2022.37.e144.

DOI:10.3346/jkms.2022.37.e144
PMID:35535371
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9091429/
Abstract

BACKGROUND

There are limited data on the accuracy of cloud-based speech recognition (SR) open application programming interfaces (APIs) for medical terminology. This study aimed to evaluate the medical term recognition accuracy of current available cloud-based SR open APIs in Korean.

METHODS

We analyzed the SR accuracy of currently available cloud-based SR open APIs using real doctor-patient conversation recordings collected from an outpatient clinic at a large tertiary medical center in Korea. For each original and SR transcription, we analyzed the accuracy rate of each cloud-based SR open API (i.e., the number of medical terms in the SR transcription per number of medical terms in the original transcription).

RESULTS

A total of 112 doctor-patient conversation recordings were converted with three cloud-based SR open APIs (Naver Clova SR from Naver Corporation; Google Speech-to-Text from Alphabet Inc.; and Amazon Transcribe from Amazon), and each transcription was compared. Naver Clova SR (75.1%) showed the highest accuracy with the recognition of medical terms compared to the other open APIs (Google Speech-to-Text, 50.9%, < 0.001; Amazon Transcribe, 57.9%, < 0.001), and Amazon Transcribe demonstrated higher recognition accuracy compared to Google Speech-to-Text ( < 0.001). In the sub-analysis, Naver Clova SR showed the highest accuracy in all areas according to word classes, but the accuracy of words longer than five characters showed no statistical differences (Naver Clova SR, 52.6%; Google Speech-to-Text, 56.3%; Amazon Transcribe, 36.6%).

CONCLUSION

Among three current cloud-based SR open APIs, Naver Clova SR which manufactured by Korean company showed highest accuracy of medical terms in Korean, compared to Google Speech-to-Text and Amazon Transcribe. Although limitations are existing in the recognition of medical terminology, there is a lot of rooms for improvement of this promising technology by combining strengths of each SR engines.

摘要

背景

目前关于云语音识别(SR)开放应用程序接口(API)在医学术语方面的准确性数据有限。本研究旨在评估当前可用的基于云的 SR 开放 API 在韩语中的医学术语识别准确性。

方法

我们使用从韩国一家大型三级医疗中心的门诊收集的真实医患对话记录,分析了当前可用的基于云的 SR 开放 API 的 SR 准确性。对于每个原始和 SR 转录本,我们分析了每个基于云的 SR 开放 API 的准确率(即,SR 转录本中的医学术语数量与原始转录本中的医学术语数量之比)。

结果

共有 112 份医患对话记录通过三个基于云的 SR 开放 API(Naver Corporation 的 Naver Clova SR;Alphabet Inc. 的 Google Speech-to-Text;以及 Amazon 的 Amazon Transcribe)进行了转换,并且比较了每个转录本。与其他开放 API 相比,Naver Clova SR(75.1%)在医学术语识别方面的准确性最高(Google Speech-to-Text,50.9%,<0.001;Amazon Transcribe,57.9%,<0.001),并且 Amazon Transcribe 比 Google Speech-to-Text 的识别准确率更高(<0.001)。在亚组分析中,Naver Clova SR 在所有词类中均表现出最高的准确性,但五个字符以上的单词准确性没有统计学差异(Naver Clova SR,52.6%;Google Speech-to-Text,56.3%;Amazon Transcribe,36.6%)。

结论

在三种当前的基于云的 SR 开放 API 中,韩国公司制造的 Naver Clova SR 在韩语中的医学术语识别准确性最高,其次是 Google Speech-to-Text 和 Amazon Transcribe。尽管在医学术语识别方面存在局限性,但通过结合每个 SR 引擎的优势,这项很有前途的技术仍有很大的改进空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3292/9091429/480447ff6cd9/jkms-37-e144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3292/9091429/480447ff6cd9/jkms-37-e144-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3292/9091429/480447ff6cd9/jkms-37-e144-g001.jpg

相似文献

1
Accuracy of Cloud-Based Speech Recognition Open Application Programming Interface for Medical Terms of Korean.基于云的语音识别开放式应用程序编程接口对韩语医学术语的准确性。
J Korean Med Sci. 2022 May 9;37(18):e144. doi: 10.3346/jkms.2022.37.e144.
2
End-to-End Lip-Reading Open Cloud-Based Speech Architecture.端到端唇读开放云语音架构。
Sensors (Basel). 2022 Apr 12;22(8):2938. doi: 10.3390/s22082938.
3
Assessing the Effectiveness of Automatic Speech Recognition Technology in Emergency Medicine Settings: A Comparative Study of Four AI-powered Engines.评估自动语音识别技术在急诊医学环境中的有效性:四种人工智能驱动引擎的比较研究。
Res Sq. 2024 Aug 17:rs.3.rs-4727659. doi: 10.21203/rs.3.rs-4727659/v1.
4
"Mm-hm," "Uh-uh": are non-lexical conversational sounds deal breakers for the ambient clinical documentation technology?“嗯”“呃”:非词汇性会话声音是否是环境临床文档技术的障碍?
J Am Med Inform Assoc. 2023 Mar 16;30(4):703-711. doi: 10.1093/jamia/ocad001.
5
Safe "cloudification" of large images through picker APIs.通过选取器应用程序编程接口对大图像进行安全的“云处理”。
AMIA Annu Symp Proc. 2017 Feb 10;2016:342-351. eCollection 2016.
6
Machine learning-based speech recognition system for nursing documentation - A pilot study.基于机器学习的护理记录语音识别系统——一项试点研究。
Int J Med Inform. 2023 Oct;178:105213. doi: 10.1016/j.ijmedinf.2023.105213. Epub 2023 Sep 9.
7
An Evaluation of Expedited Transcription Methods for School-Age Children's Narrative Language: Automatic Speech Recognition and Real-Time Transcription.学校年龄儿童叙事语言的快速转录方法评估:自动语音识别和实时转录。
J Speech Lang Hear Res. 2021 Sep 14;64(9):3533-3548. doi: 10.1044/2021_JSLHR-21-00096. Epub 2021 Aug 18.
8
The impact of implementing speech recognition technology on the accuracy and efficiency (time to complete) clinical documentation by nurses: A systematic review.实施语音识别技术对护士临床文档记录的准确性和效率(完成时间)的影响:系统评价。
J Clin Nurs. 2020 Jul;29(13-14):2125-2137. doi: 10.1111/jocn.15261. Epub 2020 May 7.
9
Comparing the Efficiency of Software-Based Speech Recognition Versus Traditional Telephone Transcription in an Outpatient Physical Medicine and Rehabilitation Practice.比较基于软件的语音识别与传统电话转录在门诊物理医学与康复实践中的效率。
Mil Med. 2020 Aug 14;185(7-8):e1183-e1186. doi: 10.1093/milmed/usz374.
10
Automatized analysis of children's exposure to child-directed speech in reschool settings: Validation and application.儿童在幼儿园环境中接触到的儿童导向语言的自动化分析:验证与应用。
PLoS One. 2020 Nov 25;15(11):e0242511. doi: 10.1371/journal.pone.0242511. eCollection 2020.

引用本文的文献

1
Evaluation Framework of Large Language Models in Medical Documentation: Development and Usability Study.大型语言模型在医疗文档中的评估框架:开发和可用性研究。
J Med Internet Res. 2024 Nov 20;26:e58329. doi: 10.2196/58329.
2
Automated Information Extraction from Thyroid Operation Narrative: A Comparative Study of GPT-4 and Fine-tuned KoELECTRA.从甲状腺手术记录中自动提取信息:GPT-4与微调后的KoELECTRA的比较研究
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:249-257. eCollection 2024.
3
Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review.

本文引用的文献

1
The prospective of Artificial Intelligence in COVID-19 Pandemic.人工智能在新冠疫情中的前景。
Health Technol (Berl). 2021;11(6):1311-1320. doi: 10.1007/s12553-021-00601-2. Epub 2021 Sep 28.
2
Automatic Classification of the Korean Triage Acuity Scale in Simulated Emergency Rooms Using Speech Recognition and Natural Language Processing: a Proof of Concept Study.使用语音识别和自然语言处理对模拟急救室中的韩国分诊 acuity 量表进行自动分类:概念验证研究。
J Korean Med Sci. 2021 Jul 12;36(27):e175. doi: 10.3346/jkms.2021.36.e175.
3
Digital scribe utility and barriers to implementation in clinical practice: a scoping review.
智能医院中用于转录、疾病诊断和医疗设备交互控制的智能语音技术:综述。
Comput Biol Med. 2023 Feb;153:106517. doi: 10.1016/j.compbiomed.2022.106517. Epub 2023 Jan 5.
数字抄写工具及其在临床实践中的实施障碍:一项范围综述
Health Technol (Berl). 2021;11(4):803-809. doi: 10.1007/s12553-021-00568-0. Epub 2021 Jun 2.
4
Does "AI" stand for augmenting inequality in the era of covid-19 healthcare?在新冠疫情时代,“人工智能”是否加剧了医疗保健领域的不平等?
BMJ. 2021 Mar 15;372:n304. doi: 10.1136/bmj.n304.
5
The Computer Will See You Now: Overcoming Barriers to Adoption of Computer-Assisted History Taking (CAHT) in Primary Care.《计算机将为您诊断:克服初级保健中采用计算机辅助病史采集(CAHT)的障碍》
J Med Internet Res. 2021 Feb 24;23(2):e19306. doi: 10.2196/19306.
6
Geographic Distribution of US Cohorts Used to Train Deep Learning Algorithms.美国用于训练深度学习算法的队列的地理分布。
JAMA. 2020 Sep 22;324(12):1212-1213. doi: 10.1001/jama.2020.12067.
7
Physician use of speech recognition versus typing in clinical documentation: A controlled observational study.医生在临床文档记录中使用语音识别与打字的比较:一项对照观察研究。
Int J Med Inform. 2020 Sep;141:104178. doi: 10.1016/j.ijmedinf.2020.104178. Epub 2020 May 15.
8
The Doctor-Patient Relationship With Artificial Intelligence.医患关系与人工智能。
AJR Am J Roentgenol. 2019 Feb;212(2):308-310. doi: 10.2214/AJR.18.20509. Epub 2018 Dec 12.
9
Analysis of Errors in Dictated Clinical Documents Assisted by Speech Recognition Software and Professional Transcriptionists.语音识别软件辅助下的口述临床文档与专业转录员的错误分析。
JAMA Netw Open. 2018 Jul;1(3):e180530. doi: 10.1001/jamanetworkopen.2018.0530. Epub 2018 Jul 6.
10
Investigating the relationship between consultation length and patient experience: a cross-sectional study in primary care.探讨会诊时长与患者体验之间的关系:一项初级医疗保健领域的横断面研究。
Br J Gen Pract. 2016 Dec;66(653):e896-e903. doi: 10.3399/bjgp16X687733. Epub 2016 Oct 24.