Suppr超能文献

基于语音识别的巴西葡萄牙语医疗报告书写的网络系统原型。

Web System Prototype based on speech recognition to construct medical reports in Brazilian Portuguese.

机构信息

Laboratory of Bioinformatics, Western Paraná State University, Presidente Tancredo Neves Avenue, 6731, ZIP code 85867-900, Foz do Iguaçu, Brazil.

Laboratory of Bioinformatics, Western Paraná State University, Presidente Tancredo Neves Avenue, 6731, ZIP code 85867-900, Foz do Iguaçu, Brazil; Service of Coloproctology, Faculty of Medical Sciences, University of Campinas, Tessália Vieira de Camargo Street, 126, ZIP code 13083-887, Campinas, Brazil.

出版信息

Int J Med Inform. 2019 Jan;121:39-52. doi: 10.1016/j.ijmedinf.2018.10.010. Epub 2018 Oct 26.

Abstract

The overall purpose of automatic speech recognition systems is to make possible the interaction between humans and electronic devices through speech. For example, the content captured from user's speech using a microphone can be transcribed into text. In general, such systems should be able to overcome adversities such as noise, communication channel variability, speaker's age and accent, speech speed, concurrent speeches from other speakers and spontaneous speech. Despite this challenging scenario, this study aims to develop a Web System Prototype to generate medical reports through automatic speech recognition in the Brazilian Portuguese language. The prototype was developed by applying a Software Engineering technique named Delivery in Stage. During the conduction of this technique, we integrated the Google Web Speech API and Microsoft Bing Speech API into the prototype to increase the number of compatible platforms. These automatic speech recognition systems were individually evaluated in the task of transcribing the dictation of a medical area text by 30 volunteers. The recognition performance was evaluated according to the Word Error Rate measure. The Google system achieved an error rate of 12.30%, which was statistically significantly better (p-value <0.0001) than the Microsoft one: 17.68%. Conducting this work allowed us to conclude that these automatic speech recognition systems are compatible with the prototype and can be used in the medical field. The findings also suggest that, besides supporting medical reports construction, the Web System Prototype can be useful for purposes such as recording physicians' notes during a clinical procedure.

摘要

自动语音识别系统的总体目的是通过语音实现人类与电子设备之间的交互。例如,可以将用户通过麦克风说出的内容转录成文本。一般来说,此类系统应该能够克服各种困难,例如噪声、通信信道变化、说话者年龄和口音、语速、来自其他说话者的并发语音以及即兴语音。尽管面临这种具有挑战性的情况,本研究仍旨在开发一个通过巴西葡萄牙语的自动语音识别生成医疗报告的 Web 系统原型。该原型是通过应用一种名为阶段交付的软件工程技术开发的。在实施该技术期间,我们将 Google Web Speech API 和 Microsoft Bing Speech API 集成到原型中,以增加兼容的平台数量。这两个自动语音识别系统在由 30 名志愿者转录医学领域文本的听写任务中进行了单独评估。识别性能根据单词错误率进行评估。Google 系统的错误率为 12.30%,明显优于 Microsoft 系统的 17.68%(p 值<0.0001)。开展这项工作使我们能够得出结论,这些自动语音识别系统与原型兼容,可用于医疗领域。研究结果还表明,除了支持医疗报告的构建外,Web 系统原型还可用于记录临床操作期间医生的笔记等目的。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验