Suppr超能文献

用于内镜实验室报告听写和机器控制的基于语音的对话系统的开发。

Development of a speech-based dialogue system for report dictation and machine control in the endoscopic laboratory.

作者信息

Molnar B, Gergely J, Toth G, Pronai L, Zagoni T, Papik K, Tulassay Z

机构信息

Second Department of Medicine, Semmelweis University, Budapest, Hungary.

出版信息

Endoscopy. 2000 Jan;32(1):58-61. doi: 10.1055/s-2000-136.

Abstract

BACKGROUND AND STUDY AIMS

Reporting and machine control based on speech technology can enhance work efficiency in the gastrointestinal endoscopy laboratory.

MATERIALS AND METHODS

The status and activation of endoscopy laboratory equipment were described as a multivariate parameter and function system. Speech recognition, text evaluation and action definition engines were installed. Special programs were developed for the grammatical analysis of command sentences, and a rule-based expert system for the definition of machine answers. A speech backup engine provides feedback to the user. Techniques were applied based on the "Hidden Markov" model of discrete word, user-independent speech recognition and on phoneme-based speech synthesis. Speech samples were collected from three male low-tone investigators.

RESULTS

The dictation module and machine control modules were incorporated in a personal computer (PC) simulation program. Altogether 100 unidentified patient records were analyzed. The sentences were grouped according to keywords, which indicate the main topics of a gastrointestinal endoscopy report. They were: "endoscope", "esophagus", "cardia", "fundus", "corpus", "antrum", "pylorus", "bulbus", and "postbulbar section", in addition to the major pathological findings: "erosion", "ulceration", and "malignancy". "Biopsy" and "diagnosis" were also included. We implemented wireless speech communication control commands for equipment including an endoscopy unit, video, monitor, printer, and PC. The recognition rate was 95%.

CONCLUSIONS

Speech technology may soon become an integrated part of our daily routine in the endoscopy laboratory. A central speech and laboratory computer could be the most efficient alternative to having separate speech recognition units in all items of equipment.

摘要

背景与研究目的

基于语音技术的报告和机器控制可提高胃肠内镜检查实验室的工作效率。

材料与方法

将内镜检查实验室设备的状态和激活情况描述为一个多变量参数和功能系统。安装了语音识别、文本评估和动作定义引擎。开发了用于命令语句语法分析的特殊程序,以及用于定义机器答案的基于规则的专家系统。语音备份引擎向用户提供反馈。应用了基于离散词的“隐马尔可夫”模型、独立于用户的语音识别以及基于音素的语音合成技术。从三名男性低音调研究者那里收集了语音样本。

结果

听写模块和机器控制模块被整合到一个个人计算机(PC)模拟程序中。总共分析了100份未识别的患者记录。这些句子根据关键词进行分组,这些关键词表明了胃肠内镜检查报告的主要主题。它们是:“内镜”、“食管”、“贲门”、“胃底”、“胃体”、“胃窦”、“幽门”、“球部”和“球后部”,此外还有主要的病理发现:“糜烂”、“溃疡”和“恶性肿瘤”。还包括“活检”和“诊断”。我们为包括内镜单元、视频、监视器、打印机和PC在内的设备实现了无线语音通信控制命令。识别率为95%。

结论

语音技术可能很快会成为我们内镜检查实验室日常工作中不可或缺的一部分。一台中央语音和实验室计算机可能是在所有设备项目中配备单独语音识别单元的最有效替代方案。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验