Happe André, Pouliquen Bruno, Burgun Anita, Cuggia Marc, Le Beux Pierre
Intermède, La Basse Revachais, 35580 Guignen, France.
Int J Med Inform. 2003 Jul;70(2-3):255-63. doi: 10.1016/s1386-5056(03)00055-8.
The objective of this project is to investigate methods whereby a combination of speech recognition and automated indexing methods substitute for current transcription and indexing practices.
We based our study on existing speech recognition software programs and on NOMINDEX, a tool that extracts MeSH concepts from medical text in natural language and that is mainly based on a French medical lexicon and on the UMLS. For each document, the process consists of three steps: (1) dictation and digital audio recording, (2) speech recognition, (3) automatic indexing. The evaluation consisted of a comparison between the set of concepts extracted by NOMINDEX after the speech recognition phase and the set of keywords manually extracted from the initial document. The method was evaluated on a set of 28 patient discharge summaries extracted from the MENELAS corpus in French, corresponding to in-patients admitted for coronarography.
The overall precision was 73% and the overall recall was 90%. Indexing errors were mainly due to word sense ambiguity and abbreviations. A specific issue was the fact that the standard French translation of MeSH terms lacks diacritics. A preliminary evaluation of speech recognition tools showed that the rate of accurate recognition was higher than 98%. Only 3% of the indexing errors were generated by inadequate speech recognition.
We discuss several areas to focus on to improve this prototype. However, the very low rate of indexing errors due to speech recognition errors highlights the potential benefits of combining speech recognition techniques and automatic indexing.
本项目的目的是研究语音识别与自动索引方法相结合以替代当前转录和索引做法的方法。
我们的研究基于现有的语音识别软件程序以及NOMINDEX,NOMINDEX是一种从自然语言的医学文本中提取医学主题词(MeSH)概念的工具,主要基于法语医学词汇表和统一医学语言系统(UMLS)。对于每份文档,该过程包括三个步骤:(1)听写和数字音频录制,(2)语音识别,(3)自动索引。评估包括在语音识别阶段后由NOMINDEX提取的概念集与从原始文档中手动提取的关键词集之间的比较。该方法在从法语MENELAS语料库中提取的28份患者出院小结上进行了评估,这些小结对应于因冠状动脉造影入院的住院患者。
总体精确率为73%,总体召回率为90%。索引错误主要归因于词义模糊和缩写。一个具体问题是医学主题词的标准法语翻译缺少变音符号。语音识别工具的初步评估表明,准确识别率高于98%。只有3%的索引错误是由语音识别不足产生的。
我们讨论了几个需要关注的领域以改进这个原型。然而,由于语音识别错误导致的索引错误率非常低,这凸显了结合语音识别技术和自动索引的潜在益处。