Michel E, Michel E M, Hägele W, Zernikow B
Medizinischer Dienst der Krankenversicherung Westfalen-Lippe, Münster.
Gesundheitswesen. 1998 Oct;60(10):567-71.
Automatic speech recognition systems are already being used in spheres employing a restricted vocabulary.
Our aim was to investigate whether low-cost speech recognition software for PC is capable of being usefully employed in the sphere of sociomedicine.
To this end 34 representative pages of text (a total of 11,000 words) taken from expertises on cases of suspected medical malpractice (many different subspecialties) were dictated using IBM's "Voice Type Simply Speaking" software. Having completed a page, the resulting error rate was recorded, and the text was corrected before we proceeded with the dictation. Finally, 3 pages of text were re-dictated and the resulting error rate determined.
The error rate in the previously unknown text ranged between 10 and 23 per cent (mean 15.9%) without any significant reduction during the training phase, while that in the re-dictated text was drastically reduced to less than 3 per cent. It became evident that once a word was corrected the system hardly ever repeated that particular mistake.
The system's poor performance on unknown text and the missing reduction in the error rate during the training phase are obviously not due to any incompetence of the system but to the huge amount of technical jargon in the scope of medical writing. To attain an acceptable performance we suggest to either extend the training phase, or, preferably, to confine the application to a single medical subspecialty. Its overwhelming learning ability makes the system a serious candidate typist in the sphere of sociomedicine.
自动语音识别系统已在词汇受限的领域中得到应用。
我们的目的是研究用于个人电脑的低成本语音识别软件是否能够有效地应用于社会医学领域。
为此,我们使用IBM的“简单语音打字”软件听写了34页具有代表性的文本(共计11000个单词),这些文本摘自医疗事故疑似案例的专家意见(涉及许多不同的亚专业)。完成一页文本听写后,记录产生的错误率,并在继续听写之前对文本进行校正。最后,重新听写3页文本并确定产生的错误率。
在之前未知的文本中,错误率在10%至23%之间(平均为15.9%),在训练阶段没有显著降低,而在重新听写的文本中,错误率大幅降至3%以下。很明显,一旦某个单词被校正,系统几乎不会再重复那个特定的错误。
该系统在未知文本上表现不佳,且在训练阶段错误率没有降低,这显然不是由于系统本身的无能,而是由于医学写作范围内大量的专业术语。为了获得可接受的性能,我们建议要么延长训练阶段,要么更好的做法是将应用范围限制在单一医学亚专业。其强大的学习能力使该系统成为社会医学领域中一个有竞争力的打字替代方案。