Hawley Mark S, Enderby Pam, Green Phil, Cunningham Stuart, Brownsell Simon, Carmichael James, Parker Mark, Hatzis Athanassios, O'Neill Peter, Palmer Rebecca
Department of Medical Physics and Clinical Engineering, Barnsley Hospital NHS Foundation Trust, UK.
Med Eng Phys. 2007 Jun;29(5):586-93. doi: 10.1016/j.medengphy.2006.06.009. Epub 2006 Oct 17.
Automatic speech recognition (ASR) can provide a rapid means of controlling electronic assistive technology. Off-the-shelf ASR systems function poorly for users with severe dysarthria because of the increased variability of their articulations. We have developed a limited vocabulary speaker dependent speech recognition application which has greater tolerance to variability of speech, coupled with a computerised training package which assists dysarthric speakers to improve the consistency of their vocalisations and provides more data for recogniser training. These applications, and their implementation as the interface for a speech-controlled environmental control system (ECS), are described. The results of field trials to evaluate the training program and the speech-controlled ECS are presented. The user-training phase increased the recognition rate from 88.5% to 95.4% (p<0.001). Recognition rates were good for people with even the most severe dysarthria in everyday usage in the home (mean word recognition rate 86.9%). Speech-controlled ECS were less accurate (mean task completion accuracy 78.6% versus 94.8%) but were faster to use than switch-scanning systems, even taking into account the need to repeat unsuccessful operations (mean task completion time 7.7s versus 16.9s, p<0.001). It is concluded that a speech-controlled ECS is a viable alternative to switch-scanning systems for some people with severe dysarthria and would lead, in many cases, to more efficient control of the home.
自动语音识别(ASR)可为控制电子辅助技术提供一种快速手段。由于发音变异性增加,现成的ASR系统对严重构音障碍患者的功能不佳。我们开发了一种有限词汇量的特定说话者语音识别应用程序,它对语音变异性具有更高的耐受性,同时还开发了一个计算机化训练包,可帮助构音障碍患者提高发声的一致性,并为识别器训练提供更多数据。本文描述了这些应用程序及其作为语音控制环境控制系统(ECS)接口的实现方式。文中还给出了评估训练程序和语音控制ECS的现场试验结果。用户训练阶段使识别率从88.5%提高到了95.4%(p<0.001)。对于在家中日常使用的严重构音障碍患者,识别率也很高(平均单词识别率86.9%)。语音控制的ECS准确性较低(平均任务完成准确率78.6%,而开关扫描系统为94.8%),但使用起来比开关扫描系统更快,即使考虑到需要重复未成功的操作(平均任务完成时间7.7秒对16.9秒,p<0.001)。得出的结论是,对于一些严重构音障碍患者,语音控制的ECS是开关扫描系统的可行替代方案,并且在许多情况下将导致对家庭环境更有效的控制管理。