Department of Electrical Engineering, University of Cambridge, Cambridge CB3 0FA, UK.
Sensors (Basel). 2021 Dec 31;22(1):299. doi: 10.3390/s22010299.
Silent speech recognition is the ability to recognise intended speech without audio information. Useful applications can be found in situations where sound waves are not produced or cannot be heard. Examples include speakers with physical voice impairments or environments in which audio transference is not reliable or secure. Developing a device which can detect non-auditory signals and map them to intended phonation could be used to develop a device to assist in such situations. In this work, we propose a graphene-based strain gauge sensor which can be worn on the throat and detect small muscle movements and vibrations. Machine learning algorithms then decode the non-audio signals and create a prediction on intended speech. The proposed strain gauge sensor is highly wearable, utilising graphene's unique and beneficial properties including strength, flexibility and high conductivity. A highly flexible and wearable sensor able to pick up small throat movements is fabricated by screen printing graphene onto lycra fabric. A framework for interpreting this information is proposed which explores the use of several machine learning techniques to predict intended words from the signals. A dataset of 15 unique words and four movements, each with 20 repetitions, was developed and used for the training of the machine learning algorithms. The results demonstrate the ability for such sensors to be able to predict spoken words. We produced a word accuracy rate of 55% on the word dataset and 85% on the movements dataset. This work demonstrates a proof-of-concept for the viability of combining a highly wearable graphene strain gauge and machine leaning methods to automate silent speech recognition.
无声语音识别是指在没有音频信息的情况下识别预期语音的能力。在无法产生或无法听到声波的情况下,这种能力非常有用。例如,对于有身体语音障碍的说话者,或者在音频传输不可靠或不安全的环境中,都可以使用这种技术。开发一种能够检测非听觉信号并将其映射到预期发音的设备,可以用来开发一种辅助此类情况的设备。在这项工作中,我们提出了一种基于石墨烯的应变计传感器,它可以戴在喉咙上,检测微小的肌肉运动和振动。然后,机器学习算法对非音频信号进行解码,并对预期语音进行预测。所提出的应变计传感器具有高度的可穿戴性,利用了石墨烯独特的有益特性,包括强度、柔韧性和高导电性。通过将石墨烯丝网印刷到莱卡织物上,制造出一种高度灵活和可穿戴的传感器,可以检测到微小的喉咙运动。提出了一种解释这种信息的框架,该框架探索了使用几种机器学习技术从信号中预测预期单词的方法。开发了一个包含 15 个独特单词和 4 个动作的数据集,每个单词重复 20 次,用于训练机器学习算法。结果表明,这种传感器能够预测口语单词。我们在单词数据集中获得了 55%的单词准确率,在动作数据集中获得了 85%的准确率。这项工作证明了将高度可穿戴的石墨烯应变计和机器学习方法相结合以实现无声语音识别的可行性。