Tates A, Matran-Fernandez A, Halder S, Daly I
Brain-Computer Interfaces and Neural Engineering Lab, School of Computer Science and Electronic Engineering, University of Essex, Colchester, United kingdom.
J Neural Eng. 2025 Jun 26;22(3). doi: 10.1088/1741-2552/ade28e.
Speech Imagery (SI) refers to the mental experience of hearing speech and may be the core of verbal thinking for people who undergo internal monologues. It belongs to the set of possible mental imagery states that produce kinesthetic experiences whose sensations are similar to their non-imagery counterparts. SI underpins language processes and may have similar building blocks to overt speech without the final articulatory outcome. The kinesthetic experience of SI has been proposed to be a projection of the expected articulatory outcome in a top-down processing manner. As SI seems to be a core human cognitive task it has been proposed as a paradigm for Brain-Computer Interfaces (BCI). One important aspect of BCI designs is usability, and SI may present an intuitive paradigm, which has brought the attention of researchers to attempt to decode SI from brain signals. In this paper we review the important aspects of SI-BCI decoding pipelines.. We conducted this review according to the Preferred Reporting Items for Systematic reviews and Meta-Analysis guidelines. Specifically, we filtered peer-reviewed reports via a search of Google Scholar and PubMed. We selected a total of 104 reports that attempted to decode SI from neural activity.. Our review reveals a growing interest in SI decoding in the last 20 years, and shows how different neuroimaging modalities have been employed to record SI in distinct ways to instruct participants to perform this task. We discuss the signal processing methods used along with feature extraction techniques and found a high preference for Deep Learning models. We have summarized and compared the decoding attempts by quantifying the efficacy of decoding by measuring Information Transfer Rates. Notably, fewer than 6% of studies reported real-time decoding, with the vast majority focused on offline analyses. This suggests existing challenges of this paradigm, as the variety of approaches and outcomes prevents a clear identification of the field's current state-of-the-art. We offer a discussion of future research directions.SI is an attractive BCI paradigm. This review outlines the increasing interest in SI, the methodological trends, the efficacy of different approaches, and the current progress toward real-time decoding systems.
言语表象(SI)是指听到言语的心理体验,对于进行内部独白的人来说,它可能是言语思维的核心。它属于一组可能的心理表象状态,这些状态会产生动觉体验,其感觉与非表象对应物相似。SI是语言过程的基础,可能具有与公开言语相似的构建块,但没有最终的发音结果。SI的动觉体验被认为是以自上而下的方式对预期发音结果的投射。由于SI似乎是一项核心的人类认知任务,它已被提议作为脑机接口(BCI)的一种范式。BCI设计的一个重要方面是可用性,而SI可能提供一种直观的范式,这引起了研究人员试图从脑信号中解码SI的关注。在本文中,我们回顾了SI-BCI解码流程的重要方面。我们根据系统评价和Meta分析的首选报告项目指南进行了这项综述。具体来说,我们通过在谷歌学术和PubMed上搜索来筛选同行评审报告。我们总共选择了104份试图从神经活动中解码SI的报告。我们的综述揭示了在过去20年中对SI解码的兴趣日益浓厚,并展示了如何采用不同的神经成像方式以不同的方式记录SI来指导参与者执行这项任务。我们讨论了所使用的信号处理方法以及特征提取技术,发现对深度学习模型有很高的偏好。我们通过测量信息传输率来量化解码效果,总结并比较了解码尝试。值得注意的是,不到6%的研究报告了实时解码,绝大多数研究集中在离线分析上。这表明了这种范式存在的现有挑战,因为方法和结果的多样性阻碍了对该领域当前技术水平的清晰识别。我们对未来的研究方向进行了讨论。SI是一种有吸引力的BCI范式。本综述概述了对SI日益增长的兴趣、方法学趋势、不同方法的效果以及实时解码系统的当前进展。