Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:690-693. doi: 10.1109/EMBC48229.2022.9871669.
Brain-Machine Interface (BMI) translates paralyzed people's neural activity into control commands of the prosthesis so that their lost motor functions could be restored. The neural activities represent brain states that change continuously over time which brings the challenge to the online decoder. Reinforcement Learning (RL) has the advantage to construct the dynamic neural-kinematic mapping during the interaction. However, existing RL decoders output discrete actions as a classification problem and cannot provide continuous estimation. Previous work has combined Kalman Filter (KF) with RL for BMI, which achieves a continuous motor state estimation. However, this method adopts a neural network structure, which might get stuck in local optimum and cannot provide an efficient online update for the neural-kinematic mapping. In this paper, we propose a Cluster Kernel Reinforcement Learning-based Kalman Filter (CKRL-based KF) to avoid the local optimum problem for online neural-kinematic updating. The neural patterns are projected into Reproducing Kernel Hilbert Space (RKHS), which builds a universal approximation to guarantee the global optimum. We compare our proposed algorithm with the existing method on rat data collected during a brain control three-lever discrimination task. Our preliminary results show that the proposed method has a higher trial accuracy with lower variance across data segments, which shows its potential to improve the performance for online BMI control. Clinical Relevance- This paper provides a more stable decoding method for adaptive and continuous neural decoding. It is promising for clinical applications in BMI.
脑机接口 (BMI) 将瘫痪患者的神经活动转化为假肢的控制命令,从而恢复他们失去的运动功能。神经活动代表随时间不断变化的大脑状态,这给在线解码器带来了挑战。强化学习 (RL) 具有在交互过程中构建动态神经运动映射的优势。然而,现有的 RL 解码器将离散动作作为分类问题输出,无法提供连续估计。先前的工作已经将卡尔曼滤波器 (KF) 与 BMI 中的 RL 相结合,实现了连续的运动状态估计。然而,这种方法采用神经网络结构,可能会陷入局部最优,无法为神经运动映射提供有效的在线更新。在本文中,我们提出了一种基于聚类核强化学习的卡尔曼滤波器 (CKRL-based KF),以避免在线神经运动更新中的局部最优问题。神经模式被投影到再生核希尔伯特空间 (RKHS) 中,这建立了一个通用逼近,以保证全局最优。我们在大鼠脑控制三杆辨别任务中收集的数据上比较了我们提出的算法和现有的方法。我们的初步结果表明,该方法在数据段之间具有更高的试验准确率和更低的方差,这表明它有潜力提高在线 BMI 控制的性能。临床相关性-本文为自适应和连续神经解码提供了一种更稳定的解码方法。它有望在 BMI 的临床应用中得到应用。