Pohlmeyer Eric A, Mahmoudi Babak, Geng Shijia, Prins Noeline, Sanchez Justin C
Department of Biomedical Engineering, University of Miami, Coral Gables, FL 33146, USA.
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:4108-11. doi: 10.1109/EMBC.2012.6346870.
Here we demonstrate how a marmoset monkey can use a reinforcement learning (RL) Brain-Machine Interface (BMI) to effectively control the movements of a robot arm in a reaching task. In this work, an actor-critic RL algorithm used neural ensemble activity in the monkey's motor cortex to control the robot movements during a two-target decision task. This novel approach to decoding offers unique advantages for BMI control applications. Unlike supervised learning decoding methods, the actor-critic RL algorithm does not require an explicit set of training data to create a static control model; instead, it incrementally adapts the model parameters according to its current performance, in this case requiring only a very basic feedback signal. We show how this algorithm achieved high performance (94%) in mapping the monkey's neural states to robot actions, and needed only a few trials of experience before obtaining accurate real-time control of the robot arm. Since RL methods responsively adapt and adjust their parameters, they can provide a method for creating BMIs that are robust against perturbations caused by changes in either the neural input space or the output actions they generate under different task requirements or goals.
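The abstract does not give implementation details, but the update it describes can be illustrated with a minimal actor-critic sketch. The code below assumes a linear actor and critic over a binned firing-rate vector, a discrete two-action space matching the two-target task, and a binary reward standing in for the "very basic feedback signal"; the ensemble size, learning rates, and the toy neural-state generator are all illustrative, not the authors' actual decoder.

```python
import numpy as np

rng = np.random.default_rng(0)

N_NEURONS = 32               # hypothetical ensemble size
ACTIONS = ("left", "right")  # the two reach targets

# Linear actor (policy) and critic (value) weights -- illustrative initialization.
actor_w = np.zeros((len(ACTIONS), N_NEURONS))
critic_w = np.zeros(N_NEURONS)

ALPHA_ACTOR = 0.05   # hypothetical learning rates
ALPHA_CRITIC = 0.10

def softmax(z):
    z = z - z.max()          # numerical stability
    e = np.exp(z)
    return e / e.sum()

def step(neural_state, reward_fn):
    """One trial: sample an action from the current policy, observe the
    binary feedback, and update actor and critic incrementally."""
    global actor_w, critic_w
    probs = softmax(actor_w @ neural_state)
    a = rng.choice(len(ACTIONS), p=probs)

    reward = reward_fn(a)             # +1 for the correct target, 0 otherwise
    value = critic_w @ neural_state   # critic's estimate for this state
    td_error = reward - value         # single-step (episodic) TD error

    # Critic: move the value estimate toward the observed reward.
    critic_w = critic_w + ALPHA_CRITIC * td_error * neural_state

    # Actor: policy-gradient-style update, grad of log pi(a|s) = (onehot(a) - probs) x s.
    grad = -probs[:, None] * neural_state[None, :]
    grad[a] += neural_state
    actor_w = actor_w + ALPHA_ACTOR * td_error * grad
    return a, reward

# Toy demo: the intended target is encoded in the first "neuron".
for trial in range(200):
    target = rng.integers(2)
    s = rng.normal(size=N_NEURONS)
    s[0] = 2.0 if target == 0 else -2.0
    step(s, lambda a: 1.0 if a == target else 0.0)
```

Because the updates are per-trial and driven only by the scalar reward, a decoder of this form can keep adapting after deployment, which is the property the abstract highlights as the advantage over a statically trained supervised decoder.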