基于演员-批评家强化学习的脑机接口中使用神经生物学反馈的置信度度量。

A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces.

机构信息

Department of Biomedical Engineering, University of Miami Coral Gables, FL, USA.

Department of Biomedical Engineering, University of Miami Coral Gables, FL, USA ; Department of Neuroscience, University of Miami Coral Gables, FL, USA ; Miami Project to Cure Paralysis, University of Miami Coral Gables, FL, USA.

出版信息

Front Neurosci. 2014 May 26;8:111. doi: 10.3389/fnins.2014.00111. eCollection 2014.

DOI:10.3389/fnins.2014.00111

PMID:24904257

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4033619/

Abstract

Brain-Machine Interfaces (BMIs) can be used to restore function in people living with paralysis. Current BMIs require extensive calibration that increase the set-up times and external inputs for decoder training that may be difficult to produce in paralyzed individuals. Both these factors have presented challenges in transitioning the technology from research environments to activities of daily living (ADL). For BMIs to be seamlessly used in ADL, these issues should be handled with minimal external input thus reducing the need for a technician/caregiver to calibrate the system. Reinforcement Learning (RL) based BMIs are a good tool to be used when there is no external training signal and can provide an adaptive modality to train BMI decoders. However, RL based BMIs are sensitive to the feedback provided to adapt the BMI. In actor-critic BMIs, this feedback is provided by the critic and the overall system performance is limited by the critic accuracy. In this work, we developed an adaptive BMI that could handle inaccuracies in the critic feedback in an effort to produce more accurate RL based BMIs. We developed a confidence measure, which indicated how appropriate the feedback is for updating the decoding parameters of the actor. The results show that with the new update formulation, the critic accuracy is no longer a limiting factor for the overall performance. We tested and validated the system onthree different data sets: synthetic data generated by an Izhikevich neural spiking model, synthetic data with a Gaussian noise distribution, and data collected from a non-human primate engaged in a reaching task. All results indicated that the system with the critic confidence built in always outperformed the system without the critic confidence. Results of this study suggest the potential application of the technique in developing an autonomous BMI that does not need an external signal for training or extensive calibration.

摘要

脑机接口（BMI）可用于恢复瘫痪患者的功能。当前的 BMI 需要进行广泛的校准，这增加了解码器训练的设置时间和外部输入，而瘫痪患者可能难以产生这些输入。这两个因素都给该技术从研究环境向日常生活活动（ADL）的过渡带来了挑战。为了使 BMI 在 ADL 中无缝使用，应该以最小的外部输入来处理这些问题，从而减少技术人员/护理人员校准系统的需求。基于强化学习（RL）的 BMI 是在没有外部训练信号时的一种很好的工具，并且可以为训练 BMI 解码器提供一种自适应模式。然而，基于 RL 的 BMI 对提供给适应 BMI 的反馈很敏感。在演员-评论家 BMI 中，该反馈由评论家提供，并且整个系统性能受到评论家准确性的限制。在这项工作中，我们开发了一种自适应 BMI，它可以处理评论家反馈中的不准确性，以产生更准确的基于 RL 的 BMI。我们开发了一种置信度度量，它指示反馈对于更新演员的解码参数有多合适。结果表明，使用新的更新公式，评论家的准确性不再是整体性能的限制因素。我们在三个不同的数据集上测试和验证了该系统：由 Izhikevich 神经尖峰模型生成的合成数据、具有高斯噪声分布的合成数据以及从参与伸手任务的非人类灵长类动物收集的数据。所有结果均表明，内置评论家置信度的系统始终优于没有评论家置信度的系统。这项研究的结果表明，该技术有可能开发出不需要外部信号进行训练或广泛校准的自主 BMI。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/07ad6a2c134d/fnins-08-00111-g0001.jpg

相似文献

A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces.

Front Neurosci. 2014 May 26;8:111. doi: 10.3389/fnins.2014.00111. eCollection 2014.

Feedback for reinforcement learning based brain-machine interfaces using confidence metrics.

J Neural Eng. 2017 Jun;14(3):036016. doi: 10.1088/1741-2552/aa6317. Epub 2017 Feb 27.

Brain-Machine Interface control of a robot arm using actor-critic rainforcement learning.

Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:4108-11. doi: 10.1109/EMBC.2012.6346870.

Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces.

IEEE Trans Neural Syst Rehabil Eng. 2020 Dec;28(12):3089-3099. doi: 10.1109/TNSRE.2020.3039970. Epub 2021 Jan 28.

Using reinforcement learning to provide stable brain-machine interface control despite neural input reorganization.

PLoS One. 2014 Jan 30;9(1):e87253. doi: 10.1371/journal.pone.0087253. eCollection 2014.

A Kernel Reinforcement Learning Decoding Framework Integrating Neural and Feedback Signals for Brain Control.

Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340203.

A symbiotic brain-machine interface through value-based decision making.

PLoS One. 2011 Mar 14;6(3):e14760. doi: 10.1371/journal.pone.0014760.

Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.

Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022.

Audio-induced medial prefrontal cortical dynamics enhances coadaptive learning in brain-machine interfaces.

J Neural Eng. 2023 Oct 17;20(5). doi: 10.1088/1741-2552/ad017d.

Near Perfect Neural Critic from Motor Cortical Activity Toward an Autonomously Updating Brain Machine Interface.

Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:73-76. doi: 10.1109/EMBC.2018.8512274.

引用本文的文献

Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.

Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022.

A New Frontier: The Convergence of Nanotechnology, Brain Machine Interfaces, and Artificial Intelligence.

Front Neurosci. 2018 Nov 16;12:843. doi: 10.3389/fnins.2018.00843. eCollection 2018.

Evolutionary algorithm optimization of biological learning parameters in a biomimetic neuroprosthesis.

IBM J Res Dev. 2017 Mar-May;61(2-3):6.1-6.14. doi: 10.1147/JRD.2017.2656758. Epub 2017 May 23.

Common marmoset (Callithrix jacchus) as a primate model for behavioral neuroscience studies.

J Neurosci Methods. 2017 Jun 1;284:35-46. doi: 10.1016/j.jneumeth.2017.04.004. Epub 2017 Apr 8.

Cortical Spiking Network Interfaced with Virtual Musculoskeletal Arm and Robotic Arm.

Front Neurorobot. 2015 Nov 25;9:13. doi: 10.3389/fnbot.2015.00013. eCollection 2015.

Editorial: Biosignal processing and computational methods to enhance sensory motor neuroprosthetics.

Front Neurosci. 2015 Nov 5;9:434. doi: 10.3389/fnins.2015.00434. eCollection 2015.

本文引用的文献

Using reinforcement learning to provide stable brain-machine interface control despite neural input reorganization.

PLoS One. 2014 Jan 30;9(1):e87253. doi: 10.1371/journal.pone.0087253. eCollection 2014.

Feature extraction and unsupervised classification of neural population reward signals for reinforcement based BMI.

Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:5250-3. doi: 10.1109/EMBC.2013.6610733.

Towards autonomous neuroprosthetic control using Hebbian reinforcement learning.

J Neural Eng. 2013 Dec;10(6):066005. doi: 10.1088/1741-2560/10/6/066005. Epub 2013 Oct 8.

Long term, stable brain machine interface performance using local field potentials and multiunit spikes.

J Neural Eng. 2013 Oct;10(5):056005. doi: 10.1088/1741-2560/10/5/056005. Epub 2013 Aug 5.

High-performance neuroprosthetic control by an individual with tetraplegia.

Lancet. 2013 Feb 16;381(9866):557-64. doi: 10.1016/S0140-6736(12)61816-9. Epub 2012 Dec 17.

Unsupervised adaptation of brain-machine interface decoders.

Front Neurosci. 2012 Nov 16;6:164. doi: 10.3389/fnins.2012.00164. eCollection 2012.

A high-performance neural prosthesis enabled by control algorithm design.

Nat Neurosci. 2012 Dec;15(12):1752-7. doi: 10.1038/nn.3265. Epub 2012 Nov 18.

Comprehensive characterization and failure modes of tungsten microwire arrays in chronic neural implants.

J Neural Eng. 2012 Oct;9(5):056015. doi: 10.1088/1741-2560/9/5/056015. Epub 2012 Sep 25.

A neurally-interfaced hand prosthesis tuned inter-hemispheric communication.

Restor Neurol Neurosci. 2012;30(5):407-18. doi: 10.3233/RNN-2012-120224.

Reach and grasp by people with tetraplegia using a neurally controlled robotic arm.

Nature. 2012 May 16;485(7398):372-5. doi: 10.1038/nature11076.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于演员-批评家强化学习的脑机接口中使用神经生物学反馈的置信度度量。

A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献