• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于演员-批评家强化学习的脑机接口中使用神经生物学反馈的置信度度量。

A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces.

机构信息

Department of Biomedical Engineering, University of Miami Coral Gables, FL, USA.

Department of Biomedical Engineering, University of Miami Coral Gables, FL, USA ; Department of Neuroscience, University of Miami Coral Gables, FL, USA ; Miami Project to Cure Paralysis, University of Miami Coral Gables, FL, USA.

出版信息

Front Neurosci. 2014 May 26;8:111. doi: 10.3389/fnins.2014.00111. eCollection 2014.

DOI:10.3389/fnins.2014.00111
PMID:24904257
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4033619/
Abstract

Brain-Machine Interfaces (BMIs) can be used to restore function in people living with paralysis. Current BMIs require extensive calibration that increase the set-up times and external inputs for decoder training that may be difficult to produce in paralyzed individuals. Both these factors have presented challenges in transitioning the technology from research environments to activities of daily living (ADL). For BMIs to be seamlessly used in ADL, these issues should be handled with minimal external input thus reducing the need for a technician/caregiver to calibrate the system. Reinforcement Learning (RL) based BMIs are a good tool to be used when there is no external training signal and can provide an adaptive modality to train BMI decoders. However, RL based BMIs are sensitive to the feedback provided to adapt the BMI. In actor-critic BMIs, this feedback is provided by the critic and the overall system performance is limited by the critic accuracy. In this work, we developed an adaptive BMI that could handle inaccuracies in the critic feedback in an effort to produce more accurate RL based BMIs. We developed a confidence measure, which indicated how appropriate the feedback is for updating the decoding parameters of the actor. The results show that with the new update formulation, the critic accuracy is no longer a limiting factor for the overall performance. We tested and validated the system onthree different data sets: synthetic data generated by an Izhikevich neural spiking model, synthetic data with a Gaussian noise distribution, and data collected from a non-human primate engaged in a reaching task. All results indicated that the system with the critic confidence built in always outperformed the system without the critic confidence. Results of this study suggest the potential application of the technique in developing an autonomous BMI that does not need an external signal for training or extensive calibration.

摘要

脑机接口(BMI)可用于恢复瘫痪患者的功能。当前的 BMI 需要进行广泛的校准,这增加了解码器训练的设置时间和外部输入,而瘫痪患者可能难以产生这些输入。这两个因素都给该技术从研究环境向日常生活活动(ADL)的过渡带来了挑战。为了使 BMI 在 ADL 中无缝使用,应该以最小的外部输入来处理这些问题,从而减少技术人员/护理人员校准系统的需求。基于强化学习(RL)的 BMI 是在没有外部训练信号时的一种很好的工具,并且可以为训练 BMI 解码器提供一种自适应模式。然而,基于 RL 的 BMI 对提供给适应 BMI 的反馈很敏感。在演员-评论家 BMI 中,该反馈由评论家提供,并且整个系统性能受到评论家准确性的限制。在这项工作中,我们开发了一种自适应 BMI,它可以处理评论家反馈中的不准确性,以产生更准确的基于 RL 的 BMI。我们开发了一种置信度度量,它指示反馈对于更新演员的解码参数有多合适。结果表明,使用新的更新公式,评论家的准确性不再是整体性能的限制因素。我们在三个不同的数据集上测试和验证了该系统:由 Izhikevich 神经尖峰模型生成的合成数据、具有高斯噪声分布的合成数据以及从参与伸手任务的非人类灵长类动物收集的数据。所有结果均表明,内置评论家置信度的系统始终优于没有评论家置信度的系统。这项研究的结果表明,该技术有可能开发出不需要外部信号进行训练或广泛校准的自主 BMI。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/ed827883f04f/fnins-08-00111-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/07ad6a2c134d/fnins-08-00111-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/236909bafe33/fnins-08-00111-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/14458eb3faaa/fnins-08-00111-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/8fd83ef947f1/fnins-08-00111-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/a71b95ec8bb4/fnins-08-00111-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/ed827883f04f/fnins-08-00111-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/07ad6a2c134d/fnins-08-00111-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/236909bafe33/fnins-08-00111-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/14458eb3faaa/fnins-08-00111-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/8fd83ef947f1/fnins-08-00111-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/a71b95ec8bb4/fnins-08-00111-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df03/4033619/ed827883f04f/fnins-08-00111-g0006.jpg

相似文献

1
A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces.基于演员-批评家强化学习的脑机接口中使用神经生物学反馈的置信度度量。
Front Neurosci. 2014 May 26;8:111. doi: 10.3389/fnins.2014.00111. eCollection 2014.
2
Feedback for reinforcement learning based brain-machine interfaces using confidence metrics.基于置信度指标的用于脑机接口的强化学习反馈
J Neural Eng. 2017 Jun;14(3):036016. doi: 10.1088/1741-2552/aa6317. Epub 2017 Feb 27.
3
Brain-Machine Interface control of a robot arm using actor-critic rainforcement learning.使用演员-评论家强化学习对机器人手臂进行脑机接口控制。
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:4108-11. doi: 10.1109/EMBC.2012.6346870.
4
Task Learning Over Multi-Day Recording via Internally Rewarded Reinforcement Learning Based Brain Machine Interfaces.基于内部奖励强化学习的多日记录任务学习脑机接口。
IEEE Trans Neural Syst Rehabil Eng. 2020 Dec;28(12):3089-3099. doi: 10.1109/TNSRE.2020.3039970. Epub 2021 Jan 28.
5
Using reinforcement learning to provide stable brain-machine interface control despite neural input reorganization.利用强化学习在神经输入重组的情况下提供稳定的脑机接口控制。
PLoS One. 2014 Jan 30;9(1):e87253. doi: 10.1371/journal.pone.0087253. eCollection 2014.
6
A Kernel Reinforcement Learning Decoding Framework Integrating Neural and Feedback Signals for Brain Control.一种整合神经和反馈信号的核强化学习解码框架,用于脑控。
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340203.
7
A symbiotic brain-machine interface through value-based decision making.通过基于价值的决策实现共生的脑机接口。
PLoS One. 2011 Mar 14;6(3):e14760. doi: 10.1371/journal.pone.0014760.
8
Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.脑机接口中使用强化学习的神经解码器:技术综述
Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022.
9
Audio-induced medial prefrontal cortical dynamics enhances coadaptive learning in brain-machine interfaces.音频诱导的内侧前额叶皮质动态增强了脑机接口中的共同适应学习。
J Neural Eng. 2023 Oct 17;20(5). doi: 10.1088/1741-2552/ad017d.
10
Near Perfect Neural Critic from Motor Cortical Activity Toward an Autonomously Updating Brain Machine Interface.基于运动皮层活动构建近乎完美的神经评判器以实现自主更新的脑机接口
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:73-76. doi: 10.1109/EMBC.2018.8512274.

引用本文的文献

1
Neural Decoders Using Reinforcement Learning in Brain Machine Interfaces: A Technical Review.脑机接口中使用强化学习的神经解码器:技术综述
Front Syst Neurosci. 2022 Aug 26;16:836778. doi: 10.3389/fnsys.2022.836778. eCollection 2022.
2
A New Frontier: The Convergence of Nanotechnology, Brain Machine Interfaces, and Artificial Intelligence.一个新的前沿领域:纳米技术、脑机接口与人工智能的融合
Front Neurosci. 2018 Nov 16;12:843. doi: 10.3389/fnins.2018.00843. eCollection 2018.
3
Evolutionary algorithm optimization of biological learning parameters in a biomimetic neuroprosthesis.

本文引用的文献

1
Using reinforcement learning to provide stable brain-machine interface control despite neural input reorganization.利用强化学习在神经输入重组的情况下提供稳定的脑机接口控制。
PLoS One. 2014 Jan 30;9(1):e87253. doi: 10.1371/journal.pone.0087253. eCollection 2014.
2
Feature extraction and unsupervised classification of neural population reward signals for reinforcement based BMI.用于基于强化的脑机接口的神经群体奖励信号的特征提取与无监督分类
Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:5250-3. doi: 10.1109/EMBC.2013.6610733.
3
Towards autonomous neuroprosthetic control using Hebbian reinforcement learning.
仿生神经假体中生物学习参数的进化算法优化
IBM J Res Dev. 2017 Mar-May;61(2-3):6.1-6.14. doi: 10.1147/JRD.2017.2656758. Epub 2017 May 23.
4
Common marmoset (Callithrix jacchus) as a primate model for behavioral neuroscience studies.普通狨猴(绢毛猴)作为行为神经科学研究的灵长类动物模型。
J Neurosci Methods. 2017 Jun 1;284:35-46. doi: 10.1016/j.jneumeth.2017.04.004. Epub 2017 Apr 8.
5
Cortical Spiking Network Interfaced with Virtual Musculoskeletal Arm and Robotic Arm.与虚拟肌肉骨骼手臂和机器人手臂相连的皮质尖峰网络。
Front Neurorobot. 2015 Nov 25;9:13. doi: 10.3389/fnbot.2015.00013. eCollection 2015.
6
Editorial: Biosignal processing and computational methods to enhance sensory motor neuroprosthetics.社论:用于增强感觉运动神经假体的生物信号处理与计算方法
Front Neurosci. 2015 Nov 5;9:434. doi: 10.3389/fnins.2015.00434. eCollection 2015.
使用赫布强化学习实现自主神经假肢控制。
J Neural Eng. 2013 Dec;10(6):066005. doi: 10.1088/1741-2560/10/6/066005. Epub 2013 Oct 8.
4
Long term, stable brain machine interface performance using local field potentials and multiunit spikes.长期使用局部场电位和多单位尖峰的稳定脑机接口性能。
J Neural Eng. 2013 Oct;10(5):056005. doi: 10.1088/1741-2560/10/5/056005. Epub 2013 Aug 5.
5
High-performance neuroprosthetic control by an individual with tetraplegia.高位截瘫患者的高性能神经假体控制。
Lancet. 2013 Feb 16;381(9866):557-64. doi: 10.1016/S0140-6736(12)61816-9. Epub 2012 Dec 17.
6
Unsupervised adaptation of brain-machine interface decoders.无监督的脑机接口解码器自适应。
Front Neurosci. 2012 Nov 16;6:164. doi: 10.3389/fnins.2012.00164. eCollection 2012.
7
A high-performance neural prosthesis enabled by control algorithm design.通过控制算法设计实现高性能神经假体。
Nat Neurosci. 2012 Dec;15(12):1752-7. doi: 10.1038/nn.3265. Epub 2012 Nov 18.
8
Comprehensive characterization and failure modes of tungsten microwire arrays in chronic neural implants.慢性神经植入物中钨丝微阵列的综合特征和失效模式。
J Neural Eng. 2012 Oct;9(5):056015. doi: 10.1088/1741-2560/9/5/056015. Epub 2012 Sep 25.
9
A neurally-interfaced hand prosthesis tuned inter-hemispheric communication.神经接口手假肢调节半球间通信。
Restor Neurol Neurosci. 2012;30(5):407-18. doi: 10.3233/RNN-2012-120224.
10
Reach and grasp by people with tetraplegia using a neurally controlled robotic arm.四肢瘫痪患者使用神经控制的机器臂进行触及和抓握。
Nature. 2012 May 16;485(7398):372-5. doi: 10.1038/nature11076.