• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

后见之明经验回放改进了多输入多输出人体手臂运动骨骼肌肉模型的强化学习控制。

Hindsight Experience Replay Improves Reinforcement Learning for Control of a MIMO Musculoskeletal Model of the Human Arm.

出版信息

IEEE Trans Neural Syst Rehabil Eng. 2021;29:1016-1025. doi: 10.1109/TNSRE.2021.3081056. Epub 2021 Jun 8.

DOI:10.1109/TNSRE.2021.3081056
PMID:33999822
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8630802/
Abstract

High-level spinal cord injuries often result in paralysis of all four limbs, leading to decreased patient independence and quality of life. Coordinated functional electrical stimulation (FES) of paralyzed muscles can be used to restore some motor function in the upper extremity. To coordinate functional movements, FES controllers should be developed to exploit the complex characteristics of human movement and produce the intended movement kinematics and/or kinetics. Here, we demonstrate the ability of a controller trained using reinforcement learning to generate desired movements of a horizontal planar musculoskeletal model of the human arm with 2 degrees of freedom and 6 actuators. The controller is given information about the kinematics of the arm, but not the internal state of the actuators. In particular, we demonstrate that a technique called "hindsight experience replay" can improve controller performance while also decreasing controller training time.

摘要

高位脊髓损伤常导致四肢瘫痪,导致患者独立性和生活质量下降。瘫痪肌肉的协调功能性电刺激 (FES) 可用于恢复上肢的一些运动功能。为了协调功能运动,应开发 FES 控制器以利用人类运动的复杂特征并产生预期的运动运动学和/或动力学。在这里,我们展示了使用强化学习训练的控制器生成具有 2 个自由度和 6 个执行器的人体手臂水平平面肌肉骨骼模型的期望运动的能力。控制器提供有关手臂运动学的信息,但不提供执行器的内部状态。特别是,我们证明了一种称为“后见之明经验重放”的技术可以在提高控制器性能的同时减少控制器训练时间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/5b2fdaf66acc/nihms-1712878-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/2c1d4f4b5600/nihms-1712878-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/7e66e9d0e63b/nihms-1712878-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/431111b6e8e5/nihms-1712878-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/f7adb82a19ef/nihms-1712878-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/c0f2341e17bf/nihms-1712878-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/9e80ac0a236c/nihms-1712878-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/5b2fdaf66acc/nihms-1712878-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/2c1d4f4b5600/nihms-1712878-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/7e66e9d0e63b/nihms-1712878-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/431111b6e8e5/nihms-1712878-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/f7adb82a19ef/nihms-1712878-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/c0f2341e17bf/nihms-1712878-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/9e80ac0a236c/nihms-1712878-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b44/8630802/5b2fdaf66acc/nihms-1712878-f0007.jpg

相似文献

1
Hindsight Experience Replay Improves Reinforcement Learning for Control of a MIMO Musculoskeletal Model of the Human Arm.后见之明经验回放改进了多输入多输出人体手臂运动骨骼肌肉模型的强化学习控制。
IEEE Trans Neural Syst Rehabil Eng. 2021;29:1016-1025. doi: 10.1109/TNSRE.2021.3081056. Epub 2021 Jun 8.
2
Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。
IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.
3
Deep Reinforcement Learning for Control of Time-Varying Musculoskeletal Systems With High Fatigability: A Feasibility Study.基于强化学习的时变肌肉骨骼系统疲劳控制:一项可行性研究。
IEEE Trans Neural Syst Rehabil Eng. 2022;30:2613-2622. doi: 10.1109/TNSRE.2022.3203970. Epub 2022 Sep 19.
4
Improving the Learning Rate, Accuracy, and Workspace of Reinforcement Learning Controllers for a Musculoskeletal Model of the Human Arm.提高强化学习控制器在人类手臂骨骼肌肉模型中的学习率、准确性和工作空间。
IEEE Trans Neural Syst Rehabil Eng. 2022;30:30-39. doi: 10.1109/TNSRE.2021.3135471. Epub 2022 Jan 28.
5
A neural tracking and motor control approach to improve rehabilitation of upper limb movements.一种用于改善上肢运动康复的神经跟踪与运动控制方法。
J Neuroeng Rehabil. 2008 Feb 5;5:5. doi: 10.1186/1743-0003-5-5.
6
An optimized proportional-derivative controller for the human upper extremity with gravity.一种针对带重力的人体上肢的优化比例-微分控制器。
J Biomech. 2015 Oct 15;48(13):3692-700. doi: 10.1016/j.jbiomech.2015.08.016. Epub 2015 Aug 29.
7
Optimization and evaluation of a proportional derivative controller for planar arm movement.平面手臂运动比例微分控制器的优化与评估。
J Biomech. 2010 Apr 19;43(6):1086-91. doi: 10.1016/j.jbiomech.2009.12.017. Epub 2010 Jan 25.
8
Construction of efficacious gait and upper limb functional interventions based on brain plasticity evidence and model-based measures for stroke patients.基于脑可塑性证据和基于模型的测量方法,为中风患者构建有效的步态和上肢功能干预措施。
ScientificWorldJournal. 2007 Dec 20;7:2031-45. doi: 10.1100/tsw.2007.299.
9
The Impact of Hill-Type Actuator Components on the Performance of Reinforcement Learning Controllers to Reverse Upper-Limb Paralysis.希尔型致动器组件对用于逆转上肢瘫痪的强化学习控制器性能的影响。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:2883-2892. doi: 10.1109/TNSRE.2023.3285983. Epub 2023 Jul 10.
10
Combined feedforward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system.冗余、非线性、动态肌肉骨骼系统的前馈与反馈联合控制
Med Biol Eng Comput. 2009 May;47(5):533-42. doi: 10.1007/s11517-009-0479-3. Epub 2009 Apr 3.

引用本文的文献

1
A Musculoskeletal Model of the Hand and Wrist Capable of Simulating Functional Tasks.一种能够模拟手部和腕部功能任务的肌肉骨骼模型。
IEEE Trans Biomed Eng. 2023 May;70(5):1424-1435. doi: 10.1109/TBME.2022.3217722. Epub 2023 Apr 20.
2
Generating Human Arm Kinematics Using Reinforcement Learning to Train Active Muscle Behavior in Automotive Research.使用强化学习生成人类手臂运动学,以训练汽车研究中的主动肌肉行为。
J Biomech Eng. 2022 Dec 1;144(12). doi: 10.1115/1.4055680.
3
Improving the Learning Rate, Accuracy, and Workspace of Reinforcement Learning Controllers for a Musculoskeletal Model of the Human Arm.

本文引用的文献

1
Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。
IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.
2
Restoration of reaching and grasping movements through brain-controlled muscle stimulation in a person with tetraplegia: a proof-of-concept demonstration.脑控肌肉刺激恢复四肢瘫痪患者的上肢运动:概念验证研究。
Lancet. 2017 May 6;389(10081):1821-1830. doi: 10.1016/S0140-6736(17)30601-3. Epub 2017 Mar 28.
3
Multi-muscle FES force control of the human arm for arbitrary goals.
提高强化学习控制器在人类手臂骨骼肌肉模型中的学习率、准确性和工作空间。
IEEE Trans Neural Syst Rehabil Eng. 2022;30:30-39. doi: 10.1109/TNSRE.2021.3135471. Epub 2022 Jan 28.
多肌肉 FES 对人体手臂的力控制,用于任意目标。
IEEE Trans Neural Syst Rehabil Eng. 2014 May;22(3):654-63. doi: 10.1109/TNSRE.2013.2282903. Epub 2013 Oct 7.
4
Combined feedforward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system.冗余、非线性、动态肌肉骨骼系统的前馈与反馈联合控制
Med Biol Eng Comput. 2009 May;47(5):533-42. doi: 10.1007/s11517-009-0479-3. Epub 2009 Apr 3.
5
Joint angle control by FES using a feedback error learning controller.使用反馈误差学习控制器通过功能性电刺激进行关节角度控制。
IEEE Trans Neural Syst Rehabil Eng. 2005 Sep;13(3):359-71. doi: 10.1109/TNSRE.2005.847355.
6
The role of multisensor data fusion in neuromuscular control of a sagittal arm with a pair of muscles using actor-critic reinforcement learning method.多传感器数据融合在使用演员-评论家强化学习方法对具有一对肌肉的矢状臂进行神经肌肉控制中的作用。
Technol Health Care. 2004;12(6):425-38.
7
Estimation of musculotendon properties in the human upper limb.人体上肢肌肉肌腱特性的评估。
Ann Biomed Eng. 2003 Feb;31(2):207-20. doi: 10.1114/1.1540105.
8
The functional impact of the Freehand System on tetraplegic hand function. Clinical Results.徒手系统对四肢瘫手功能的功能影响。临床结果。
Spinal Cord. 2002 Nov;40(11):560-6. doi: 10.1038/sj.sc.3101373.
9
Computational nature of human adaptive control during learning of reaching movements in force fields.在力场中学习伸手动作时人类适应性控制的计算本质。
Biol Cybern. 1999 Jul;81(1):39-60. doi: 10.1007/s004220050543.
10
Speed/accuracy trade-offs in target-directed movements.目标导向运动中的速度/准确性权衡。
Behav Brain Sci. 1997 Jun;20(2):279-303; discussion 303-49. doi: 10.1017/s0140525x97001441.