• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于肌肉张力实现人体手臂姿态稳定的高效强化学习智能体。

Efficient Actor-Critic Reinforcement Learning With Embodiment of Muscle Tone for Posture Stabilization of the Human Arm.

机构信息

Toyota Central R&D Labs., Aichi 480-1192 Japan

出版信息

Neural Comput. 2021 Jan;33(1):129-156. doi: 10.1162/neco_a_01333. Epub 2020 Oct 20.

DOI:10.1162/neco_a_01333
PMID:33080164
Abstract

This letter proposes a new idea to improve learning efficiency in reinforcement learning (RL) with the actor-critic method used as a muscle controller for posture stabilization of the human arm. Actor-critic RL (ACRL) is used for simulations to realize posture controls in humans or robots using muscle tension control. However, it requires very high computational costs to acquire a better muscle control policy for desirable postures. For efficient ACRL, we focused on embodiment that is supposed to potentially achieve efficient controls in research fields of artificial intelligence or robotics. According to the neurophysiology of motion control obtained from experimental studies using animals or humans, the pedunculopontine tegmental nucleus (PPTn) induces muscle tone suppression, and the midbrain locomotor region (MLR) induces muscle tone promotion. PPTn and MLR modulate the activation levels of mutually antagonizing muscles such as flexors and extensors in a process through which control signals are translated from the substantia nigra reticulata to the brain stem. Therefore, we hypothesized that the PPTn and MLR could control muscle tone, that is, the maximum values of activation levels of mutually antagonizing muscles using different sigmoidal functions for each muscle; then we introduced antagonism function models (AFMs) of PPTn and MLR for individual muscles, incorporating the hypothesis into the process to determine the activation level of each muscle based on the output of the actor in ACRL. ACRL with AFMs representing the embodiment of muscle tone successfully achieved posture stabilization in five joint motions of the right arm of a human adult male under gravity in predetermined target angles at an earlier period of learning than the learning methods without AFMs. The results obtained from this study suggest that the introduction of embodiment of muscle tone can enhance learning efficiency in posture stabilization disorders of humans or humanoid robots.

摘要

这封信提出了一个新的想法,即在强化学习(RL)中使用动作-评价器方法,将其作为人类手臂姿势稳定的肌肉控制器,以提高学习效率。使用肌肉张力控制来实现人类或机器人的姿势控制,这是动作-评价器 RL(ACRL)的模拟。然而,为了获得更好的肌肉控制策略来实现理想的姿势,这需要非常高的计算成本。为了实现高效的 ACRL,我们专注于体现,这有望在人工智能或机器人学的研究领域实现高效控制。根据使用动物或人类进行实验研究获得的运动控制神经生理学,脚桥核被盖部(PPTn)抑制肌肉张力,中脑运动区(MLR)促进肌肉张力。PPTn 和 MLR 调节相互拮抗的肌肉(如屈肌和伸肌)的激活水平,控制信号通过黑质网状部传递到脑干。因此,我们假设 PPTn 和 MLR 可以控制肌肉张力,即使用每个肌肉的不同 sigmoid 函数来控制相互拮抗的肌肉的最大激活水平;然后,我们为每个肌肉引入了 PPTn 和 MLR 的拮抗作用函数模型(AFMs),将假设纳入基于 ACRL 中的评价器的输出来确定每个肌肉的激活水平的过程中。代表肌肉张力体现的具有 AFMs 的 ACRL 在学习早期成功地实现了成年男性右上肢五个关节运动在预定目标角度下的重力姿势稳定,而没有 AFMs 的学习方法则无法实现。这项研究的结果表明,引入肌肉张力体现可以提高人类或类人机器人姿势稳定障碍的学习效率。

相似文献

1
Efficient Actor-Critic Reinforcement Learning With Embodiment of Muscle Tone for Posture Stabilization of the Human Arm.基于肌肉张力实现人体手臂姿态稳定的高效强化学习智能体。
Neural Comput. 2021 Jan;33(1):129-156. doi: 10.1162/neco_a_01333. Epub 2020 Oct 20.
2
The role of multisensor data fusion in neuromuscular control of a sagittal arm with a pair of muscles using actor-critic reinforcement learning method.多传感器数据融合在使用演员-评论家强化学习方法对具有一对肌肉的矢状臂进行神经肌肉控制中的作用。
Technol Health Care. 2004;12(6):425-38.
3
Meta attention for Off-Policy Actor-Critic.用于离策略演员-评论家的元注意力机制
Neural Netw. 2023 Jun;163:86-96. doi: 10.1016/j.neunet.2023.03.024. Epub 2023 Mar 28.
4
Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。
IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.
5
Neuromuscular control of the point to point and oscillatory movements of a sagittal arm with the actor-critic reinforcement learning method.基于演员-评论家强化学习方法对矢状臂点对点运动和振荡运动的神经肌肉控制
Comput Methods Biomech Biomed Engin. 2005 Apr;8(2):103-13. doi: 10.1080/10255840500167952.
6
Brain-Machine Interface control of a robot arm using actor-critic rainforcement learning.使用演员-评论家强化学习对机器人手臂进行脑机接口控制。
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:4108-11. doi: 10.1109/EMBC.2012.6346870.
7
Basal ganglia efferents to the brainstem centers controlling postural muscle tone and locomotion: a new concept for understanding motor disorders in basal ganglia dysfunction.基底神经节向控制姿势肌张力和运动的脑干中枢发出的传出纤维:理解基底神经节功能障碍中运动障碍的一个新概念。
Neuroscience. 2003;119(1):293-308. doi: 10.1016/s0306-4522(03)00095-2.
8
Learning arm's posture control using reinforcement learning and feedback-error-learning.
Conf Proc IEEE Eng Med Biol Soc. 2004;2006:486-9. doi: 10.1109/IEMBS.2004.1403200.
9
Actor-critic models of the basal ganglia: new anatomical and computational perspectives.基底神经节的 Actor-评论家模型:新的解剖学和计算视角。
Neural Netw. 2002 Jun-Jul;15(4-6):535-47. doi: 10.1016/s0893-6080(02)00047-3.
10
The Intelligent Path Planning System of Agricultural Robot via Reinforcement Learning.农业机器人的强化学习智能路径规划系统。
Sensors (Basel). 2022 Jun 7;22(12):4316. doi: 10.3390/s22124316.

引用本文的文献

1
In-silico simultaneous respiratory and circulatory measurement during voluntary breathing, exercise, and mental stress: A computational approach.在自愿呼吸、运动和精神压力期间进行的计算机模拟同步呼吸和循环测量:一种计算方法。
PLoS Comput Biol. 2024 Dec 17;20(12):e1012645. doi: 10.1371/journal.pcbi.1012645. eCollection 2024 Dec.
2
Antagonistic Feedback Control of Muscle Length Changes for Efficient Involuntary Posture Stabilization.肌肉长度变化的拮抗反馈控制以实现高效的非自主姿势稳定
Biomimetics (Basel). 2024 Oct 11;9(10):618. doi: 10.3390/biomimetics9100618.
3
Generating Human Arm Kinematics Using Reinforcement Learning to Train Active Muscle Behavior in Automotive Research.
使用强化学习生成人类手臂运动学,以训练汽车研究中的主动肌肉行为。
J Biomech Eng. 2022 Dec 1;144(12). doi: 10.1115/1.4055680.