Ashith Shyam R B, Hao Zhou, Montanaro Umberto, Dixit Shilp, Rathinam Arunkumar, Gao Yang, Neumann Gerhard, Fallah Saber
Department of Electrical and Electronic Engineering, Surrey Space Center, University of Surrey, Guildford, United Kingdom.
Department of Mechanical Engineering, University of Surrey, Guildford, United Kingdom.
Front Robot AI. 2021 May 4;8:638849. doi: 10.3389/frobt.2021.638849. eCollection 2021.
This paper adds on to the on-going efforts to provide more autonomy to space robots and introduces the concept of programming by demonstration or imitation learning for trajectory planning of manipulators on free-floating spacecraft. A redundant 7-DoF robotic arm is mounted on small spacecraft dedicated for debris removal, on-orbit servicing and assembly, autonomous and rendezvous docking. The motion of robot (or manipulator) arm induces reaction forces on the spacecraft and hence its attitude changes prompting the Attitude Determination and Control System (ADCS) to take large corrective action. The method introduced here is capable of finding the trajectory that minimizes the attitudinal changes thereby reducing the load on ADCS. One of the critical elements in spacecraft trajectory planning and control is the power consumption. The approach introduced in this work carry out trajectory learning offline by collecting data from demonstrations and encoding it as a probabilistic distribution of trajectories. The learned trajectory distribution can be used for planning in previously unseen situations by conditioning the probabilistic distribution. Hence almost no power is required for computations after deployment. Sampling from a conditioned distribution provides several possible trajectories from the same start to goal state. To determine the trajectory that minimizes attitudinal changes, a cost term is defined and the trajectory which minimizes this cost is considered the optimal one.
本文在为太空机器人提供更多自主性的持续努力基础上,引入了示范编程或模仿学习的概念,用于自由漂浮航天器上机械手的轨迹规划。一个冗余的七自由度机器人手臂安装在专门用于碎片清除、在轨服务与组装、自主交会对接的小型航天器上。机器人(或机械手)手臂的运动会在航天器上产生反作用力,进而导致其姿态改变,促使姿态确定与控制系统(ADCS)采取大幅纠正措施。这里介绍的方法能够找到使姿态变化最小化的轨迹,从而减轻ADCS的负担。航天器轨迹规划与控制中的关键要素之一是功耗。这项工作中引入的方法通过从示范中收集数据并将其编码为轨迹的概率分布来进行离线轨迹学习。通过对概率分布进行条件设定,学习到的轨迹分布可用于在以前未见过的情况下进行规划。因此,部署后几乎无需计算功耗。从条件分布中采样可提供从相同起始状态到目标状态的多条可能轨迹。为了确定使姿态变化最小化的轨迹,定义了一个成本项,使该成本最小化的轨迹被视为最优轨迹。