Tong Kang, Li Man, Qin Jiahu, Ma Qichao, Zhang Jie, Liu Qingchen
IEEE Trans Cybern. 2024 Nov;54(11):6832-6842. doi: 10.1109/TCYB.2024.3402353. Epub 2024 Oct 30.
Differential game is an effective technique to describe the negotiation between the humans and robots, which is widely used to realize the trajectory tracking tasks in the human-robot interaction (HRI). However, most existing works consider the control-affine HRI systems and assume the desired trajectory is available to both the human and the robot, which limit the scope of applications. To overcome these difficulties, this work focuses on the nonaffine HRI system and supposes that the desired trajectory is not available to the robot. A novel differential game framework encoding the desired trajectory estimator is proposed, where the desired trajectory is estimated via the Gaussian process regression (GPR) technique. To address the challenge arising from the nonlinearity of the HRI system, we equivalently transform the original problem into the one in a differentially flat space, and seek the equilibrium strategies for the transformed problem substitutionally. We further prove that the trajectory tracking error satisfies a probabilistic bound, whose confidence interval tightens as the decrease of noise variance during the interaction. Comparative simulation results show that our method outperforms the learning-based method in terms of robustness, parameters setting, and time consumption. Experiment results further show that the tracking error under the proposed human-robot cooperative algorithm is reduced by 55% compared to the human direct control.
微分博弈是描述人与机器人之间协商的一种有效技术,广泛应用于实现人机交互(HRI)中的轨迹跟踪任务。然而,现有的大多数工作都考虑控制仿射HRI系统,并假设人和机器人都能获得期望轨迹,这限制了应用范围。为克服这些困难,这项工作聚焦于非仿射HRI系统,并假设机器人无法获得期望轨迹。提出了一种编码期望轨迹估计器的新型微分博弈框架,其中通过高斯过程回归(GPR)技术估计期望轨迹。为应对HRI系统非线性带来的挑战,我们将原问题等效变换到微分平坦空间中的一个问题,并替代地寻求变换后问题的均衡策略。我们进一步证明轨迹跟踪误差满足概率界,其置信区间随着交互过程中噪声方差的减小而收紧。对比仿真结果表明,我们的方法在鲁棒性、参数设置和时间消耗方面优于基于学习的方法。实验结果进一步表明,与人类直接控制相比,所提出的人机协作算法下的跟踪误差降低了55%。