Phang Chun-Ren, Hirata Akimasa
Department of Electrical and Mechanical Engineering, Nagoya Institute of Technology, Nagoya, Japan.
Center of Biomedical Physics and Information Technology, Nagoya Institute of Technology, Nagoya, Japan.
Ann N Y Acad Sci. 2025 Apr;1546(1):157-172. doi: 10.1111/nyas.15322. Epub 2025 Mar 30.
Deep reinforcement learning (RL) algorithms enable the development of fully autonomous agents that can interact with the environment. Brain-computer interface (BCI) systems decipher human implicit brain signals regardless of the explicit environment. We proposed a novel integration technique between deep RL and BCI to improve beneficial human interventions in autonomous systems and the performance in decoding brain activities by considering environmental factors. Shared autonomy was allowed between the action command decoded from the electroencephalography (EEG) of the human agent and the action generated from the twin delayed DDPG (TD3) agent for a given complex environment. Our proposed copilot control scheme with a full blocker (Co-FB) significantly outperformed the individual EEG (EEG-NB) or TD3 control. The Co-FB model achieved a higher target-approaching score, lower failure rate, and lower human workload than the EEG-NB model. We also proposed a disparity -index to evaluate the effect of contradicting agent decisions on the control accuracy and authority of the copilot model. We observed that shifting control authority to the TD3 agent improved performance when BCI decoding was not optimal. These findings indicate that the copilot system can effectively handle complex environments and that BCI performance can be improved by considering environmental factors.
深度强化学习(RL)算法能够开发出可与环境交互的完全自主智能体。脑机接口(BCI)系统能够解读人类的隐式脑信号,而无需考虑明确的环境。我们提出了一种深度强化学习与脑机接口之间的新型集成技术,通过考虑环境因素来改善人类在自主系统中的有益干预以及解码脑活动的性能。在给定的复杂环境中,允许从人类智能体的脑电图(EEG)解码得到的动作指令与双延迟深度确定性策略梯度(TD3)智能体生成的动作之间实现共享自主性。我们提出的带有完全阻断器的副驾驶控制方案(Co-FB)显著优于单独的脑电图控制(EEG-NB)或TD3控制。与EEG-NB模型相比,Co-FB模型实现了更高的目标接近得分、更低的失败率和更低的人类工作量。我们还提出了一个差异指数,以评估相互矛盾的智能体决策对副驾驶模型的控制精度和权威性的影响。我们观察到,当BCI解码不理想时,将控制权转移到TD3智能体可提高性能。这些发现表明,副驾驶系统能够有效处理复杂环境,并且通过考虑环境因素可以提高BCI性能。