IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):267-279. doi: 10.1109/TPAMI.2023.3322426. Epub 2023 Dec 5.
Ensuring safety and achieving human-level driving performance remain challenges for autonomous vehicles, especially in safety-critical situations. As a key component of artificial intelligence, reinforcement learning is promising and has shown great potential in many complex tasks; however, its lack of safety guarantees limits its real-world applicability. Hence, further advancing reinforcement learning, especially from the safety perspective, is of great importance for autonomous driving. As revealed by cognitive neuroscientists, the amygdala of the brain can elicit defensive responses against threats or hazards, which is crucial for survival in and adaptation to risky environments. Drawing inspiration from this scientific discovery, we present a fear-neuro-inspired reinforcement learning framework to realize safe autonomous driving through modeling the amygdala functionality. This new technique facilitates an agent to learn defensive behaviors and achieve safe decision making with fewer safety violations. Through experimental tests, we show that the proposed approach enables the autonomous driving agent to attain state-of-the-art performance compared to the baseline agents and perform comparably to 30 certified human drivers, across various safety-critical scenarios. The results demonstrate the feasibility and effectiveness of our framework while also shedding light on the crucial role of simulating the amygdala function in the application of reinforcement learning to safety-critical autonomous driving domains.
确保自动驾驶汽车的安全性并实现人类驾驶水平仍然是一项挑战,尤其是在安全关键情况下。作为人工智能的关键组成部分,强化学习具有广阔的前景,在许多复杂任务中展现出巨大的潜力;然而,其缺乏安全保障限制了其在实际应用中的适用性。因此,进一步推进强化学习,特别是从安全角度来看,对于自动驾驶至关重要。认知神经科学家揭示,大脑的杏仁核可以对威胁或危险产生防御反应,这对于在危险环境中生存和适应至关重要。受此科学发现的启发,我们提出了一种基于恐惧神经的强化学习框架,通过模拟杏仁核功能来实现安全的自动驾驶。这种新技术可以使代理通过学习防御行为并减少安全违规来实现安全决策。通过实验测试,我们表明,与基线代理相比,所提出的方法使自动驾驶代理能够达到最新的性能水平,并在各种安全关键场景中与 30 名经过认证的人类驾驶员表现相当。结果表明了我们框架的可行性和有效性,同时也揭示了在强化学习应用于安全关键的自动驾驶领域时模拟杏仁核功能的关键作用。