Andriella Antonio, Torras Carme, Abdelnour Carla, Alenyà Guillem
CSIC-UPC, Institut de Robòtica i Informàtica Industrial, C/Llorens i Artigas 4-6, 08028 Barcelona, Spain.
Research Center and Memory Clinic, Fundació ACE, Institut Català de Neurociències Aplicades, Universitat Internacional de Catalunya, Barcelona, Spain.
User Model User-adapt Interact. 2023;33(2):441-496. doi: 10.1007/s11257-021-09316-5. Epub 2022 Mar 12.
Socially assistive robots have the potential to augment and enhance therapist's effectiveness in repetitive tasks such as cognitive therapies. However, their contribution has generally been limited as domain experts have not been fully involved in the entire pipeline of the design process as well as in the automatisation of the robots' behaviour. In this article, we present aCtive leARning agEnt aSsiStive bEhaviouR (CARESSER), a novel framework that actively learns robotic assistive behaviour by leveraging the therapist's expertise (knowledge-driven approach) and their demonstrations (data-driven approach). By exploiting that hybrid approach, the presented method enables in situ fast learning, in a fully autonomous fashion, of personalised patient-specific policies. With the purpose of evaluating our framework, we conducted two user studies in a daily care centre in which older adults affected by mild dementia and mild cognitive impairment ( = 22) were requested to solve cognitive exercises with the support of a therapist and later on of a robot endowed with CARESSER. Results showed that: (i) the robot managed to keep the patients' performance stable during the sessions even more so than the therapist; (ii) the assistance offered by the robot during the sessions eventually matched the therapist's preferences. We conclude that CARESSER, with its stakeholder-centric design, can pave the way to new AI approaches that learn by leveraging human-human interactions along with human expertise, which has the benefits of speeding up the learning process, eliminating the need for the design of complex reward functions, and finally avoiding undesired states.
社交辅助机器人有潜力在诸如认知疗法等重复性任务中增强治疗师的效能。然而,它们的贡献通常有限,因为领域专家尚未充分参与到设计过程的整个流程以及机器人行为的自动化中。在本文中,我们提出了主动学习智能体辅助行为(CARESSER),这是一个新颖的框架,它通过利用治疗师的专业知识(知识驱动方法)及其示范(数据驱动方法)来主动学习机器人辅助行为。通过采用这种混合方法,所提出的方法能够以完全自主的方式就地快速学习个性化的针对患者的策略。为了评估我们的框架,我们在一家日托中心进行了两项用户研究,其中要求22名受轻度痴呆和轻度认知障碍影响的老年人在治疗师以及后来配备了CARESSER的机器人的支持下完成认知练习。结果表明:(i)机器人在 sessions 期间能够使患者的表现保持稳定,甚至比治疗师做得更好;(ii)机器人在 sessions 期间提供的帮助最终与治疗师的偏好相匹配。我们得出结论,CARESSER以其以利益相关者为中心的设计,可以为通过利用人际互动和人类专业知识进行学习的新人工智能方法铺平道路,这具有加快学习过程、消除设计复杂奖励函数的需求并最终避免不良状态的好处。 (注:原文中“sessions”未明确其准确含义,按原样保留)