Heskebeck Frida, Bergeling Carolina, Bernhardsson Bo
Department of Automatic Control, Lund University, Lund, Sweden.
Department of Mathematics and Natural Sciences, Blekinge Tekniska Högskola, Karlskrona, Sweden.
Front Hum Neurosci. 2022 Jul 5;16:931085. doi: 10.3389/fnhum.2022.931085. eCollection 2022.
The multi-armed bandit (MAB) problem models a decision-maker that optimizes its actions based on current and acquired new knowledge to maximize its reward. This type of online decision is prominent in many procedures of Brain-Computer Interfaces (BCIs) and MAB has previously been used to investigate, e.g., what mental commands to use to optimize BCI performance. However, MAB optimization in the context of BCI is still relatively unexplored, even though it has the potential to improve BCI performance during both calibration and real-time implementation. Therefore, this review aims to further describe the fruitful area of MABs to the BCI community. The review includes a background on MAB problems and standard solution methods, and interpretations related to BCI systems. Moreover, it includes state-of-the-art concepts of MAB in BCI and suggestions for future research.
多臂赌博机(MAB)问题对一个决策者进行建模,该决策者基于当前和新获取的知识来优化其行动,以最大化其奖励。这种在线决策在脑机接口(BCI)的许多过程中都很突出,并且MAB先前已被用于研究,例如,使用哪些心理命令来优化BCI性能。然而,尽管MAB在BCI校准和实时实施过程中都有提高BCI性能的潜力,但在BCI背景下的MAB优化仍相对未被探索。因此,本综述旨在向BCI社区进一步介绍MAB这一富有成果的领域。该综述包括MAB问题和标准解决方法的背景,以及与BCI系统相关的解释。此外,它还包括BCI中MAB的最新概念和未来研究建议。