Université Grenoble Alpes, CNRS, Institut Néel, 38042, Grenoble, France.
Department of Information Physics and Computing, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.
Sci Rep. 2019 Aug 22;9(1):12229. doi: 10.1038/s41598-019-48647-7.
The competitive multi-armed bandit (CMAB) problem is related to social issues such as maximizing total social benefits while preserving equality among individuals, which requires overcoming conflicts between individual decisions that could otherwise seriously decrease social benefits. This study provides experimental evidence that entangled photons physically resolve the CMAB problem in the two-arm, two-player case, maximizing the social reward while ensuring equality. Moreover, we demonstrate that deception, that is, outperforming the other player by receiving a greater reward, cannot be accomplished in a system based on polarization-entangled photons, whereas it is achievable in systems based on classical polarization-correlated photons with fixed polarizations. In addition, random polarization-correlated photons were studied numerically and shown to ensure equality between players and to prevent deception as well, although the maximum CMAB performance is reduced compared with the entangled-photon experiments. Autonomous alignment schemes for the polarization bases were also demonstrated experimentally, based only on the decision-conflict information observed by an individual player, without communication between players. This study paves the way for collective decision making in uncertain, dynamically changing environments based on entangled quantum states, a crucial step toward utilizing quantum systems for intelligent functionalities.
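The contrast between entangled and fixed-polarization classical photon pairs can be illustrated with a small idealized calculation. The sketch below is not the authors' experimental model. It assumes a polarization singlet state, lossless ideal analyzers, and a hypothetical mapping in which each player's two detection outcomes select arm 0 or arm 1. All function names are invented for illustration. Under these assumptions the probability that both players pick the same arm (a decision conflict) is sin²(a − b) for analyzer angles a and b, and each player's chance of picking arm 0 stays at 1/2 whatever angle is chosen.

```python
import math


def entangled_joint(a, b):
    """Joint arm-choice probabilities for an ideal polarization singlet.

    a, b: analyzer angles (radians) of player 1 and player 2.
    Assumption: the outcome along the analyzer axis selects arm 0,
    the orthogonal outcome selects arm 1.
    """
    same = math.sin(a - b) ** 2 / 2   # both players select the same arm
    diff = math.cos(a - b) ** 2 / 2   # players select different arms
    return {(0, 0): same, (1, 1): same, (0, 1): diff, (1, 0): diff}


def fixed_classical_joint(a, b):
    """Joint arm-choice probabilities for a classical pair with fixed polarizations.

    Assumption: photon 1 is always horizontal and photon 2 always vertical,
    so each outcome follows Malus's law independently of the other player.
    """
    p1 = [math.cos(a) ** 2, math.sin(a) ** 2]  # player 1: arm 0, arm 1
    p2 = [math.sin(b) ** 2, math.cos(b) ** 2]  # player 2: arm 0, arm 1
    return {(i, j): p1[i] * p2[j] for i in (0, 1) for j in (0, 1)}


def conflict_probability(joint):
    """Probability that both players select the same arm."""
    return joint[(0, 0)] + joint[(1, 1)]


def prob_arm0(joint, player):
    """Marginal probability that the given player (0 or 1) selects arm 0."""
    return sum(p for arms, p in joint.items() if arms[player] == 0)
```

With aligned analyzers (a = b) the entangled pair gives zero conflict probability, and rotating one analyzer only raises the conflict rate without shifting that player's arm preference. In the fixed-polarization case, by contrast, player 2 can set b = π/2 and select arm 0 with certainty, which is the kind of bias the abstract describes as deception.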