• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用半导体激光器中混沌时间波形的偏差控制解决大规模多臂老虎机问题的决策方法。

Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.

作者信息

Morijiri Kensei, Mihana Takatomo, Kanno Kazutaka, Naruse Makoto, Uchida Atsushi

机构信息

Department of Information and Computer Sciences, Saitama University, 255 Shimo-okubo, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.

Department of Information Physics and Computing, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.

出版信息

Sci Rep. 2022 May 16;12(1):8073. doi: 10.1038/s41598-022-12155-y.

DOI:10.1038/s41598-022-12155-y
PMID:35577847
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9110346/
Abstract

Decision making using photonic technologies has been intensively researched for solving the multi-armed bandit problem, which is fundamental to reinforcement learning. However, these technologies are yet to be extended to large-scale multi-armed bandit problems. In this study, we conduct a numerical investigation of decision making to solve large-scale multi-armed bandit problems by controlling the biases of chaotic temporal waveforms generated in semiconductor lasers with optical feedback. We generate chaotic temporal waveforms using the semiconductor lasers, and each waveform is assigned to a slot machine (or choice) in the multi-armed bandit problem. The biases in the amplitudes of the chaotic waveforms are adjusted based on rewards using the tug-of-war method. Subsequently, the slot machine that yields the maximum-amplitude chaotic temporal waveform with bias is selected. The scaling properties of the correct decision-making process are examined by increasing the number of slot machines to 1024, and the scaling exponent of the power-law distribution is 0.97. We demonstrate that the proposed method outperforms existing software algorithms in terms of the scaling exponent. This result paves the way for photonic decision making in large-scale multi-armed bandit problems using photonic accelerators.

摘要

利用光子技术进行决策已被深入研究,以解决多臂老虎机问题,这是强化学习的基础。然而,这些技术尚未扩展到大规模多臂老虎机问题。在本研究中,我们通过控制光反馈半导体激光器中产生的混沌时间波形的偏差,对解决大规模多臂老虎机问题的决策进行了数值研究。我们使用半导体激光器生成混沌时间波形,并且每个波形被分配到多臂老虎机问题中的一个老虎机(或选择)。基于奖励,使用拔河方法调整混沌波形幅度的偏差。随后,选择产生具有偏差的最大幅度混沌时间波形的老虎机。通过将老虎机数量增加到1024来检查正确决策过程的标度性质,幂律分布的标度指数为0.97。我们证明,所提出的方法在标度指数方面优于现有的软件算法。这一结果为使用光子加速器在大规模多臂老虎机问题中进行光子决策铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/4b2566d5e911/41598_2022_12155_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/cbf3c35396b2/41598_2022_12155_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/39eb0544d047/41598_2022_12155_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/752c84f9087b/41598_2022_12155_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/1434908927d4/41598_2022_12155_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/b54f6314989b/41598_2022_12155_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/e52c2836ede7/41598_2022_12155_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/4b2566d5e911/41598_2022_12155_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/cbf3c35396b2/41598_2022_12155_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/39eb0544d047/41598_2022_12155_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/752c84f9087b/41598_2022_12155_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/1434908927d4/41598_2022_12155_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/b54f6314989b/41598_2022_12155_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/e52c2836ede7/41598_2022_12155_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2227/9110346/4b2566d5e911/41598_2022_12155_Fig7_HTML.jpg

相似文献

1
Decision making for large-scale multi-armed bandit problems using bias control of chaotic temporal waveforms in semiconductor lasers.利用半导体激光器中混沌时间波形的偏差控制解决大规模多臂老虎机问题的决策方法。
Sci Rep. 2022 May 16;12(1):8073. doi: 10.1038/s41598-022-12155-y.
2
Decision making for the multi-armed bandit problem using lag synchronization of chaos in mutually coupled semiconductor lasers.利用相互耦合半导体激光器中混沌的滞后同步解决多臂老虎机问题的决策方法。
Opt Express. 2019 Sep 16;27(19):26989-27008. doi: 10.1364/OE.27.026989.
3
Experimental control of mode-competition dynamics in a chaotic multimode semiconductor laser for decision making.用于决策的混沌多模半导体激光器中模式竞争动力学的实验控制
Opt Express. 2024 May 6;32(10):17274-17294. doi: 10.1364/OE.517257.
4
Photonic decision-making for arbitrary-number-armed bandit problem utilizing parallel chaos generation.利用并行混沌生成解决任意臂数的强盗问题的光子决策。
Opt Express. 2021 Aug 2;29(16):25290-25301. doi: 10.1364/OE.432956.
5
Overtaking method based on sand-sifter mechanism: Why do optimistic value functions find optimal solutions in multi-armed bandit problems?基于筛沙机制的超越方法:为何乐观值函数能在多臂老虎机问题中找到最优解?
Biosystems. 2015 Sep;135:55-65. doi: 10.1016/j.biosystems.2015.06.009. Epub 2015 Jul 10.
6
Laser network decision making by lag synchronization of chaos in a ring configuration.通过环形配置中混沌的滞后同步进行激光网络决策
Opt Express. 2020 Dec 21;28(26):40112-40130. doi: 10.1364/OE.411140.
7
Controlling chaotic itinerancy in laser dynamics for reinforcement learning.控制激光动力学中的混沌遍历以进行强化学习。
Sci Adv. 2022 Dec 9;8(49):eabn8325. doi: 10.1126/sciadv.abn8325. Epub 2022 Dec 7.
8
Ultrafast photonic reinforcement learning based on laser chaos.基于激光混沌的超快光子强化学习。
Sci Rep. 2017 Aug 18;7(1):8772. doi: 10.1038/s41598-017-08585-8.
9
Scalable photonic reinforcement learning by time-division multiplexing of laser chaos.通过激光混沌的时分复用实现可扩展的光子强化学习。
Sci Rep. 2018 Jul 18;8(1):10890. doi: 10.1038/s41598-018-29117-y.
10
Time-delay signature concealment of chaos and ultrafast decision making in mutually coupled semiconductor lasers with a phase-modulated Sagnac loop.具有相位调制萨尼亚克环的相互耦合半导体激光器中混沌的时延特征隐藏与超快决策
Opt Express. 2020 Jan 20;28(2):1665-1678. doi: 10.1364/OE.384378.

本文引用的文献

1
Entangled and correlated photon mixed strategy for social decision making.用于社会决策的纠缠与关联光子混合策略。
Sci Rep. 2021 Mar 1;11(1):4832. doi: 10.1038/s41598-021-84199-5.
2
Laser network decision making by lag synchronization of chaos in a ring configuration.通过环形配置中混沌的滞后同步进行激光网络决策
Opt Express. 2020 Dec 21;28(26):40112-40130. doi: 10.1364/OE.411140.
3
Entangled N-photon states for fair and optimal social decision making.纠缠 N 光子态用于公平和最优的社会决策。
Sci Rep. 2020 Nov 24;10(1):20420. doi: 10.1038/s41598-020-77340-3.
4
Dynamic channel selection in wireless communications via a multi-armed bandit algorithm using laser chaos time series.通过使用激光混沌时间序列的多臂老虎机算法实现无线通信中的动态信道选择。
Sci Rep. 2020 Jan 31;10(1):1574. doi: 10.1038/s41598-020-58541-2.
5
Decision making for the multi-armed bandit problem using lag synchronization of chaos in mutually coupled semiconductor lasers.利用相互耦合半导体激光器中混沌的滞后同步解决多臂老虎机问题的决策方法。
Opt Express. 2019 Sep 16;27(19):26989-27008. doi: 10.1364/OE.27.026989.
6
Entangled-photon decision maker.纠缠光子决策器。
Sci Rep. 2019 Aug 22;9(1):12229. doi: 10.1038/s41598-019-48647-7.
7
On-chip photonic decision maker using spontaneous mode switching in a ring laser.基于环形激光器中自发模式切换的片上光子决策器。
Sci Rep. 2019 Jul 1;9(1):9429. doi: 10.1038/s41598-019-45754-3.
8
Compact reservoir computing with a photonic integrated circuit.基于光子集成电路的紧凑型储层计算
Opt Express. 2018 Oct 29;26(22):29424-29439. doi: 10.1364/OE.26.029424.
9
Scalable photonic reinforcement learning by time-division multiplexing of laser chaos.通过激光混沌的时分复用实现可扩展的光子强化学习。
Sci Rep. 2018 Jul 18;8(1):10890. doi: 10.1038/s41598-018-29117-y.
10
Ultrafast photonic reinforcement learning based on laser chaos.基于激光混沌的超快光子强化学习。
Sci Rep. 2017 Aug 18;7(1):8772. doi: 10.1038/s41598-017-08585-8.