• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于强化学习的光子储层计算中的自适应模型选择

Adaptive model selection in photonic reservoir computing by reinforcement learning.

作者信息

Kanno Kazutaka, Naruse Makoto, Uchida Atsushi

机构信息

Department of Information and Computer Sciences, Saitama University 255 Shimo-Okubo, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.

Department of Information Physics and Computing, Graduate School of Information Science and Technology, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8654, Japan.

出版信息

Sci Rep. 2020 Jun 22;10(1):10062. doi: 10.1038/s41598-020-66441-8.

DOI:10.1038/s41598-020-66441-8
PMID:32572093
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7308406/
Abstract

Photonic reservoir computing is an emergent technology toward beyond-Neumann computing. Although photonic reservoir computing provides superior performance in environments whose characteristics are coincident with the training datasets for the reservoir, the performance is significantly degraded if these characteristics deviate from the original knowledge used in the training phase. Here, we propose a scheme of adaptive model selection in photonic reservoir computing using reinforcement learning. In this scheme, a temporal waveform is generated by different dynamic source models that change over time. The system autonomously identifies the best source model for the task of time series prediction using photonic reservoir computing and reinforcement learning. We prepare two types of output weights for the source models, and the system adaptively selected the correct model using reinforcement learning, where the prediction errors are associated with rewards. We succeed in adaptive model selection when the source signal is temporally mixed, having originally been generated by two different dynamic system models, as well as when the signal is a mixture from the same model but with different parameter values. This study paves the way for autonomous behavior in photonic artificial intelligence and could lead to new applications in load forecasting and multi-objective control, where frequent environment changes are expected.

摘要

光子储层计算是一种迈向超越冯·诺依曼计算的新兴技术。尽管光子储层计算在其特性与储层训练数据集相符的环境中能提供卓越性能,但如果这些特性偏离训练阶段所使用的原始知识,性能就会显著下降。在此,我们提出一种利用强化学习在光子储层计算中进行自适应模型选择的方案。在该方案中,由随时间变化的不同动态源模型生成一个时间波形。系统利用光子储层计算和强化学习自主识别用于时间序列预测任务的最佳源模型。我们为源模型准备了两种类型的输出权重,系统利用强化学习自适应地选择正确模型,其中预测误差与奖励相关联。当源信号在时间上混合时,最初由两个不同动态系统模型生成,以及当信号是来自同一模型但具有不同参数值的混合信号时,我们都成功实现了自适应模型选择。本研究为光子人工智能中的自主行为铺平了道路,并可能在负荷预测和多目标控制等预期环境频繁变化的新应用中有所应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/0f27a9dcc713/41598_2020_66441_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/1ea6c1f0e10d/41598_2020_66441_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/0b1722afca7b/41598_2020_66441_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/d4ea75979603/41598_2020_66441_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/9846cb858c67/41598_2020_66441_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/d5de923098fc/41598_2020_66441_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/3f2b8b47a6f8/41598_2020_66441_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/91e33134d6e5/41598_2020_66441_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/f0ede461a2fb/41598_2020_66441_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/329d84a22404/41598_2020_66441_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/dae51628be37/41598_2020_66441_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/a8fdb5016957/41598_2020_66441_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/0f27a9dcc713/41598_2020_66441_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/1ea6c1f0e10d/41598_2020_66441_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/0b1722afca7b/41598_2020_66441_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/d4ea75979603/41598_2020_66441_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/9846cb858c67/41598_2020_66441_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/d5de923098fc/41598_2020_66441_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/3f2b8b47a6f8/41598_2020_66441_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/91e33134d6e5/41598_2020_66441_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/f0ede461a2fb/41598_2020_66441_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/329d84a22404/41598_2020_66441_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/dae51628be37/41598_2020_66441_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/a8fdb5016957/41598_2020_66441_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46d9/7308406/0f27a9dcc713/41598_2020_66441_Fig12_HTML.jpg

相似文献

1
Adaptive model selection in photonic reservoir computing by reinforcement learning.基于强化学习的光子储层计算中的自适应模型选择
Sci Rep. 2020 Jun 22;10(1):10062. doi: 10.1038/s41598-020-66441-8.
2
Photonic reinforcement learning based on optoelectronic reservoir computing.基于光电储能计算的光子强化学习。
Sci Rep. 2022 Mar 8;12(1):3720. doi: 10.1038/s41598-022-07404-z.
3
Impact of input mask signals on delay-based photonic reservoir computing with semiconductor lasers.输入掩码信号对基于延迟的半导体激光器光子储能计算的影响。
Opt Express. 2018 Mar 5;26(5):5777-5788. doi: 10.1364/OE.26.005777.
4
Adaptive time-delayed photonic reservoir computing based on Kalman-filter training.基于卡尔曼滤波器训练的自适应时延光子储层计算
Opt Express. 2022 Apr 11;30(8):13647-13658. doi: 10.1364/OE.454852.
5
Compact reservoir computing with a photonic integrated circuit.基于光子集成电路的紧凑型储层计算
Opt Express. 2018 Oct 29;26(22):29424-29439. doi: 10.1364/OE.26.029424.
6
Minimum complexity integrated photonic architecture for delay-based reservoir computing.基于延迟的储层计算的最小复杂度集成光子架构。
Opt Express. 2023 Mar 27;31(7):11610-11623. doi: 10.1364/OE.484052.
7
Toward optical signal processing using photonic reservoir computing.迈向利用光子储能计算进行光信号处理。
Opt Express. 2008 Jul 21;16(15):11182-92. doi: 10.1364/oe.16.011182.
8
Photonic reservoir computing enabled by stimulated Brillouin scattering.受激布里渊散射实现的光子存储计算。
Opt Express. 2023 Jun 19;31(13):22061-22074. doi: 10.1364/OE.489057.
9
A Reservoir Computing Model of Reward-Modulated Motor Learning and Automaticity.一种基于奖励调节的运动学习和自动化的储层计算模型。
Neural Comput. 2019 Jul;31(7):1430-1461. doi: 10.1162/neco_a_01198. Epub 2019 May 21.
10
Forecasting stock market with nanophotonic reservoir computing system based on silicon optomechanical oscillators.基于硅光机械振荡器的纳米光子储层计算系统预测股票市场。
Opt Express. 2022 Jun 20;30(13):23359-23381. doi: 10.1364/OE.454973.

引用本文的文献

1
Training of physical neural networks.物理神经网络的训练。
Nature. 2025 Sep;645(8079):53-61. doi: 10.1038/s41586-025-09384-2. Epub 2025 Sep 3.