• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于光电储能计算的光子强化学习。

Photonic reinforcement learning based on optoelectronic reservoir computing.

作者信息

Kanno Kazutaka, Uchida Atsushi

机构信息

Department of Information and Computer Sciences, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama City, Saitama, 338-8570, Japan.

出版信息

Sci Rep. 2022 Mar 8;12(1):3720. doi: 10.1038/s41598-022-07404-z.

DOI:10.1038/s41598-022-07404-z
PMID:35260595
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8904492/
Abstract

Reinforcement learning has been intensively investigated and developed in artificial intelligence in the absence of training data, such as autonomous driving vehicles, robot control, internet advertising, and elastic optical networks. However, the computational cost of reinforcement learning with deep neural networks is extremely high and reducing the learning cost is a challenging issue. We propose a photonic on-line implementation of reinforcement learning using optoelectronic delay-based reservoir computing, both experimentally and numerically. In the proposed scheme, we accelerate reinforcement learning at a rate of several megahertz because there is no required learning process for the internal connection weights in reservoir computing. We perform two benchmark tasks, CartPole-v0 and MountanCar-v0 tasks, to evaluate the proposed scheme. Our results represent the first hardware implementation of reinforcement learning based on photonic reservoir computing and pave the way for fast and efficient reinforcement learning as a novel photonic accelerator.

摘要

在缺乏训练数据的情况下,强化学习在人工智能领域得到了深入研究和发展,例如自动驾驶车辆、机器人控制、互联网广告和弹性光网络。然而,基于深度神经网络的强化学习的计算成本极高,降低学习成本是一个具有挑战性的问题。我们通过实验和数值模拟,提出了一种使用基于光电延迟的储层计算的光子在线强化学习实现方案。在所提出的方案中,由于储层计算中内部连接权重无需学习过程,我们以几兆赫兹的速率加速强化学习。我们执行了两个基准任务,即CartPole-v0和MountanCar-v0任务,以评估所提出的方案。我们的结果代表了基于光子储层计算的强化学习的首次硬件实现,并为作为一种新型光子加速器的快速高效强化学习铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/09e087a39176/41598_2022_7404_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/338c77c90c19/41598_2022_7404_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/54bfdca3f454/41598_2022_7404_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/bf1d705f23a0/41598_2022_7404_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/305d78becad3/41598_2022_7404_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/09e087a39176/41598_2022_7404_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/338c77c90c19/41598_2022_7404_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/54bfdca3f454/41598_2022_7404_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/bf1d705f23a0/41598_2022_7404_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/305d78becad3/41598_2022_7404_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2ca/8904492/09e087a39176/41598_2022_7404_Fig5_HTML.jpg

相似文献

1
Photonic reinforcement learning based on optoelectronic reservoir computing.基于光电储能计算的光子强化学习。
Sci Rep. 2022 Mar 8;12(1):3720. doi: 10.1038/s41598-022-07404-z.
2
Adaptive model selection in photonic reservoir computing by reinforcement learning.基于强化学习的光子储层计算中的自适应模型选择
Sci Rep. 2020 Jun 22;10(1):10062. doi: 10.1038/s41598-020-66441-8.
3
Photonic deep residual time-delay reservoir computing.光子深度学习时滞reservoir computing。
Neural Netw. 2024 Nov;179:106575. doi: 10.1016/j.neunet.2024.106575. Epub 2024 Jul 27.
4
Toward optical signal processing using photonic reservoir computing.迈向利用光子储能计算进行光信号处理。
Opt Express. 2008 Jul 21;16(15):11182-92. doi: 10.1364/oe.16.011182.
5
Deep Photonic Reservoir Computer for Speech Recognition.用于语音识别的深度光子储层计算机。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7606-7614. doi: 10.1109/TNNLS.2024.3400451. Epub 2025 Apr 4.
6
Photonic reservoir computing enabled by stimulated Brillouin scattering.受激布里渊散射实现的光子存储计算。
Opt Express. 2023 Jun 19;31(13):22061-22074. doi: 10.1364/OE.489057.
7
Compact reservoir computing with a photonic integrated circuit.基于光子集成电路的紧凑型储层计算
Opt Express. 2018 Oct 29;26(22):29424-29439. doi: 10.1364/OE.26.029424.
8
Photonic implementation of the input and reservoir layers for a reservoir computing system based on a single VCSEL with two Mach-Zehnder modulators.基于具有两个马赫-曾德尔调制器的单个垂直腔面发射激光器的储层计算系统中输入层和储层层的光子实现。
Opt Express. 2024 May 6;32(10):17452-17463. doi: 10.1364/OE.522336.
9
Impact of optical coherence on the performance of large-scale spatiotemporal photonic reservoir computing systems.光学相干对大规模时空光子储层计算系统性能的影响。
Opt Express. 2020 Sep 14;28(19):27989-28005. doi: 10.1364/OE.400546.
10
Integrated photonic reservoir computing with an all-optical readout.具有全光读出的集成光子储层计算
Opt Express. 2023 Oct 9;31(21):34843-34854. doi: 10.1364/OE.502354.

引用本文的文献

1
High-efficiency reinforcement learning with hybrid architecture photonic integrated circuit.基于混合架构光子集成电路的高效强化学习
Nat Commun. 2024 Feb 5;15(1):1044. doi: 10.1038/s41467-024-45305-z.
2
Input-Output-Improved Reservoir Computing Based on Duffing Resonator Processing Dynamic Temperature Compensation for MEMS Resonant Accelerometer.基于达夫振子处理的微机电系统(MEMS)谐振式加速度计动态温度补偿的输入-输出-改进型储层计算
Micromachines (Basel). 2023 Jan 8;14(1):161. doi: 10.3390/mi14010161.

本文引用的文献

1
Performance boost of time-delay reservoir computing by non-resonant clock cycle.非共振时钟周期提高时滞存储计算的性能。
Neural Netw. 2020 Apr;124:158-169. doi: 10.1016/j.neunet.2020.01.010. Epub 2020 Jan 15.
2
Grandmaster level in StarCraft II using multi-agent reinforcement learning.星际争霸 II 中的大师级水平使用多智能体强化学习。
Nature. 2019 Nov;575(7782):350-354. doi: 10.1038/s41586-019-1724-z. Epub 2019 Oct 30.
3
Recent advances in physical reservoir computing: A review.近期物理存储计算的进展:综述。
Neural Netw. 2019 Jul;115:100-123. doi: 10.1016/j.neunet.2019.03.005. Epub 2019 Mar 20.
4
Neuromorphic computing with nanoscale spintronic oscillators.基于纳米级自旋电子振荡器的神经形态计算。
Nature. 2017 Jul 26;547(7664):428-431. doi: 10.1038/nature23011.
5
Hybrid computing using a neural network with dynamic external memory.使用具有动态外部存储器的神经网络进行混合计算。
Nature. 2016 Oct 27;538(7626):471-476. doi: 10.1038/nature20101. Epub 2016 Oct 12.
6
Laser dynamical reservoir computing with consistency: an approach of a chaos mask signal.具有一致性的激光动态储层计算:一种混沌掩码信号方法。
Opt Express. 2016 Apr 18;24(8):8679-92. doi: 10.1364/OE.24.008679.
7
Fully analogue photonic reservoir computer.全模拟光子储层计算机。
Sci Rep. 2016 Mar 3;6:22381. doi: 10.1038/srep22381.
8
A Unified Framework for Reservoir Computing and Extreme Learning Machines based on a Single Time-delayed Neuron.基于单个时延神经元的水库计算与极限学习机统一框架
Sci Rep. 2015 Oct 8;5:14945. doi: 10.1038/srep14945.
9
Information processing via physical soft body.通过物理软体进行信息处理。
Sci Rep. 2015 May 27;5:10487. doi: 10.1038/srep10487.
10
Human-level control through deep reinforcement learning.通过深度强化学习实现人类水平的控制。
Nature. 2015 Feb 26;518(7540):529-33. doi: 10.1038/nature14236.