• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越严格阶段性环境的量子可及强化学习。

Quantum-accessible reinforcement learning beyond strictly epochal environments.

作者信息

Hamann A, Dunjko V, Wölk S

机构信息

Institut für Theoretische Physik, Universität Innsbruck, Technikerstraße 21a, 6020 Innsbruck, Austria.

LIACS, Leiden University, Niels Bohrweg 1, 2333 CA Leiden, The Netherlands.

出版信息

Quantum Mach Intell. 2021;3(2):22. doi: 10.1007/s42484-021-00049-7. Epub 2021 Aug 2.

DOI:10.1007/s42484-021-00049-7
PMID:34723097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8550166/
Abstract

In recent years, quantum-enhanced machine learning has emerged as a particularly fruitful application of quantum algorithms, covering aspects of supervised, unsupervised and reinforcement learning. Reinforcement learning offers numerous options of how quantum theory can be applied, and is arguably the least explored, from a quantum perspective. Here, an agent explores an environment and tries to find a behavior optimizing some figure of merit. Some of the first approaches investigated settings where this exploration can be sped-up, by considering quantum analogs of classical environments, which can then be queried in superposition. If the environments have a strict periodic structure in time (i.e. are strictly episodic), such environments can be effectively converted to conventional oracles encountered in quantum information. However, in general environments, we obtain scenarios that generalize standard oracle tasks. In this work, we consider one such generalization, where the environment is not strictly episodic, which is mapped to an oracle identification setting with a changing oracle. We analyze this case and show that standard amplitude-amplification techniques can, with minor modifications, still be applied to achieve quadratic speed-ups. In addition, we prove that an algorithm based on Grover iterations is optimal for oracle identification even if the oracle changes over time in a way that the "rewarded space" is monotonically increasing. This result constitutes one of the first generalizations of quantum-accessible reinforcement learning.

摘要

近年来,量子增强机器学习已成为量子算法特别富有成果的应用领域,涵盖了监督学习、无监督学习和强化学习等方面。从量子角度来看,强化学习提供了多种应用量子理论的选择,并且可以说是探索最少的领域。在这里,智能体探索一个环境,并试图找到一种优化某些品质因数的行为。一些早期的方法研究了通过考虑经典环境的量子类似物来加速这种探索的设置,然后可以在叠加态中查询这些类似物。如果环境在时间上具有严格的周期性结构(即严格是 episodic 的),这样的环境可以有效地转换为量子信息中遇到的传统预言机。然而,在一般环境中,我们会得到一些推广了标准预言机任务的场景。在这项工作中,我们考虑一种这样的推广,即环境不是严格 episodic 的,它被映射到一个预言机识别设置,其中预言机是变化的。我们分析了这种情况,并表明标准的幅度放大技术只需稍作修改,仍然可以应用以实现二次加速。此外,我们证明即使预言机随时间以“奖励空间”单调增加的方式变化,基于格罗弗迭代的算法对于预言机识别也是最优的。这一结果构成了量子可及强化学习的首批一般化成果之一。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/cdf1bd53e9b2/42484_2021_49_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/24ec27f5e43b/42484_2021_49_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/69c35615b56c/42484_2021_49_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/cdf1bd53e9b2/42484_2021_49_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/24ec27f5e43b/42484_2021_49_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/69c35615b56c/42484_2021_49_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d696/8550166/cdf1bd53e9b2/42484_2021_49_Fig3_HTML.jpg

相似文献

1
Quantum-accessible reinforcement learning beyond strictly epochal environments.超越严格阶段性环境的量子可及强化学习。
Quantum Mach Intell. 2021;3(2):22. doi: 10.1007/s42484-021-00049-7. Epub 2021 Aug 2.
2
Quantum-Enhanced Machine Learning.量子增强机器学习
Phys Rev Lett. 2016 Sep 23;117(13):130501. doi: 10.1103/PhysRevLett.117.130501. Epub 2016 Sep 20.
3
On the convergence of projective-simulation-based reinforcement learning in Markov decision processes.基于投影模拟的强化学习在马尔可夫决策过程中的收敛性
Quantum Mach Intell. 2020;2(2):13. doi: 10.1007/s42484-020-00023-9. Epub 2020 Nov 5.
4
Quantum reinforcement learning.量子强化学习
IEEE Trans Syst Man Cybern B Cybern. 2008 Oct;38(5):1207-20. doi: 10.1109/TSMCB.2008.925743.
5
Experimental quantum speed-up in reinforcement learning agents.实验性量子强化学习代理中的速度提升。
Nature. 2021 Mar;591(7849):229-233. doi: 10.1038/s41586-021-03242-7. Epub 2021 Mar 10.
6
Fixed-point quantum search with an optimal number of queries.具有最优查询数量的定点量子搜索。
Phys Rev Lett. 2014 Nov 21;113(21):210501. doi: 10.1103/PhysRevLett.113.210501. Epub 2014 Nov 18.
7
Multiqubit and multilevel quantum reinforcement learning with quantum technologies.利用量子技术进行多量子位和多层次量子强化学习。
PLoS One. 2018 Jul 19;13(7):e0200455. doi: 10.1371/journal.pone.0200455. eCollection 2018.
8
Active Inference: Demystified and Compared.主动推断:去神秘化与比较。
Neural Comput. 2021 Mar;33(3):674-712. doi: 10.1162/neco_a_01357. Epub 2021 Jan 5.
9
Generalization Enhancement of Visual Reinforcement Learning through Internal States.通过内部状态增强视觉强化学习的泛化能力
Sensors (Basel). 2024 Jul 12;24(14):4513. doi: 10.3390/s24144513.
10
A general quantum algorithm for numerical integration.一种用于数值积分的通用量子算法。
Sci Rep. 2024 May 7;14(1):10432. doi: 10.1038/s41598-024-61010-9.

本文引用的文献

1
Experimental quantum speed-up in reinforcement learning agents.实验性量子强化学习代理中的速度提升。
Nature. 2021 Mar;591(7849):229-233. doi: 10.1038/s41586-021-03242-7. Epub 2021 Mar 10.
2
Supervised learning with quantum-enhanced feature spaces.基于量子增强特征空间的有监督学习。
Nature. 2019 Mar;567(7747):209-212. doi: 10.1038/s41586-019-0980-2. Epub 2019 Mar 13.
3
Machine learning & artificial intelligence in the quantum domain: a review of recent progress.机器学习与量子领域中的人工智能:近期进展综述。
Rep Prog Phys. 2018 Jul;81(7):074001. doi: 10.1088/1361-6633/aab406. Epub 2018 Mar 5.
4
Active learning machine learns to create new quantum experiments.主动学习机学会了创建新的量子实验。
Proc Natl Acad Sci U S A. 2018 Feb 6;115(6):1221-1226. doi: 10.1073/pnas.1714936115. Epub 2018 Jan 18.
5
Quantum machine learning.量子机器学习。
Nature. 2017 Sep 13;549(7671):195-202. doi: 10.1038/nature23474.
6
Quantum-Enhanced Machine Learning.量子增强机器学习
Phys Rev Lett. 2016 Sep 23;117(13):130501. doi: 10.1103/PhysRevLett.117.130501. Epub 2016 Sep 20.
7
Fixed-point quantum search with an optimal number of queries.具有最优查询数量的定点量子搜索。
Phys Rev Lett. 2014 Nov 21;113(21):210501. doi: 10.1103/PhysRevLett.113.210501. Epub 2014 Nov 18.
8
Projective simulation for artificial intelligence.人工智能的投影模拟。
Sci Rep. 2012;2:400. doi: 10.1038/srep00400. Epub 2012 May 15.
9
Quantum algorithm for linear systems of equations.量子方程组算法。
Phys Rev Lett. 2009 Oct 9;103(15):150502. doi: 10.1103/PhysRevLett.103.150502. Epub 2009 Oct 7.
10
The quantum internet.量子互联网。
Nature. 2008 Jun 19;453(7198):1023-30. doi: 10.1038/nature07127.