• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过强化学习发现的一氧化碳氧化的最优动态机制

Optimal Dynamic Regimes for CO Oxidation Discovered by Reinforcement Learning.

作者信息

Lifar Mikhail S, Tereshchenko Andrei A, Bulgakov Aleksei N, Guda Sergey A, Guda Alexander A, Soldatov Alexander V

机构信息

The Smart Materials Research Institute, Southern Federal University, 344090 Rostov-on-Don, Russia.

Institute for Mathematics, Mechanics and Computer Science in the name of I.I. Vorovich, Southern Federal University, 344090 Rostov-on-Don, Russia.

出版信息

ACS Omega. 2024 Jun 20;9(26):27987-27997. doi: 10.1021/acsomega.3c10422. eCollection 2024 Jul 2.

DOI:10.1021/acsomega.3c10422
PMID:38973853
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11223201/
Abstract

Metal nanoparticles are widely used as heterogeneous catalysts to activate adsorbed molecules and reduce the energy barrier of the reaction. Reaction product yield depends on the interplay between elementary processes: adsorption, activation, desorption, and reaction. These processes, in turn, depend on the inlet gas composition, temperature, and pressure. At a steady state, the active surface sites may be inaccessible due to adsorbed reagents. Periodic regime may thus improve the yield, but the appropriate period and waveform are not known in advance. Dynamic control should account for surface and atmospheric modifications and adjust reaction parameters according to the current state of the system and its history. In this work, we applied a reinforcement learning algorithm to control CO oxidation on a palladium catalyst. The policy gradient algorithm was trained in the theoretical environment, parametrized from experimental data. The algorithm learned to maximize the CO formation rate based on CO and O partial pressures for several successive time steps. Within a unified approach, we found optimal stationary, periodic, and nonperiodic regimes for different problem formulations and gained insight into why the dynamic regime can be preferential. In general, this work contributes to the task of popularizing the reinforcement learning approach in the field of catalytic science.

摘要

金属纳米颗粒作为多相催化剂被广泛用于活化吸附分子并降低反应的能量壁垒。反应产物的产率取决于基本过程之间的相互作用:吸附、活化、解吸和反应。而这些过程又取决于进气组成、温度和压力。在稳态下,由于吸附的试剂,活性表面位点可能无法被利用。因此,周期性状态可能会提高产率,但合适的周期和波形事先并不清楚。动态控制应考虑表面和大气的变化,并根据系统的当前状态及其历史来调整反应参数。在这项工作中,我们应用强化学习算法来控制钯催化剂上的一氧化碳氧化反应。策略梯度算法在理论环境中进行训练,该理论环境根据实验数据进行参数化。该算法学会了在几个连续的时间步长内,基于一氧化碳和氧气的分压来最大化一氧化碳的生成速率。在统一的方法中,我们针对不同的问题表述找到了最优的稳态、周期性和非周期性状态,并深入了解了为什么动态状态可能更具优势。总的来说,这项工作有助于在催化科学领域推广强化学习方法这一任务。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/138cd89eb504/ao3c10422_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/ab66c48915c0/ao3c10422_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/fa23e16cc117/ao3c10422_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/1ae1dfefae17/ao3c10422_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/9258e3f5e06b/ao3c10422_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/76cfa98bd700/ao3c10422_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/846e5e5ce1e4/ao3c10422_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/4f3a38fb29b4/ao3c10422_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/2c60a5cfeed3/ao3c10422_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/138cd89eb504/ao3c10422_0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/ab66c48915c0/ao3c10422_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/fa23e16cc117/ao3c10422_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/1ae1dfefae17/ao3c10422_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/9258e3f5e06b/ao3c10422_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/76cfa98bd700/ao3c10422_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/846e5e5ce1e4/ao3c10422_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/4f3a38fb29b4/ao3c10422_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/2c60a5cfeed3/ao3c10422_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e32/11223201/138cd89eb504/ao3c10422_0009.jpg

相似文献

1
Optimal Dynamic Regimes for CO Oxidation Discovered by Reinforcement Learning.通过强化学习发现的一氧化碳氧化的最优动态机制
ACS Omega. 2024 Jun 20;9(26):27987-27997. doi: 10.1021/acsomega.3c10422. eCollection 2024 Jul 2.
2
Reactivity of chemisorbed oxygen atoms and their catalytic consequences during CH4-O2 catalysis on supported Pt clusters.担载 Pt 团簇上 CH4-O2 催化反应中化学吸附氧原子的反应性及其催化后果。
J Am Chem Soc. 2011 Oct 12;133(40):15958-78. doi: 10.1021/ja202411v. Epub 2011 Sep 15.
3
Investigation of palladium catalysts in mesoporous silica support for CO oxidation and CO adsorption.介孔二氧化硅负载钯催化剂用于一氧化碳氧化和吸附的研究。
Heliyon. 2023 Jul 17;9(7):e18354. doi: 10.1016/j.heliyon.2023.e18354. eCollection 2023 Jul.
4
Tuning the properties of copper-based catalysts based on molecular in situ studies of model systems.基于模型体系的分子原位研究来调变铜基催化剂的性能。
Acc Chem Res. 2015 Jul 21;48(7):2151-8. doi: 10.1021/acs.accounts.5b00200. Epub 2015 Jun 23.
5
The cobalt oxidation state in preferential CO oxidation on CoO/Pt(111) investigated by X-ray photoemission spectroscopy.通过X射线光电子能谱研究CoO/Pt(111)上优先一氧化碳氧化中钴的氧化态。
Phys Chem Chem Phys. 2022 Apr 20;24(16):9236-9246. doi: 10.1039/d2cp00399f.
6
Unique properties of ceria nanoparticles supported on metals: novel inverse ceria/copper catalysts for CO oxidation and the water-gas shift reaction.担载于金属上的氧化铈纳米颗粒的独特性质:新型氧化铈/铜反相催化剂用于 CO 氧化和水汽变换反应。
Acc Chem Res. 2013 Aug 20;46(8):1702-11. doi: 10.1021/ar300231p. Epub 2013 Jan 3.
7
Kinetic Studies of the Pt Carbonate-Mediated, Room-Temperature Oxidation of Carbon Monoxide by Oxygen over Pt/AlO Using Combined, Time-Resolved XAFS, DRIFTS, and Mass Spectrometry.使用时间分辨XAFS、漫反射红外傅里叶变换光谱(DRIFTS)和质谱联用技术对Pt/AlO上碳酸铂介导的一氧化碳在室温下被氧气氧化的动力学研究
J Am Chem Soc. 2016 Oct 26;138(42):13930-13940. doi: 10.1021/jacs.6b06819. Epub 2016 Oct 17.
8
Insights into catalytic oxidation at the Au/TiO(2) dual perimeter sites.深入了解 Au/TiO(2) 双周界位点的催化氧化作用。
Acc Chem Res. 2014 Mar 18;47(3):805-15. doi: 10.1021/ar400196f. Epub 2013 Dec 30.
9
Single Atom Dynamics in Chemical Reactions.化学反应中单原子动力学。
Acc Chem Res. 2020 Feb 18;53(2):390-399. doi: 10.1021/acs.accounts.9b00500. Epub 2020 Feb 5.
10
Toward an Atomic-Level Understanding of Ceria-Based Catalysts: When Experiment and Theory Go Hand in Hand.迈向对二氧化铈基催化剂的原子级理解:实验与理论携手并进之时。
Acc Chem Res. 2021 Jul 6;54(13):2884-2893. doi: 10.1021/acs.accounts.1c00226. Epub 2021 Jun 17.

本文引用的文献

1
Optimizing the Catalytic Activity of Pd-Based Multinary Alloys toward Oxygen Reduction Reaction.优化钯基多元合金对氧还原反应的催化活性。
J Phys Chem Lett. 2022 Feb 3;13(4):1042-1048. doi: 10.1021/acs.jpclett.1c04128. Epub 2022 Jan 24.
2
Silica-Supported PdGa Nanoparticles: Metal Synergy for Highly Active and Selective CO-to-CHOH Hydrogenation.二氧化硅负载的钯镓纳米颗粒:用于高效选择性一氧化碳加氢制甲醇的金属协同作用
JACS Au. 2021 Mar 17;1(4):450-458. doi: 10.1021/jacsau.1c00021. eCollection 2021 Apr 26.
3
Three-Factor Kinetic Equation of Catalyst Deactivation.
催化剂失活的三因素动力学方程。
Entropy (Basel). 2021 Jun 27;23(7):818. doi: 10.3390/e23070818.
4
Reinforcement learning application in diabetes blood glucose control: A systematic review.强化学习在糖尿病血糖控制中的应用:一项系统综述。
Artif Intell Med. 2020 Apr;104:101836. doi: 10.1016/j.artmed.2020.101836. Epub 2020 Feb 21.
5
Planning chemical syntheses with deep neural networks and symbolic AI.用深度神经网络和符号人工智能规划化学合成。
Nature. 2018 Mar 28;555(7698):604-610. doi: 10.1038/nature25978.
6
Optimizing Chemical Reactions with Deep Reinforcement Learning.利用深度强化学习优化化学反应
ACS Cent Sci. 2017 Dec 27;3(12):1337-1344. doi: 10.1021/acscentsci.7b00492. Epub 2017 Dec 15.
7
Room-temperature carbon monoxide oxidation by oxygen over Pt/Al2O3 mediated by reactive platinum carbonates.在活性碳酸铂介导下,氧气在Pt/Al₂O₃上对室温一氧化碳的氧化作用。
Nat Commun. 2015 Oct 22;6:8675. doi: 10.1038/ncomms9675.
8
Reinforcement learning improves behaviour from evaluative feedback.强化学习通过评估反馈来改善行为。
Nature. 2015 May 28;521(7553):445-51. doi: 10.1038/nature14540.
9
A review of dry (CO2) reforming of methane over noble metal catalysts.关于贵金属催化剂上甲烷干(CO2)重整的综述。
Chem Soc Rev. 2014 Nov 21;43(22):7813-37. doi: 10.1039/c3cs60395d.
10
Reversible formation of a PdC(x) phase in Pd nanoparticles upon CO and O2 exposure.Pd 纳米颗粒在 CO 和 O2 暴露下可逆地形成 PdC(x)相。
Phys Chem Chem Phys. 2012 Apr 14;14(14):4796-801. doi: 10.1039/c2cp22873d. Epub 2012 Feb 24.