Lifar Mikhail S, Tereshchenko Andrei A, Bulgakov Aleksei N, Guda Sergey A, Guda Alexander A, Soldatov Alexander V
The Smart Materials Research Institute, Southern Federal University, 344090 Rostov-on-Don, Russia.
I. I. Vorovich Institute of Mathematics, Mechanics, and Computer Science, Southern Federal University, 344090 Rostov-on-Don, Russia.
ACS Omega. 2024 Jun 20;9(26):27987-27997. doi: 10.1021/acsomega.3c10422. eCollection 2024 Jul 2.
Metal nanoparticles are widely used as heterogeneous catalysts to activate adsorbed molecules and reduce the energy barrier of the reaction. The reaction product yield depends on the interplay between elementary processes: adsorption, activation, desorption, and reaction. These processes, in turn, depend on the inlet gas composition, temperature, and pressure. At steady state, the active surface sites may be blocked by adsorbed reagents. A periodic regime may thus improve the yield, but the appropriate period and waveform are not known in advance. Dynamic control should account for changes in the surface and the gas atmosphere and adjust reaction parameters according to the current state of the system and its history. In this work, we applied a reinforcement learning algorithm to control CO oxidation on a palladium catalyst. The policy gradient algorithm was trained in a theoretical environment parametrized from experimental data. The algorithm learned to maximize the CO2 formation rate based on the CO and O2 partial pressures over several successive time steps. Within a unified approach, we found optimal stationary, periodic, and nonperiodic regimes for different problem formulations and gained insight into why the dynamic regime can be preferable. More broadly, this work contributes to popularizing the reinforcement learning approach in the field of catalytic science.
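The control loop described in the abstract (a policy gradient agent that picks inlet gas compositions to maximize the CO2 formation rate) can be illustrated with a minimal REINFORCE sketch. Everything below is a hypothetical toy: the two-coverage Langmuir-Hinshelwood-style surface model, the rate constants, the two candidate inlet compositions, and the linear softmax policy are all illustrative assumptions, not the authors' environment or parameters.

```python
import numpy as np

rng = np.random.default_rng(0)

def surface_step(theta_co, theta_o, p_co, p_o2):
    """One Euler step of a toy Langmuir-Hinshelwood surface model.
    All rate constants are made up for illustration."""
    free = max(0.0, 1.0 - theta_co - theta_o)   # fraction of empty sites
    ads_co = 1.0 * p_co * free                  # CO adsorption
    ads_o = 0.5 * p_o2 * free                   # dissociative O2 adsorption, lumped
    rate = 2.0 * theta_co * theta_o             # CO2 formation rate (the reward)
    theta_co = float(np.clip(theta_co + 0.1 * (ads_co - rate), 0.0, 1.0))
    theta_o = float(np.clip(theta_o + 0.1 * (ads_o - rate), 0.0, 1.0))
    return theta_co, theta_o, rate

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Hypothetical action set: switch between a CO-rich and an O2-rich inlet feed.
ACTIONS = [(0.8, 0.2), (0.2, 0.8)]

def run_episode(w, horizon=50):
    """Roll out the softmax policy w (logits linear in [theta_co, theta_o, 1])."""
    theta_co, theta_o = 0.0, 0.0
    states, actions, rewards = [], [], []
    for _ in range(horizon):
        s = np.array([theta_co, theta_o, 1.0])
        probs = softmax(w @ s)
        a = rng.choice(len(ACTIONS), p=probs)
        theta_co, theta_o, r = surface_step(theta_co, theta_o, *ACTIONS[a])
        states.append(s); actions.append(a); rewards.append(r)
    return states, actions, rewards

def train(w, episodes=200, lr=0.5):
    """Plain REINFORCE with a mean-return baseline."""
    for _ in range(episodes):
        states, actions, rewards = run_episode(w)
        returns = np.cumsum(rewards[::-1])[::-1]      # undiscounted returns-to-go
        returns = returns - returns.mean()            # baseline
        for s, a, g in zip(states, actions, returns):
            probs = softmax(w @ s)
            grad = -np.outer(probs, s)                # d log pi(a|s) / dw
            grad[a] += s
            w += lr * g * grad / len(states)
    return w

w = train(np.zeros((len(ACTIONS), 3)))
```

Because the reaction term needs both adsorbates on the surface, a policy that alternates the two feeds tends to outperform a constant one in this toy model, which mirrors the abstract's point that periodic regimes can beat the stationary optimum.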