Safe deep reinforcement learning in diesel engine emission control.

Authors

Norouzi Armin, Shahpouri Saeid, Gordon David, Shahbakhti Mahdi, Koch Charles Robert

Affiliation

Department of Mechanical Engineering, University of Alberta, Edmonton, AB, Canada.

Publication information

Proc Inst Mech Eng Part I J Syst Control Eng. 2023 Sep;237(8):1440-1453. doi: 10.1177/09596518231153445. Epub 2023 Feb 17.

DOI:10.1177/09596518231153445
PMID:37692899
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10483989/
Abstract

A deep reinforcement learning application is investigated to control the emissions of a compression ignition diesel engine. The main purpose of this study is to reduce the engine-out nitrogen oxide emissions and to minimize fuel consumption while tracking a reference engine load. First, a physics-based engine simulation model is developed in GT-Power and calibrated using experimental data. Using this model and a GT-Power/Simulink co-simulation, a deep deterministic policy gradient agent is developed. To reduce the risk of an unwanted output, a safety filter is added to the deep reinforcement learning controller. Based on the simulation results, this filter has no effect on the final trained deep reinforcement learning agent; however, during the training process, it is crucial to enforce constraints on the controller output. The developed safe reinforcement learning controller is then compared with an iterative learning controller and a deep neural network-based nonlinear model predictive controller. This comparison shows that safe reinforcement learning is capable of accurately tracking an arbitrary reference input, while the iterative learning controller is limited to a repetitive reference. The comparison between nonlinear model predictive control and reinforcement learning indicates that, for this case, reinforcement learning is able to learn the optimal control output directly from the experiment without the need for a model. However, to enforce output constraints for safe reinforcement learning, a simple model of the system is required. In this work, reinforcement learning was able to reduce emissions more than the nonlinear model predictive control; however, it suffered from slightly higher error in load tracking and higher fuel consumption.
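The abstract does not give the form of the safety filter or of the reward, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a scalar action, a hypothetical reward with made-up weights (`w_load`, `w_nox`, `w_fuel`), and a hypothetical simple model `predict_output` whose predicted output does not increase when the action is lowered. The filter clips the agent's proposed action to actuator bounds and then backs it off until the simple model predicts that the output limit is met.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ActionBounds:
    """Hypothetical actuator limits for a scalar control action."""
    low: float
    high: float


def reward(load_error: float, nox: float, fuel: float,
           w_load: float = 1.0, w_nox: float = 0.1, w_fuel: float = 0.1) -> float:
    """Illustrative reward: penalise load-tracking error, NOx and fuel use.

    The weights are assumptions for this sketch, not values from the paper.
    """
    return -(w_load * load_error ** 2 + w_nox * nox + w_fuel * fuel)


def safety_filter(action: float,
                  bounds: ActionBounds,
                  predict_output: Callable[[float], float],
                  output_limit: float,
                  step: float = 0.05,
                  max_iter: int = 100) -> float:
    """Return an action that respects the bounds and, if possible, the output limit.

    Assumes lowering the action never raises the predicted output. If no
    action in range satisfies the limit, the lower bound is returned.
    """
    # Step 1: hard clip to the actuator range.
    safe = min(max(action, bounds.low), bounds.high)
    # Step 2: back off in fixed steps until the simple model predicts a safe output.
    decrement = step * (bounds.high - bounds.low)
    for _ in range(max_iter):
        if predict_output(safe) <= output_limit:
            return safe
        if safe <= bounds.low:
            break
        safe = max(safe - decrement, bounds.low)
    return bounds.low


if __name__ == "__main__":
    bounds = ActionBounds(low=0.0, high=1.0)
    simple_model = lambda a: 10.0 * a  # hypothetical linear output model
    # An exploratory action far outside the range is pulled back to a safe value.
    print(safety_filter(2.0, bounds, simple_model, output_limit=5.0))
    # An action that is already safe passes through unchanged.
    print(safety_filter(0.3, bounds, simple_model, output_limit=5.0))
```

In a sketch like this the filter sits between the agent and the plant during training, so exploration noise cannot push the actuator past its limits. Once the trained policy stays inside the constraints on its own, the filter passes its actions through unchanged, which is consistent with the abstract's remark that the filter does not affect the final trained agent.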
