• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于FAE-CAE-LSTM和深度强化学习的传感器驱动的非线性动力系统代理建模与控制

Sensor-Driven Surrogate Modeling and Control of Nonlinear Dynamical Systems Using FAE-CAE-LSTM and Deep Reinforcement Learning.

作者信息

Kherad Mahdi, Moayyedi Mohammad Kazem, Fotouhi-Ghazvini Faranak, Vahabi Maryam, Fotouhi Hossein

机构信息

Department of Computer Engineering and IT, University of Qom, Qom 46611, Iran.

CFD and Turbulence Research Lab., Department of Mechanical Engineering, University of Qom, Qom 46611, Iran.

出版信息

Sensors (Basel). 2025 Aug 19;25(16):5149. doi: 10.3390/s25165149.

DOI:10.3390/s25165149
PMID:40872017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12390565/
Abstract

In cyber-physical systems governed by nonlinear partial differential equations (PDEs), real-time control is often limited by sparse sensor data and high-dimensional system dynamics. Deep reinforcement learning (DRL) has shown promise for controlling such systems, but training DRL agents directly on full-order simulations is computationally intensive. This paper presents a sensor-driven, non-intrusive reduced-order modeling (NIROM) framework called FAE-CAE-LSTM, which combines convolutional and fully connected autoencoders with a long short-term memory (LSTM) network. The model compresses high-dimensional states into a latent space and captures their temporal evolution. A DRL agent is trained entirely in this reduced space, interacting with the surrogate built from sensor-like spatiotemporal measurements, such as pressure and velocity fields. A CNN-MLP reward estimator provides data-driven feedback without requiring access to governing equations. The method is tested on benchmark systems including Burgers' equation, the Kuramoto-Sivashinsky equation, and flow past a circular cylinder; accuracy is further validated on flow past a square cylinder. Experimental results show that the proposed approach achieves accurate reconstruction, robust control, and significant computational speedup over traditional simulation-based training. These findings confirm the effectiveness of the FAE-CAE-LSTM surrogate in enabling real-time, sensor-informed, scalable DRL-based control of nonlinear dynamical systems.

摘要

在由非线性偏微分方程(PDE)控制的网络物理系统中,实时控制常常受到稀疏传感器数据和高维系统动力学的限制。深度强化学习(DRL)已显示出控制此类系统的潜力,但直接在全阶模拟上训练DRL智能体计算量很大。本文提出了一种名为FAE-CAE-LSTM的传感器驱动的非侵入式降阶建模(NIROM)框架,该框架将卷积和全连接自动编码器与长短期记忆(LSTM)网络相结合。该模型将高维状态压缩到潜在空间并捕捉其时间演化。一个DRL智能体完全在这个降维空间中进行训练,与由类似传感器的时空测量(如压力和速度场)构建的代理模型进行交互。一个CNN-MLP奖励估计器提供数据驱动的反馈,而无需访问控制方程。该方法在包括伯格斯方程、Kuramoto-Sivashinsky方程以及绕圆柱流动等基准系统上进行了测试;在绕方柱流动上进一步验证了其准确性。实验结果表明,与传统的基于模拟的训练相比,所提出的方法实现了精确的重建、鲁棒的控制以及显著的计算加速。这些发现证实了FAE-CAE-LSTM代理模型在实现基于DRL的非线性动力系统实时、传感器驱动、可扩展控制方面的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/9e180ccf0df6/sensors-25-05149-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/cafd53dc06ee/sensors-25-05149-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/df50637ab199/sensors-25-05149-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/f303a0546c56/sensors-25-05149-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/04906a9c8bba/sensors-25-05149-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/11674ea76475/sensors-25-05149-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/686e9d979866/sensors-25-05149-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/cb8a2c4928a6/sensors-25-05149-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/9e180ccf0df6/sensors-25-05149-g010a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/cafd53dc06ee/sensors-25-05149-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/df50637ab199/sensors-25-05149-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/f303a0546c56/sensors-25-05149-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/04906a9c8bba/sensors-25-05149-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/11674ea76475/sensors-25-05149-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/686e9d979866/sensors-25-05149-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/cb8a2c4928a6/sensors-25-05149-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/12390565/9e180ccf0df6/sensors-25-05149-g010a.jpg

相似文献

1
Sensor-Driven Surrogate Modeling and Control of Nonlinear Dynamical Systems Using FAE-CAE-LSTM and Deep Reinforcement Learning.基于FAE-CAE-LSTM和深度强化学习的传感器驱动的非线性动力系统代理建模与控制
Sensors (Basel). 2025 Aug 19;25(16):5149. doi: 10.3390/s25165149.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Management of urinary stones by experts in stone disease (ESD 2025).结石病专家对尿路结石的管理(2025年结石病专家共识)
Arch Ital Urol Androl. 2025 Jun 30;97(2):14085. doi: 10.4081/aiua.2025.14085.
5
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
6
Performance Improvement of Seismic Response Prediction Using the LSTM-PINN Hybrid Method.使用长短期记忆神经网络-物理信息神经网络混合方法改进地震响应预测性能
Biomimetics (Basel). 2025 Jul 24;10(8):490. doi: 10.3390/biomimetics10080490.
7
Anterior Approach Total Ankle Arthroplasty with Patient-Specific Cut Guides.使用患者特异性截骨导向器的前路全踝关节置换术。
JBJS Essent Surg Tech. 2025 Aug 15;15(3). doi: 10.2106/JBJS.ST.23.00027. eCollection 2025 Jul-Sep.
8
Idiopathic (Genetic) Generalized Epilepsy特发性(遗传性)全身性癫痫
9
Deep Learning Approaches to Surrogates for Solving the Diffusion Equation for Mechanistic Real-World Simulations.用于解决实际机理模拟扩散方程的替代物的深度学习方法。
Front Physiol. 2021 Jun 24;12:667828. doi: 10.3389/fphys.2021.667828. eCollection 2021.
10
Q-learning with temporal memory to navigate turbulence.基于时间记忆的Q学习以应对动荡。
Elife. 2025 Jul 21;13:RP102906. doi: 10.7554/eLife.102906.

本文引用的文献

1
Deep Learning-Based Transmitter Localization in Sparse Wireless Sensor Networks.稀疏无线传感器网络中基于深度学习的发射机定位
Sensors (Basel). 2024 Aug 18;24(16):5335. doi: 10.3390/s24165335.
2
ACTOR: Adaptive Control of Transmission Power in RPL.ACTOR:RPL中传输功率的自适应控制。
Sensors (Basel). 2024 Apr 6;24(7):2330. doi: 10.3390/s24072330.
3
Deep reinforcement learning for turbulent drag reduction in channel flows.用于渠道流中湍流减阻的深度强化学习。
Eur Phys J E Soft Matter. 2023 Apr 11;46(4):27. doi: 10.1140/epje/s10189-023-00285-8.
4
A Critical Review of Deep Learning-Based Multi-Sensor Fusion Techniques.深度学习的多传感器融合技术综述。
Sensors (Basel). 2022 Dec 1;22(23):9364. doi: 10.3390/s22239364.
5
A Comparative Analysis of Hybrid Deep Learning Models for Human Activity Recognition.混合深度学习模型在人体活动识别中的对比分析。
Sensors (Basel). 2020 Oct 7;20(19):5707. doi: 10.3390/s20195707.
6
Control of chaotic systems by deep reinforcement learning.基于深度强化学习的混沌系统控制
Proc Math Phys Eng Sci. 2019 Nov;475(2231):20190351. doi: 10.1098/rspa.2019.0351. Epub 2019 Nov 6.
7
Learning to soar in turbulent environments.学会在动荡环境中翱翔。
Proc Natl Acad Sci U S A. 2016 Aug 16;113(33):E4877-84. doi: 10.1073/pnas.1606075113. Epub 2016 Aug 1.
8
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
9
Auto-association by multilayer perceptrons and singular value decomposition.多层感知器和奇异值分解的自联想
Biol Cybern. 1988;59(4-5):291-4. doi: 10.1007/BF00332918.