• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于作用相关启发式动态规划的盾构机土压平衡最优控制。

Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming.

机构信息

School of Information and Control Engineering, Liaoning Shihua University, Fushun 113001, China.

School of Information and Control Engineering, Liaoning Shihua University, Fushun 113001, China.

出版信息

ISA Trans. 2019 Nov;94:28-35. doi: 10.1016/j.isatra.2019.04.007. Epub 2019 Apr 18.

DOI:10.1016/j.isatra.2019.04.007
PMID:31014545
Abstract

Earth pressure balance (EPB) shield has been widely used in underground construction. The excavation face stability is crucial to avoid the accidents caused by EPB shield tunneling, so that it is very important to propose an effective control method for the earth pressure balance in sealed cabin. Considering the problem that stable automatic control of the earth pressure in shield's sealed cabin is difficult, an optimal control method of the earth pressure is proposed based on action-dependent heuristic dynamic programming (ADHDP), which can realize online autonomous learning and adaptive control in tunneling process. According to Bellman's principle of optimality, the cost function with respect to the sealed cabin's earth pressure is given. In addition, the action network and critic network of ADHDP controller are constructed. The critic network approximates the cost function and feeds error back into the action network. With the goal of minimizing the cost function, the action network utilizes the critic network's error to optimize screw conveyor speed. The simulation results show that the earth pressure controller based on ADHDP can realize the earth pressure balance control, and the control process is steadier. Moreover, ADHDP controller has good dynamic performance and anti-interference ability.

摘要

土压平衡(EPB)盾构机已广泛应用于地下工程施工中。为避免因 EPB 盾构隧道施工引起的事故,掌子面稳定至关重要,因此提出一种有效的密封舱土压平衡控制方法非常重要。针对盾构密封舱土压稳定自动控制困难的问题,提出了一种基于动作相关启发式动态规划(ADHDP)的土压最优控制方法,该方法可在隧道施工过程中实现在线自主学习和自适应控制。根据最优性贝尔曼原理,给出了密封舱土压的代价函数。此外,构建了 ADHDP 控制器的动作网络和评价网络。评价网络逼近代价函数,并将误差反馈给动作网络。以代价函数最小化为目标,动作网络利用评价网络的误差来优化螺旋输送机的速度。仿真结果表明,基于 ADHDP 的土压控制器能够实现土压平衡控制,且控制过程更加平稳。此外,ADHDP 控制器具有良好的动态性能和抗干扰能力。

相似文献

1
Optimal control for earth pressure balance of shield machine based on action-dependent heuristic dynamic programming.基于作用相关启发式动态规划的盾构机土压平衡最优控制。
ISA Trans. 2019 Nov;94:28-35. doi: 10.1016/j.isatra.2019.04.007. Epub 2019 Apr 18.
2
Adaptive learning in tracking control based on the dual critic network design.基于双 Critic 网络设计的跟踪控制自适应学习。
IEEE Trans Neural Netw Learn Syst. 2013 Jun;24(6):913-28. doi: 10.1109/TNNLS.2013.2247627.
3
Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming.基于自适应动态规划的吸气式高超音速飞行器跟踪控制。
IEEE Trans Neural Netw Learn Syst. 2017 Mar;28(3):584-598. doi: 10.1109/TNNLS.2016.2516948. Epub 2016 Feb 3.
4
Calculation model of the equivalent spiral shear stress of conditioned sand.原状砂等效螺旋剪切应力的计算模型。
PLoS One. 2019 Mar 13;14(3):e0212923. doi: 10.1371/journal.pone.0212923. eCollection 2019.
5
SVM-based tree-type neural networks as a critic in adaptive critic designs for control.基于支持向量机的树型神经网络在自适应评判控制设计中作为评判器。
IEEE Trans Neural Netw. 2007 Jul;18(4):1016-30. doi: 10.1109/TNN.2007.899255.
6
Online discrete-time LQR controller design with integral action for bulk Bucket Wheel Reclaimer operational processes via Action-Dependent Heuristic Dynamic Programming.基于动作依赖启发式动态规划的具有积分作用的在线离散时间LQR控制器设计,用于斗轮堆取料机的运行过程
ISA Trans. 2019 Jul;90:294-310. doi: 10.1016/j.isatra.2019.01.010. Epub 2019 Jan 30.
7
Use of a 10.22 m diameter EPB shield: a case study in Beijing subway construction.直径10.22米土压平衡盾构的应用:北京地铁建设中的一个案例研究
Springerplus. 2016 Nov 24;5(1):2004. doi: 10.1186/s40064-016-3672-5. eCollection 2016.
8
Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators.基于在线逼近器的仿射非线性离散时间系统强化学习控制器设计
IEEE Trans Syst Man Cybern B Cybern. 2012 Apr;42(2):377-90. doi: 10.1109/TSMCB.2011.2166384. Epub 2011 Sep 23.
9
Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems.基于自适应动态规划的数据驱动跟踪控制方法在一类连续时间非线性系统中的应用。
IEEE Trans Cybern. 2017 Jun;47(6):1460-1470. doi: 10.1109/TCYB.2016.2548941. Epub 2016 Apr 22.
10
Adaptive critic learning techniques for engine torque and air-fuel ratio control.用于发动机扭矩和空燃比控制的自适应评判学习技术。
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):988-93. doi: 10.1109/TSMCB.2008.922019.