• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于支持向量机的树型神经网络在自适应评判控制设计中作为评判器。

SVM-based tree-type neural networks as a critic in adaptive critic designs for control.

作者信息

Deb Alok Kanti, Gopal Madan, Chandra Suresh

机构信息

Department of Electrical Engineering, Indian Institute of Technology (IIT), New Delhi 110016, India.

出版信息

IEEE Trans Neural Netw. 2007 Jul;18(4):1016-30. doi: 10.1109/TNN.2007.899255.

DOI:10.1109/TNN.2007.899255
PMID:17668658
Abstract

In this paper, we use the approach of adaptive critic design (ACD) for control, specifically, the action-dependent heuristic dynamic programming (ADHDP) method. A least squares support vector machine (SVM) regressor has been used for generating the control actions, while an SVM-based tree-type neural network (NN) is used as the critic. After a failure occurs, the critic and action are retrained in tandem using the failure data. Failure data is binary classification data, where the number of failure states are very few as compared to the number of no-failure states. The difficulty of conventional multilayer feedforward NNs in learning this type of classification data has been overcome by using the SVM-based tree-type NN, which due to its feature to add neurons to learn misclassified data, has the capability to learn any binary classification data without a priori choice of the number of neurons or the structure of the network. The capability of the trained controller to handle unforeseen situations is demonstrated.

摘要

在本文中,我们使用自适应评判设计(ACD)方法进行控制,具体而言,是基于动作的启发式动态规划(ADHDP)方法。采用最小二乘支持向量机(SVM)回归器来生成控制动作,同时将基于SVM的树型神经网络(NN)用作评判器。故障发生后,利用故障数据对评判器和动作进行联合重新训练。故障数据是二元分类数据,与无故障状态的数量相比,故障状态的数量非常少。通过使用基于SVM的树型神经网络克服了传统多层前馈神经网络在学习这类分类数据时的困难,该树型神经网络由于具有添加神经元以学习误分类数据的特性,能够在无需事先选择神经元数量或网络结构的情况下学习任何二元分类数据。展示了训练有素的控制器处理意外情况的能力。

相似文献

1
SVM-based tree-type neural networks as a critic in adaptive critic designs for control.基于支持向量机的树型神经网络在自适应评判控制设计中作为评判器。
IEEE Trans Neural Netw. 2007 Jul;18(4):1016-30. doi: 10.1109/TNN.2007.899255.
2
Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form.非严格反馈形式下非线性离散时间系统的基于神经网络的状态反馈控制
IEEE Trans Neural Netw. 2008 Dec;19(12):2073-87. doi: 10.1109/TNN.2008.2003295.
3
Adaptive critic learning techniques for engine torque and air-fuel ratio control.用于发动机扭矩和空燃比控制的自适应评判学习技术。
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):988-93. doi: 10.1109/TSMCB.2008.922019.
4
Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.基于强化学习的复杂非线性离散时间系统双控制方法及其在火花发动机废气再循环操作中的应用
IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.
5
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.基于强化学习的线性参数化神经网络对非仿射非线性离散时间系统的控制
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):994-1001. doi: 10.1109/TSMCB.2008.926607.
6
A boundedness result for the direct heuristic dynamic programming.直接启发式动态规划的有界性结果。
Neural Netw. 2012 Aug;32:229-35. doi: 10.1016/j.neunet.2012.02.005. Epub 2012 Feb 14.
7
Online learning control using adaptive critic designs with sparse kernel machines.基于稀疏核机器的自适应评论家设计的在线学习控制。
IEEE Trans Neural Netw Learn Syst. 2013 May;24(5):762-75. doi: 10.1109/TNNLS.2012.2236354.
8
Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof.使用近似动态规划的离散时间非线性HJB解:收敛性证明
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):943-9. doi: 10.1109/TSMCB.2008.926614.
9
Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks.基于策略迭代和神经网络的未知约束输入系统自适应最优控制。
IEEE Trans Neural Netw Learn Syst. 2013 Oct;24(10):1513-25. doi: 10.1109/TNNLS.2013.2276571.
10
Adaptive feedback control by constrained approximate dynamic programming.基于约束近似动态规划的自适应反馈控制。
IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):982-7. doi: 10.1109/TSMCB.2008.924140.