• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过观察和经验构建高性能的类人战术智能体。

Building high-performing human-like tactical agents through observation and experience.

作者信息

Stein Gary, Gonzalez Avelino J

机构信息

School of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32826, USA.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):792-804. doi: 10.1109/TSMCB.2010.2091955. Epub 2010 Dec 17.

DOI:10.1109/TSMCB.2010.2091955
PMID:21172756
Abstract

This paper describes a two-phase approach for automating the agent-building process when the agent is to perform tactical tasks. The research is inspired by how humans learn-first by observation of a teacher's performance and then by practicing the performance themselves. The objectives of this approach are to produce a high-performing agent that 1) approaches or exceeds the proficiency of a human and 2) does so in a human-like manner. We accomplish these objectives by combining observational learning with experiential learning. These processes are executed sequentially, with the former creating a competent but somewhat limited human-like model from scratch, and the latter improving its performance without significantly eroding its human-like qualities. The process is described in detail, and test results confirming our hypothesis are described.

摘要

本文描述了一种两阶段方法,用于在智能体执行战术任务时自动执行智能体构建过程。该研究的灵感来源于人类的学习方式——首先观察教师的表现,然后自己练习该表现。这种方法的目标是生成一个高性能智能体,该智能体要做到:1)接近或超过人类的熟练程度;2)以类似人类的方式做到这一点。我们通过将观察学习与经验学习相结合来实现这些目标。这些过程按顺序执行,前者从零开始创建一个有能力但在某种程度上有限的类人模型,后者在不显著削弱其类人特质的情况下提高其性能。文中详细描述了该过程,并描述了证实我们假设的测试结果。

相似文献

1
Building high-performing human-like tactical agents through observation and experience.通过观察和经验构建高性能的类人战术智能体。
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):792-804. doi: 10.1109/TSMCB.2010.2091955. Epub 2010 Dec 17.
2
CPG-inspired workspace trajectory generation and adaptive locomotion control for quadruped robots.受CPG启发的四足机器人工作空间轨迹生成与自适应运动控制
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):867-80. doi: 10.1109/TSMCB.2010.2097589. Epub 2011 Jan 6.
3
A flooding algorithm for multirobot exploration.一种用于多机器人探索的泛洪算法。
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):850-63. doi: 10.1109/TSMCB.2011.2179799. Epub 2012 Jan 23.
4
Language bootstrapping: learning word meanings from perception-action association.语言自引导:从感知 - 行动关联中学习词义。
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):660-71. doi: 10.1109/TSMCB.2011.2172420. Epub 2011 Nov 16.
5
Discovery of high-level behavior from observation of human performance in a strategic game.
IEEE Trans Syst Man Cybern B Cybern. 2008 Jun;38(3):855-74. doi: 10.1109/TSMCB.2008.922062.
6
Flocking of multiple mobile robots based on backstepping.基于反步法的多移动机器人聚集
IEEE Trans Syst Man Cybern B Cybern. 2011 Apr;41(2):414-24. doi: 10.1109/TSMCB.2010.2056917. Epub 2010 Aug 12.
7
Contact-state classification in human-demonstrated robot compliant motion tasks using the boosting algorithm.基于提升算法的人体示范机器人柔顺运动任务中的接触状态分类
IEEE Trans Syst Man Cybern B Cybern. 2010 Oct;40(5):1372-86. doi: 10.1109/TSMCB.2009.2038492. Epub 2010 Jan 26.
8
An object-based visual attention model for robotic applications.一种用于机器人应用的基于对象的视觉注意力模型。
IEEE Trans Syst Man Cybern B Cybern. 2010 Oct;40(5):1398-412. doi: 10.1109/TSMCB.2009.2038895. Epub 2010 Feb 2.
9
Generalized sampling-based motion planners.基于广义采样的运动规划器。
IEEE Trans Syst Man Cybern B Cybern. 2011 Jun;41(3):855-66. doi: 10.1109/TSMCB.2010.2098438. Epub 2011 Jan 28.
10
Symbolic dynamic filtering and language measure for behavior identification of mobile robots.用于移动机器人行为识别的符号动态滤波与语言测度
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):647-59. doi: 10.1109/TSMCB.2011.2172419. Epub 2011 Nov 3.