Learning obstacle avoidance with an operant behavior model.

Authors

Gutnisky D A, Zanutto B S

Affiliation

Instituto de Ingeniería Biomédica, FI-Universidad de Buenos Aires, Paseo Colón 850, CP 1063, Buenos Aires, Argentina.

Publication information

Artif Life. 2004 Winter;10(1):65-81. doi: 10.1162/106454604322875913.

DOI: 10.1162/106454604322875913
PMID: 15035863

Abstract

Artificial intelligence researchers have been attracted by the idea of having robots learn how to accomplish a task, rather than being told explicitly. Reinforcement learning has been proposed as an appealing framework to be used in controlling mobile agents. Robot learning research, as well as research in biological systems, face many similar problems in order to display high flexibility in performing a variety of tasks. In this work, the controlling of a vehicle in an avoidance task by a previously developed operant learning model (a form of animal learning) is studied. An environment in which a mobile robot with proximity sensors has to minimize the punishment for colliding against obstacles is simulated. The results were compared with the Q-Learning algorithm, and the proposed model had better performance. In this way a new artificial intelligence agent inspired by neurobiology, psychology, and ethology research is proposed.
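
For readers unfamiliar with the Q-Learning baseline named in the abstract, the following is a minimal sketch of a tabular Q-learning update in a toy avoidance setting. Everything specific here is an assumption made for illustration and is not taken from the paper: the four-level proximity reading, the two actions, the punishment of -1 for a collision, and the values of `alpha`, `gamma`, and `epsilon`. The operant learning model that the paper proposes is not reproduced.

```python
import random

FORWARD, TURN = 0, 1   # hypothetical action set
N_DIST = 4             # hypothetical proximity levels: 0 (touching) .. 3 (clear)


def step(dist, action):
    """Toy dynamics (an assumption, not the paper's simulator).

    Returns (next_dist, reward).
    """
    if action == TURN:
        return N_DIST - 1, 0.0   # turning away faces open space again
    if dist == 0:
        return N_DIST - 1, -1.0  # driving into the obstacle is punished
    return dist - 1, 0.0         # moving forward closes the gap by one level


def greedy_action(q, dist):
    """Pick the action with the highest Q-value (ties go to FORWARD)."""
    return max((FORWARD, TURN), key=lambda a: q[dist][a])


def train(steps=2000, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Run epsilon-greedy tabular Q-learning and return the Q-table."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_DIST)]  # q[dist][action]
    dist = N_DIST - 1
    for _ in range(steps):
        # Explore with probability epsilon, otherwise act greedily.
        if rng.random() < epsilon:
            action = rng.choice((FORWARD, TURN))
        else:
            action = greedy_action(q, dist)
        next_dist, reward = step(dist, action)
        # Standard Q-learning target: reward plus discounted best next value.
        target = reward + gamma * max(q[next_dist])
        q[dist][action] += alpha * (target - q[dist][action])
        dist = next_dist
    return q


if __name__ == "__main__":
    q_table = train()
    for d in range(N_DIST):
        name = "TURN" if greedy_action(q_table, d) == TURN else "FORWARD"
        print(f"proximity level {d}: greedy action = {name}")
```

In this toy setting the only punished move is driving forward when already touching the obstacle, so the learned table should come to prefer turning at proximity level 0. The example illustrates the update rule only and says nothing about the relative performance reported in the paper.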

