基于词典选择的多任务进化机器人

In Evolutionary Robotics, Lexicase selection has proven effective when a single task is broken down into many individual parameterizations. Evolved individuals have generalized across unique configurations of an overarching task. Here, we investigate the ability of Lexicase selection to generalize across multiple tasks, with each task again broken down into many instances. There are three objectives: to determine the feasibility of introducing additional tasks to the existing platform; to investigate any consequential effects of introducing these additional tasks during evolutionary adaptation; and to explore whether the schedule of presentation of the additional tasks over evolutionary time affects the final outcome. To address these aims we use a quadruped animat controlled by a feed-forward neural network with joint-angle, bearing-to-target, and spontaneous sinusoidal inputs. Weights in this network are trained using evolution with Lexicase-based parent selection. Simultaneous adaptation in a wall crossing task (labelled wall-cross) is explored when one of two different alternative tasks is also present: turn-and-seek or cargo-carry. Each task is parameterized into 100 distinct variants, and these variants are used as environments for evaluation and selection with Lexicase. We use performance in a single-task wall-cross environment as a baseline against which to examine the multi-task configurations. In addition, the objective sampling strategy (the manner in which tasks are presented over evolutionary time) is varied, and so data for treatments implementing uniform sampling, even sampling, or degrees of generational sampling are also presented. The Lexicase mechanism successfully integrates evolution of both turn-and-seek and cargo-carry with wall-cross, though there is a performance penalty compared to single task evolution. The size of the penalty depends on the similarity of the tasks. Complementary tasks (wallcross/turn-and-seek) show better performance than antagonistic tasks (wall-cross/cargo-carry). In complementary tasks performance is not affected by the sampling strategy. Where tasks are antagonistic, uniform and even sampling strategies yield significantly better performance than generational sampling. In all cases the generational sampling requires more evaluations and consequently more computational resources. The results indicate that Lexicase is a viable mechanism for multitask evolution of animat neurocontrollers, though the degree of interference between tasks is a key consideration. The results also support the conclusion that the naive, uniform random sampling strategy is the best choice when considering final task performance, simplicity of implementation, and computational efficiency.

在进化机器人学中，当单个任务被分解为许多单独的参数化时，Lexicase 选择已被证明是有效的。进化个体在总体任务的独特配置上进行了泛化。在这里，我们研究了 Lexicase 选择在多个任务中进行泛化的能力，每个任务再次分解为许多实例。有三个目标：确定将其他任务引入现有平台的可行性；调查在进化适应过程中引入这些附加任务的任何后果；并探索在进化时间内呈现附加任务的时间表是否会影响最终结果。为了实现这些目标，我们使用由关节角度、目标指向和自发正弦输入控制的四足动物机器人。使用基于 Lexicase 的父代选择进行进化训练网络的权重。当两个不同的替代任务之一同时存在时，探索穿越墙壁任务（标记为 wall-cross）的同时适应：转向和搜索或货物运输。每个任务都被参数化为 100 个不同的变体，这些变体用作 Lexicase 评估和选择的环境。我们使用在单个任务穿越墙壁环境中的性能作为基线，以检查多任务配置。此外，还改变了目标采样策略（任务在进化过程中的呈现方式），因此还提供了实现均匀采样、均匀采样或世代采样程度的处理数据。Lexicase 机制成功地将转向和搜索以及货物运输的进化与穿越墙壁相结合，尽管与单个任务进化相比存在性能损失。惩罚的大小取决于任务的相似性。互补任务（wallcross/turn-and-seek）的性能优于拮抗任务（wall-cross/cargo-carry）。在互补任务中，采样策略对性能没有影响。在任务拮抗的情况下，均匀采样和均匀采样策略比世代采样策略产生显著更好的性能。在所有情况下，世代采样都需要更多的评估，因此需要更多的计算资源。结果表明，Lexicase 是多任务进化机器人神经控制器的可行机制，尽管任务之间的干扰程度是一个关键考虑因素。结果还支持这样的结论，即在考虑最终任务性能、实现简单性和计算效率时，幼稚的、均匀的随机采样策略是最佳选择。

Lexicase Selection for Multi-Task Evolutionary Robotics.

机构信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献