Kohl Nate, Miikkulainen Risto
Department of Computer Sciences, The University of Texas at Austin, 1 University Station C0500, Austin, TX, United States.
Neural Netw. 2009 Apr;22(3):326-37. doi: 10.1016/j.neunet.2009.03.001. Epub 2009 Mar 24.
Evolution of neural networks, or neuroevolution, has been a successful approach to many low-level control problems such as pole balancing, vehicle control, and collision warning. However, certain types of problems-such as those involving strategic decision-making-have remained difficult for neuroevolution to solve. This paper evaluates the hypothesis that such problems are difficult because they are fractured: The correct action varies discontinuously as the agent moves from state to state. A method for measuring fracture using the concept of function variation is proposed and, based on this concept, two methods for dealing with fracture are examined: neurons with local receptive fields, and refinement based on a cascaded network architecture. Experiments in several benchmark domains are performed to evaluate how different levels of fracture affect the performance of neuroevolution methods, demonstrating that these two modifications improve performance significantly. These results form a promising starting point for expanding neuroevolution to strategic tasks.
神经网络的进化,即神经进化,已成为解决许多低级控制问题(如摆杆平衡、车辆控制和碰撞预警)的成功方法。然而,某些类型的问题,比如涉及战略决策的问题,对于神经进化来说仍然难以解决。本文评估了这样一种假设,即此类问题之所以困难是因为它们是碎片化的:随着智能体从一个状态转移到另一个状态,正确的行动会不连续地变化。提出了一种使用函数变化概念来衡量碎片化的方法,并基于此概念研究了两种处理碎片化的方法:具有局部感受野的神经元,以及基于级联网络架构的细化。在几个基准领域进行了实验,以评估不同程度的碎片化如何影响神经进化方法的性能,结果表明这两种改进显著提高了性能。这些结果为将神经进化扩展到战略任务提供了一个有前景的起点。