Pontes-Filho Sidney, Olsen Kristoffer, Yazidi Anis, Riegler Michael A, Halvorsen Pål, Nichele Stefano
Department of Computer Science, Oslo Metropolitan University, Oslo, Norway.
Department of Computer Science, Norwegian University of Science and Technology, Trondheim, Norway.
Front Robot AI. 2022 Oct 14;9:1007547. doi: 10.3389/frobt.2022.1007547. eCollection 2022.
In this work, we argue that the search for Artificial General Intelligence should start from a much lower level than human-level intelligence. The circumstances of intelligent behavior in nature resulted from an organism interacting with its surrounding environment, which could change over time and exert pressure on the organism to allow for learning of new behaviors or environment models. Our hypothesis is that learning occurs through interpreting sensory feedback when an agent acts in an environment. For that to happen, a body and a reactive environment are needed. We evaluate a method to evolve a biologically-inspired artificial neural network that learns from environment reactions named Neuroevolution of Artificial General Intelligence, a framework for low-level artificial general intelligence. This method allows the evolutionary complexification of a randomly-initialized spiking neural network with adaptive synapses, which controls agents instantiated in mutable environments. Such a configuration allows us to benchmark the adaptivity and generality of the controllers. The chosen tasks in the mutable environments are food foraging, emulation of logic gates, and cart-pole balancing. The three tasks are successfully solved with rather small network topologies and therefore it opens up the possibility of experimenting with more complex tasks and scenarios where curriculum learning is beneficial.
在这项工作中,我们认为对通用人工智能的探索应该从比人类水平智能低得多的层面开始。自然界中智能行为的情况是由生物体与其周围环境相互作用产生的,环境会随时间变化并对生物体施加压力,从而促使其学习新行为或环境模型。我们的假设是,当智能体在环境中行动时,学习是通过解释感官反馈来发生的。为此,需要一个身体和一个反应性环境。我们评估了一种方法,用于进化一种受生物启发的人工神经网络,该网络从名为通用人工智能神经进化的环境反应中学习,这是一个用于低级通用人工智能的框架。这种方法允许对具有自适应突触的随机初始化脉冲神经网络进行进化复杂化,该网络控制在可变环境中实例化的智能体。这样的配置使我们能够对控制器的适应性和通用性进行基准测试。在可变环境中选择的任务是食物觅食、逻辑门模拟和推车-摆杆平衡。这三个任务通过相当小的网络拓扑结构成功解决,因此为试验更复杂的任务和课程学习有益的场景开辟了可能性。