Werbos Paul J
ECCS Division, Room 525, National Science Foundation, Arlington, VA 22230, United States.
Neural Netw. 2009 Apr;22(3):200-12. doi: 10.1016/j.neunet.2009.03.012. Epub 2009 Mar 29.
This paper presents a theory of how general-purpose learning-based intelligence is achieved in the mammal brain, and how we can replicate it. It reviews four generations of ever more powerful general-purpose learning designs in Adaptive, Approximate Dynamic Programming (ADP), which includes reinforcement learning as a special case. It reviews empirical results which fit the theory, and suggests important new directions for research, within the scope of NSF's recent initiative on Cognitive Optimization and Prediction. The appendices suggest possible connections to the realms of human subjective experience, comparative cognitive neuroscience, and new challenges in electric power. The major challenge before us today in mathematical neural networks is to replicate the "mouse level", but the paper does contain a few thoughts about building, understanding and nourishing levels of general intelligence beyond the mouse.
本文提出了一种关于哺乳动物大脑如何实现基于通用学习的智能以及我们如何复制它的理论。它回顾了自适应近似动态规划(ADP)中四代功能越来越强大的通用学习设计,其中强化学习是一个特例。它回顾了符合该理论的实证结果,并在国家科学基金会最近关于认知优化与预测的倡议范围内,提出了重要的新研究方向。附录提出了与人类主观体验领域、比较认知神经科学以及电力新挑战的可能联系。当今数学神经网络面临的主要挑战是复制“小鼠水平”,但本文确实包含了一些关于构建、理解和培育超越小鼠的通用智能水平的想法。