Bahmer Andreas, Gupta Daya, Effenberger Felix
RheinMain University of Applied Sciences, Ruesselsheim Campus, 65197 Wiesbaden, Germany
Husson University, Bangor, ME 04401, U.S.A.
Neural Comput. 2023 Apr 18;35(5):763-806. doi: 10.1162/neco_a_01575.
Machine learning tools, particularly artificial neural networks (ANN), have become ubiquitous in many scientific disciplines, and machine learning-based techniques flourish not only because of the expanding computational power and the increasing availability of labeled data sets but also because of the increasingly powerful training algorithms and refined topologies of ANN. Some refined topologies were initially motivated by neuronal network architectures found in the brain, such as convolutional ANN. Later topologies of neuronal networks departed from the biological substrate and began to be developed independently as the biological processing units are not well understood or are not transferable to in silico architectures. In the field of neuroscience, the advent of multichannel recordings has enabled recording the activity of many neurons simultaneously and characterizing complex network activity in biological neural networks (BNN). The unique opportunity to compare large neuronal network topologies, processing, and learning strategies with those that have been developed in state-of-the-art ANN has become a reality. The aim of this review is to introduce certain basic concepts of modern ANN, corresponding training algorithms, and biological counterparts. The selection of these modern ANN is prone to be biased (e.g., spiking neural networks are excluded) but may be sufficient for a concise overview.
机器学习工具,尤其是人工神经网络(ANN),在许多科学学科中已变得无处不在。基于机器学习的技术蓬勃发展,这不仅是因为计算能力不断提升以及标记数据集越来越容易获取,还因为训练算法日益强大以及人工神经网络的拓扑结构不断优化。一些优化的拓扑结构最初是受大脑中发现的神经网络架构启发,比如卷积人工神经网络。后来的神经网络拓扑结构脱离了生物基础,随着生物处理单元尚未被充分理解或无法转换为计算机架构,它们开始独立发展。在神经科学领域,多通道记录技术的出现使得同时记录多个神经元的活动并表征生物神经网络(BNN)中的复杂网络活动成为可能。将大型神经元网络拓扑结构、处理方式和学习策略与最先进的人工神经网络中所开发的进行比较,这一独特机会已成为现实。本综述的目的是介绍现代人工神经网络的某些基本概念、相应的训练算法以及生物对应物。对这些现代人工神经网络的选择可能会有偏差(例如,脉冲神经网络被排除在外),但对于简要概述而言可能已足够。