Department of Engineering Cybernetics, Norwegian University of Science and Technology, O. S. Bragstads plass 2, Trondheim, NO-7034, Norway.
School of Mechanical and Aerospace Engineering, Oklahoma State University, 201 General Academic Building Stillwater, OK 74078, United States of America.
Neural Netw. 2022 Oct;154:333-345. doi: 10.1016/j.neunet.2022.07.023. Epub 2022 Jul 26.
The success of the current wave of artificial intelligence can be partly attributed to deep neural networks, which have proven to be very effective in learning complex patterns from large datasets with minimal human intervention. However, it is difficult to train these models on complex dynamical systems from data alone due to their low data efficiency and sensitivity to hyperparameters and initialisation. This work demonstrates that injection of partially known information at an intermediate layer in a DNN can improve model accuracy, reduce model uncertainty, and yield improved convergence during the training. The value of these physics-guided neural networks has been demonstrated by learning the dynamics of a wide variety of nonlinear dynamical systems represented by five well-known equations in nonlinear systems theory: the Lotka-Volterra, Duffing, Van der Pol, Lorenz, and Henon-Heiles systems.
当前人工智能浪潮的成功部分归因于深度神经网络,这些网络在最小的人为干预下,从大型数据集学习复杂模式的能力非常有效。然而,由于其数据效率低、对超参数和初始化敏感,仅从数据上对这些模型进行训练是很困难的。这项工作表明,在 DNN 的中间层注入部分已知信息可以提高模型的准确性,降低模型的不确定性,并在训练过程中提高收敛速度。通过学习非线性系统理论中五个著名方程所表示的各种非线性动力系统的动力学,证明了这些物理引导神经网络的价值:Lotka-Volterra、Duffing、Van der Pol、Lorenz 和 Henon-Heiles 系统。