Department of Mathematics, UCLA, Los Angeles, CA 90024, USA.
Department of Mathematics, Florida State University, Tallahassee, FL 32304, USA.
Neural Netw. 2024 Dec;180:106707. doi: 10.1016/j.neunet.2024.106707. Epub 2024 Sep 20.
Approximating nonlinear differential equations using a neural network provides a robust and efficient tool for various scientific computing tasks, including real-time predictions, inverse problems, optimal controls, and surrogate modeling. Previous works have focused on embedding dynamical systems into networks through two approaches: learning a single operator (i.e., the mapping from input parameterised functions to solutions) or learning the governing system of equations (i.e., the constitutive model relative to the state variables). Both of these approaches yield different representations for the same underlying data or function. Observing that families of differential equations often share key characteristics, we seek one network representation across a wide range of equations. Our multimodality approach, called Predicting Multiple Operators and Symbolic Expressions (PROSE), is capable of constructing multi-operators and governing equations simultaneously through a novel fusion structure. In particular, PROSE solves differential equations, predicts future states, and generates the underlying equations of motion by incorporating symbolic "words" through a language model. Experiments with 25600 distinct equations show that PROSE benefits from its multimodal nature, resulting in robust generalization (e.g. noisy observations, equation misspecification, and data imbalance) supported by comparison and ablation studies. PROSE provides a new operator learning framework that incorporates multimodal input/output and language models for solving forward and inverse problems related to differential equations.
使用神经网络逼近非线性微分方程为各种科学计算任务提供了一个强大而高效的工具,包括实时预测、反问题、最优控制和代理建模。以前的工作主要集中在通过两种方法将动力系统嵌入到网络中:学习单个算子(即从输入参数化函数到解的映射)或学习控制方程的系统(即相对于状态变量的本构模型)。这两种方法都为相同的基础数据或函数提供了不同的表示。我们观察到微分方程族通常具有关键的共同特征,因此我们寻求在广泛的方程范围内使用一种网络表示。我们的多模态方法,称为预测多个算子和符号表达式 (PROSE),通过一种新颖的融合结构能够同时构建多算子和控制方程。具体来说,PROSE 通过语言模型结合符号“单词”来求解微分方程、预测未来状态并生成运动的基本方程。对 25600 个不同方程的实验表明,PROSE 受益于其多模态性质,通过比较和消融研究支持稳健的泛化(例如噪声观测、方程指定错误和数据不平衡)。PROSE 提供了一种新的算子学习框架,它结合了多模态输入/输出和语言模型,用于解决与微分方程相关的正向和反向问题。