Brunton Steven L, Proctor Joshua L, Kutz J Nathan
Department of Mechanical Engineering, University of Washington, Seattle, WA 98195;
Institute for Disease Modeling, Bellevue, WA 98005;
Proc Natl Acad Sci U S A. 2016 Apr 12;113(15):3932-7. doi: 10.1073/pnas.1517384113. Epub 2016 Mar 28.
Extracting governing equations from data is a central challenge in many diverse areas of science and engineering. Data are abundant whereas models often remain elusive, as in climate science, neuroscience, ecology, finance, and epidemiology, to name only a few examples. In this work, we combine sparsity-promoting techniques and machine learning with nonlinear dynamical systems to discover governing equations from noisy measurement data. The only assumption about the structure of the model is that there are only a few important terms that govern the dynamics, so that the equations are sparse in the space of possible functions; this assumption holds for many physical systems in an appropriate basis. In particular, we use sparse regression to determine the fewest terms in the dynamic governing equations required to accurately represent the data. This results in parsimonious models that balance accuracy with model complexity to avoid overfitting. We demonstrate the algorithm on a wide range of problems, from simple canonical systems, including linear and nonlinear oscillators and the chaotic Lorenz system, to the fluid vortex shedding behind an obstacle. The fluid example illustrates the ability of this method to discover the underlying dynamics of a system that took experts in the community nearly 30 years to resolve. We also show that this method generalizes to parameterized systems and systems that are time-varying or have external forcing.
从数据中提取控制方程是许多不同科学和工程领域的核心挑战。数据丰富,而模型往往难以捉摸,比如气候科学、神经科学、生态学、金融学和流行病学等,仅举几个例子。在这项工作中,我们将促进稀疏性的技术和机器学习与非线性动力系统相结合,以便从噪声测量数据中发现控制方程。关于模型结构的唯一假设是,只有少数重要项支配动力学,从而使方程在可能函数的空间中是稀疏的;在适当的基下,这个假设对许多物理系统都成立。特别是,我们使用稀疏回归来确定动态控制方程中准确表示数据所需的最少项。这就产生了简洁的模型,这些模型在准确性和模型复杂性之间取得平衡,以避免过拟合。我们在广泛的问题上展示了该算法,从简单的典型系统,包括线性和非线性振荡器以及混沌洛伦兹系统,到障碍物后面的流体涡旋脱落。流体的例子说明了这种方法发现一个系统潜在动力学的能力,而该系统的潜在动力学让该领域的专家花了近30年才得以解决。我们还表明,这种方法可以推广到参数化系统以及时变或有外部强迫的系统。