Nansen Center (NERSC), 5006 Bergen, Norway.
Sorbonne University, Paris, France.
Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200086. doi: 10.1098/rsta.2020.0086. Epub 2021 Feb 15.
In recent years, machine learning (ML) has been proposed to devise data-driven parametrizations of unresolved processes in dynamical numerical models. In most cases, the ML training leverages high-resolution simulations to provide a dense, noiseless target state. Our goal is to go beyond the use of high-resolution simulations and train ML-based parametrization using direct data, in the realistic scenario of noisy and sparse observations. The algorithm proposed in this work is a two-step process. First, data assimilation (DA) techniques are applied to estimate the full state of the system from a truncated model. The unresolved part of the truncated model is viewed as a model error in the DA system. In a second step, ML is used to emulate the unresolved part, a predictor of model error given the state of the system. Finally, the ML-based parametrization model is added to the physical core truncated model to produce a hybrid model. The DA component of the proposed method relies on an ensemble Kalman filter while the ML parametrization is represented by a neural network. The approach is applied to the two-scale Lorenz model and to MAOOAM, a reduced-order coupled ocean-atmosphere model. We show that in both cases, the hybrid model yields forecasts with better skill than the truncated model. Moreover, the attractor of the system is significantly better represented by the hybrid model than by the truncated model. This article is part of the theme issue 'Machine learning for weather and climate modelling'.
近年来,机器学习(ML)已被提议用于设计动力数值模型中未解析过程的数据驱动参数化。在大多数情况下,ML 训练利用高分辨率模拟提供密集、无噪声的目标状态。我们的目标是超越使用高分辨率模拟,并在存在噪声和稀疏观测的现实情况下,使用直接数据训练基于 ML 的参数化。本文提出的算法是一个两步过程。首先,应用数据同化(DA)技术从截断模型中估计系统的完整状态。截断模型的未解析部分被视为 DA 系统中的模型误差。在第二步中,使用 ML 模拟未解析部分,即给定系统状态时的模型误差预测器。最后,将基于 ML 的参数化模型添加到物理核心截断模型中以生成混合模型。所提出方法的 DA 组件依赖于集合卡尔曼滤波器,而 ML 参数化由神经网络表示。该方法应用于两尺度洛伦兹模型和 MAOOAM,一种简化的耦合海洋-大气模型。我们表明,在这两种情况下,混合模型产生的预测比截断模型具有更好的技能。此外,混合模型比截断模型更能代表系统的吸引子。本文是主题为“机器学习在天气和气候建模中的应用”的一部分。