结合数据同化和机器学习来推断未解析尺度参数化。

Combining data assimilation and machine learning to infer unresolved scale parametrization.

机构信息

Nansen Center (NERSC), 5006 Bergen, Norway.

Sorbonne University, Paris, France.

出版信息

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200086. doi: 10.1098/rsta.2020.0086. Epub 2021 Feb 15.

DOI:10.1098/rsta.2020.0086

PMID:33583267

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7898132/

Abstract

In recent years, machine learning (ML) has been proposed to devise data-driven parametrizations of unresolved processes in dynamical numerical models. In most cases, the ML training leverages high-resolution simulations to provide a dense, noiseless target state. Our goal is to go beyond the use of high-resolution simulations and train ML-based parametrization using direct data, in the realistic scenario of noisy and sparse observations. The algorithm proposed in this work is a two-step process. First, data assimilation (DA) techniques are applied to estimate the full state of the system from a truncated model. The unresolved part of the truncated model is viewed as a model error in the DA system. In a second step, ML is used to emulate the unresolved part, a predictor of model error given the state of the system. Finally, the ML-based parametrization model is added to the physical core truncated model to produce a hybrid model. The DA component of the proposed method relies on an ensemble Kalman filter while the ML parametrization is represented by a neural network. The approach is applied to the two-scale Lorenz model and to MAOOAM, a reduced-order coupled ocean-atmosphere model. We show that in both cases, the hybrid model yields forecasts with better skill than the truncated model. Moreover, the attractor of the system is significantly better represented by the hybrid model than by the truncated model. This article is part of the theme issue 'Machine learning for weather and climate modelling'.

摘要

近年来，机器学习（ML）已被提议用于设计动力数值模型中未解析过程的数据驱动参数化。在大多数情况下，ML 训练利用高分辨率模拟提供密集、无噪声的目标状态。我们的目标是超越使用高分辨率模拟，并在存在噪声和稀疏观测的现实情况下，使用直接数据训练基于 ML 的参数化。本文提出的算法是一个两步过程。首先，应用数据同化（DA）技术从截断模型中估计系统的完整状态。截断模型的未解析部分被视为 DA 系统中的模型误差。在第二步中，使用 ML 模拟未解析部分，即给定系统状态时的模型误差预测器。最后，将基于 ML 的参数化模型添加到物理核心截断模型中以生成混合模型。所提出方法的 DA 组件依赖于集合卡尔曼滤波器，而 ML 参数化由神经网络表示。该方法应用于两尺度洛伦兹模型和 MAOOAM，一种简化的耦合海洋-大气模型。我们表明，在这两种情况下，混合模型产生的预测比截断模型具有更好的技能。此外，混合模型比截断模型更能代表系统的吸引子。本文是主题为“机器学习在天气和气候建模中的应用”的一部分。

相似文献

Combining data assimilation and machine learning to infer unresolved scale parametrization.结合数据同化和机器学习来推断未解析尺度参数化。

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200086. doi: 10.1098/rsta.2020.0086. Epub 2021 Feb 15.

Using data assimilation to train a hybrid forecast system that combines machine-learning and knowledge-based components.利用数据同化来训练一个结合机器学习和基于知识组件的混合预测系统。

Chaos. 2021 May;31(5):053114. doi: 10.1063/5.0048050.

Predicting atmospheric optical properties for radiative transfer computations using neural networks.利用神经网络预测辐射传输计算中的大气光学性质。

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200095. doi: 10.1098/rsta.2020.0095. Epub 2021 Feb 15.

Learning earth system models from observations: machine learning or data assimilation?从观测中学习地球系统模型：机器学习还是数据同化？

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200089. doi: 10.1098/rsta.2020.0089. Epub 2021 Feb 15.

Deep learning for post-processing ensemble weather forecasts.用于后处理集合天气预报的深度学习

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200092. doi: 10.1098/rsta.2020.0092. Epub 2021 Feb 15.

Machine learning for weather and climate are worlds apart.用于天气和气候的机器学习有着天壤之别。

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200098. doi: 10.1098/rsta.2020.0098. Epub 2021 Feb 15.

Stochastic parametrizations and model uncertainty in the Lorenz '96 system.洛伦茨 '96 系统中的随机参数化和模型不确定性。

Philos Trans A Math Phys Eng Sci. 2013 Apr 15;371(1991):20110479. doi: 10.1098/rsta.2011.0479. Print 2013 May 28.

Data-based stochastic subgrid-scale parametrization: an approach using cluster-weighted modelling.基于数据的随机子网格尺度参数化：一种使用聚类加权建模的方法。

Philos Trans A Math Phys Eng Sci. 2012 Mar 13;370(1962):1061-86. doi: 10.1098/rsta.2011.0384.

Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems.相比于 Lorenz 系统中的储层计算机，域驱动模型以更低的成本产生更好的预测结果。

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200246. doi: 10.1098/rsta.2020.0246. Epub 2021 Feb 15.

Can deep learning beat numerical weather prediction?深度学习能打败数值天气预报吗？

Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200097. doi: 10.1098/rsta.2020.0097. Epub 2021 Feb 15.

引用本文的文献

Learning PDE to Model Self-Organization of Matter.学习偏微分方程以对物质的自组织进行建模。

Entropy (Basel). 2022 Aug 9;24(8):1096. doi: 10.3390/e24081096.

本文引用的文献

Learning latent dynamics for partially observed chaotic systems.学习部分观测混沌系统的潜在动力学。

Chaos. 2020 Oct;30(10):103121. doi: 10.1063/5.0019309.

Satellite-based time-series of sea-surface temperature since 1981 for climate applications.1981 年以来用于气候应用的基于卫星的海面温度时间序列。

Sci Data. 2019 Oct 22;6(1):223. doi: 10.1038/s41597-019-0236-x.

Deep learning to represent subgrid processes in climate models.深度学习在气候模型中表示次网格过程。

Proc Natl Acad Sci U S A. 2018 Sep 25;115(39):9684-9689. doi: 10.1073/pnas.1810286115. Epub 2018 Sep 6.

Origin and scaling of chaos in weakly coupled phase oscillators.弱耦合相振荡器中的混沌起源和标度。

Phys Rev E. 2018 Jan;97(1-1):012203. doi: 10.1103/PhysRevE.97.012203.

Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach.基于数据的大时空混沌系统无模型预测：一种回声状态网络方法。

Phys Rev Lett. 2018 Jan 12;120(2):024102. doi: 10.1103/PhysRevLett.120.024102.

Discovering governing equations from data by sparse identification of nonlinear dynamical systems.通过非线性动力系统的稀疏识别从数据中发现控制方程。

Proc Natl Acad Sci U S A. 2016 Apr 12;113(15):3932-7. doi: 10.1073/pnas.1517384113. Epub 2016 Mar 28.

Deep learning.深度学习。

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验