NERSC - Lawrence Berkeley National Lab, Berkeley, CA, USA.
Terrafuse Inc., Berkeley, CA, USA.
Philos Trans A Math Phys Eng Sci. 2021 Apr 5;379(2194):20200093. doi: 10.1098/rsta.2020.0093. Epub 2021 Feb 15.
Machine learning (ML) provides novel and powerful ways of accurately and efficiently recognizing complex patterns, emulating nonlinear dynamics, and predicting the spatio-temporal evolution of weather and climate processes. Off-the-shelf ML models, however, do not necessarily obey the fundamental governing laws of physical systems, nor do they generalize well to scenarios on which they have not been trained. We survey systematic approaches to incorporating physics and domain knowledge into ML models and distill these approaches into broad categories. Through 10 case studies, we show how these approaches have been used successfully for emulating, downscaling, and forecasting weather and climate processes. The accomplishments of these studies include greater physical consistency, reduced training time, improved data efficiency, and better generalization. Finally, we synthesize the lessons learned and identify scientific, diagnostic, computational, and resource challenges for developing truly robust and reliable physics-informed ML models for weather and climate processes. This article is part of the theme issue 'Machine learning for weather and climate modelling'.
机器学习(ML)提供了新颖而强大的方法,可以准确有效地识别复杂模式、模拟非线性动态,并预测天气和气候过程的时空演变。然而,现成的 ML 模型不一定遵守物理系统的基本控制定律,也不一定很好地推广到它们没有经过训练的场景。我们调查了将物理和领域知识纳入 ML 模型的系统方法,并将这些方法归纳为广泛的类别。通过 10 个案例研究,我们展示了这些方法如何成功地用于模拟、降尺度和预测天气和气候过程。这些研究的成果包括更高的物理一致性、更短的训练时间、更高的数据效率和更好的泛化能力。最后,我们综合了经验教训,并确定了开发用于天气和气候过程的真正稳健和可靠的物理信息 ML 模型的科学、诊断、计算和资源挑战。本文是“天气和气候建模中的机器学习”主题特刊的一部分。