• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全球地球物理流中线性求解器的混合精度

Mixed-Precision for Linear Solvers in Global Geophysical Flows.

作者信息

Ackmann Jan, Dueben Peter D, Palmer Tim, Smolarkiewicz Piotr K

机构信息

University of Oxford Oxford UK.

European Centre for Medium Range Weather Forecasts Reading UK.

出版信息

J Adv Model Earth Syst. 2022 Sep;14(9):e2022MS003148. doi: 10.1029/2022MS003148. Epub 2022 Sep 10.

DOI:10.1029/2022MS003148
PMID:36248012
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9541378/
Abstract

Semi-implicit (SI) time-stepping schemes for atmosphere and ocean models require elliptic solvers that work efficiently on modern supercomputers. This paper reports our study of the potential computational savings when using mixed precision arithmetic in the elliptic solvers. Precision levels as low as half (16 bits) are used and a detailed evaluation of the impact of reduced precision on the solver convergence and the solution quality is performed. This study is conducted in the context of a novel SI shallow-water model on the sphere, purposely designed to mimic numerical intricacies of modern all-scale weather and climate (W&C) models. The governing algorithm of the shallow-water model is based on the non-oscillatory MPDATA methods for geophysical flows, whereas the resulting elliptic problem employs a strongly preconditioned non-symmetric Krylov-subspace Generalized Conjugated-Residual (GCR) solver, proven in advanced atmospheric applications. The classical longitude/latitude grid is deliberately chosen to retain the stiffness of global W&C models. The analysis of the precision reduction is done on a software level, using an emulator, whereas the performance is measured on actual reduced precision hardware. The reduced-precision experiments are conducted for established dynamical-core test-cases, like the Rossby-Haurwitz wavenumber 4 and a zonal orographic flow. The study shows that selected key components of the elliptic solver, most prominently the preconditioning and the application of the linear operator, can be performed at the level of half precision. For these components, the use of half precision is found to yield a speed-up of a factor 4 compared to double precision for a wide range of problem sizes.

摘要

用于大气和海洋模型的半隐式(SI)时间步长方案需要在现代超级计算机上高效运行的椭圆型求解器。本文报告了我们在椭圆型求解器中使用混合精度算法时对潜在计算节省的研究。使用了低至半精度(16位)的精度级别,并对精度降低对求解器收敛和求解质量的影响进行了详细评估。这项研究是在一个新颖的球面上的SI浅水模型的背景下进行的,该模型特意设计用于模拟现代全尺度天气和气候(W&C)模型的数值复杂性。浅水模型的控制算法基于用于地球物理流动的非振荡MPDATA方法,而由此产生的椭圆问题采用了一种强预处理的非对称Krylov子空间广义共轭残差(GCR)求解器,该求解器在先进的大气应用中已得到验证。特意选择经典的经纬度网格以保留全球W&C模型的刚性。在软件层面使用模拟器对精度降低进行分析,而在实际的降低精度硬件上测量性能。针对已建立的动力学核心测试用例进行了降低精度的实验,如罗斯比 - 豪维茨波数4和纬向地形气流。研究表明,椭圆型求解器的选定关键组件,最显著的是预处理和线性算子的应用,可以在半精度级别上执行。对于这些组件,发现使用半精度与双精度相比,在广泛的问题规模范围内可实现4倍的加速。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/5439ad32d29e/JAME-14-e2022MS003148-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/bbbb5e6c358d/JAME-14-e2022MS003148-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/204b1119df66/JAME-14-e2022MS003148-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/501815d739c2/JAME-14-e2022MS003148-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/0a2116125861/JAME-14-e2022MS003148-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/b80c35654112/JAME-14-e2022MS003148-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/12fd6e4418ab/JAME-14-e2022MS003148-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/df482c6d90bd/JAME-14-e2022MS003148-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/ed548f8f81d3/JAME-14-e2022MS003148-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/958d145a5f59/JAME-14-e2022MS003148-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/93e7a5992767/JAME-14-e2022MS003148-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/0379df4c6e32/JAME-14-e2022MS003148-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/29aae01cb8b5/JAME-14-e2022MS003148-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/27b33a4d496b/JAME-14-e2022MS003148-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/5439ad32d29e/JAME-14-e2022MS003148-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/bbbb5e6c358d/JAME-14-e2022MS003148-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/204b1119df66/JAME-14-e2022MS003148-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/501815d739c2/JAME-14-e2022MS003148-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/0a2116125861/JAME-14-e2022MS003148-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/b80c35654112/JAME-14-e2022MS003148-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/12fd6e4418ab/JAME-14-e2022MS003148-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/df482c6d90bd/JAME-14-e2022MS003148-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/ed548f8f81d3/JAME-14-e2022MS003148-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/958d145a5f59/JAME-14-e2022MS003148-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/93e7a5992767/JAME-14-e2022MS003148-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/0379df4c6e32/JAME-14-e2022MS003148-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/29aae01cb8b5/JAME-14-e2022MS003148-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/27b33a4d496b/JAME-14-e2022MS003148-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fe90/9541378/5439ad32d29e/JAME-14-e2022MS003148-g003.jpg

相似文献

1
Mixed-Precision for Linear Solvers in Global Geophysical Flows.全球地球物理流中线性求解器的混合精度
J Adv Model Earth Syst. 2022 Sep;14(9):e2022MS003148. doi: 10.1029/2022MS003148. Epub 2022 Sep 10.
2
Newton-Raphson preconditioner for Krylov type solvers on GPU devices.适用于GPU设备上Krylov型求解器的牛顿-拉弗森预条件器。
Springerplus. 2016 Jun 21;5(1):788. doi: 10.1186/s40064-016-2346-7. eCollection 2016.
3
A divergence-free semi-implicit finite volume scheme for ideal, viscous, and resistive magnetohydrodynamics.一种用于理想、粘性和电阻磁流体动力学的无散半隐式有限体积格式。
Int J Numer Methods Fluids. 2019 Jan 10;89(1-2):16-42. doi: 10.1002/fld.4681. Epub 2018 Sep 25.
4
Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems.在图形处理器(GPU)上使用张量核进行混合精度迭代细化以加速线性系统求解。
Proc Math Phys Eng Sci. 2020 Nov;476(2243):20200110. doi: 10.1098/rspa.2020.0110. Epub 2020 Nov 25.
5
An aneurysm-specific preconditioning technique for the acceleration of Newton-Krylov method with application in the simulation of blood flows.一种用于加速牛顿-克里洛夫方法的动脉瘤特异性预处理技术及其在血流模拟中的应用。
Int J Numer Method Biomed Eng. 2023 Dec;39(12):e3771. doi: 10.1002/cnm.3771. Epub 2023 Sep 9.
6
Block-structured adaptive meshes and reduced grids for atmospheric general circulation models.块结构自适应网格和大气环流模式的粗网格。
Philos Trans A Math Phys Eng Sci. 2009 Nov 28;367(1907):4497-522. doi: 10.1098/rsta.2009.0150.
7
Performance impact of precision reduction in sparse linear systems solvers.稀疏线性系统求解器中精度降低的性能影响。
PeerJ Comput Sci. 2022 Jan 17;8:e778. doi: 10.7717/peerj-cs.778. eCollection 2022.
8
ARCTIC CHANGE AND POSSIBLE INFLUENCE ON MID-LATITUDE CLIMATE AND WEATHER: A US CLIVAR White Paper.北极变化及其对中纬度气候和天气的可能影响:一份美国气候变率和可预报性研究计划(CLIVAR)白皮书
US CLIVAR Rep. 2018 Mar;n/a. doi: 10.5065/D6TH8KGW.
9
Performance Analysis of the χMD Matrix Solver Package for MODFLOW-USG.χMD 矩阵求解器包在 MODFLOW-USG 中的性能分析。
Ground Water. 2021 Nov;59(6):905-912. doi: 10.1111/gwat.13110. Epub 2021 May 17.
10
Fixed-point iterative linear inverse solver with extended precision.具有扩展精度的定点迭代线性反演求解器。
Sci Rep. 2023 Mar 30;13(1):5198. doi: 10.1038/s41598-023-32338-5.

引用本文的文献

1
Compressing atmospheric data into its real information content.将大气数据压缩至其真实信息含量。
Nat Comput Sci. 2021 Nov;1(11):713-724. doi: 10.1038/s43588-021-00156-2. Epub 2021 Nov 25.

本文引用的文献

1
Fluid Simulations Accelerated With 16 Bits: Approaching 4x Speedup on A64FX by Squeezing ShallowWaters.jl Into Float16.基于16位加速的流体模拟:通过将ShallowWaters.jl压缩为Float16在A64FX上实现近4倍加速
J Adv Model Earth Syst. 2022 Feb;14(2):e2021MS002684. doi: 10.1029/2021MS002684. Epub 2022 Feb 11.