Accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit, and customized 16-bit number formats.

Authors

Lehmann Moritz, Krause Mathias J, Amati Giorgio, Sega Marcello, Harting Jens, Gekle Stephan

Affiliations

Biofluid Simulation and Modeling-Theoretische Physik VI, University of Bayreuth, Bayreuth, Germany.

Institute of Mechanical Process Engineering and Mechanics, Karlsruhe Institute of Technology, Karlsruhe, Germany.

Publication information

Phys Rev E. 2022 Jul;106(1-2):015308. doi: 10.1103/PhysRevE.106.015308.

DOI:10.1103/PhysRevE.106.015308
PMID:35974647
Abstract

Fluid dynamics simulations with the lattice Boltzmann method (LBM) are very memory intensive. Alongside a reduction in memory footprint, significant performance benefits can be achieved by using FP32 (single) precision instead of FP64 (double) precision, especially on GPUs. Here we evaluate the possibility of using even FP16 and posit16 (half) precision for storing fluid populations, while still carrying out arithmetic operations in FP32. For this, we first show that the commonly occurring number range in the LBM is much smaller than the FP16 number range. Based on this observation, we develop customized 16-bit formats (based on a modified IEEE-754 and on a modified posit standard) that are specifically tailored to the needs of the LBM. We then carry out an in-depth characterization of LBM accuracy for six test systems of increasing complexity: Poiseuille flow, Taylor-Green vortices, Kármán vortex streets, lid-driven cavity, a microcapsule in shear flow (using the immersed-boundary method), and, finally, the impact of a raindrop (based on a volume-of-fluid approach). We find that the difference in accuracy between FP64 and FP32 is negligible in almost all cases, and that in many cases even 16-bit is sufficient. Finally, we provide a detailed performance analysis of all precision levels on a large number of hardware microarchitectures and show that significant speedup is achieved with mixed FP32/16-bit.
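
The storage idea described in the abstract (16-bit values in memory, arithmetic in a wider format) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It uses the standard IEEE-754 half-precision format exposed by Python's `struct` module (format code `e`), Python's 64-bit floats stand in for the FP32 arithmetic, and the helper names `store_fp16`, `load_fp16`, `roundtrip_error` as well as the power-of-two scale factor are assumptions made purely for illustration. The scale factor shows one simple way a narrow value range could be moved into the part of the FP16 range where precision is best.

```python
import struct


def store_fp16(value: float, scale: float = 1.0) -> bytes:
    """Pack a value into 2 bytes of IEEE-754 half precision.

    `scale` is a hypothetical pre-multiplier (a power of two keeps the
    scaling itself exact) that shifts small values into the normal FP16 range.
    """
    return struct.pack("<e", value * scale)


def load_fp16(raw: bytes, scale: float = 1.0) -> float:
    """Unpack 2 bytes of half precision and undo the scaling.

    The result is a wider float, so all further arithmetic happens
    at higher precision than the storage format.
    """
    return struct.unpack("<e", raw)[0] / scale


def roundtrip_error(value: float, scale: float = 1.0) -> float:
    """Relative error introduced by one store/load cycle."""
    restored = load_fp16(store_fp16(value, scale), scale)
    return abs(restored - value) / abs(value)


if __name__ == "__main__":
    sample = 3e-6  # assumed sample value, small enough to be subnormal in FP16
    print("plain FP16 storage :", roundtrip_error(sample))
    print("scaled by 2**15    :", roundtrip_error(sample, 2.0**15))
```

With the assumed sample value 3e-6, which falls in the subnormal range of FP16, scaling by 2**15 before storage gives a smaller round-trip error than storing the value directly. Values whose scaled magnitude exceeds the FP16 maximum (65504) cannot be packed, so a scale like this only suits data known to stay small.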

Similar articles

1
Accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit, and customized 16-bit number formats.
Phys Rev E. 2022 Jul;106(1-2):015308. doi: 10.1103/PhysRevE.106.015308.
2
Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems.
Proc Math Phys Eng Sci. 2020 Nov;476(2243):20200110. doi: 10.1098/rspa.2020.0110. Epub 2020 Nov 25.
3
Accelerating seminumerical Fock-exchange calculations using mixed single- and double-precision arithmethic.
J Chem Phys. 2021 Jun 7;154(21):214116. doi: 10.1063/5.0045084.
4
Number Formats, Error Mitigation, and Scope for 16-Bit Arithmetics in Weather and Climate Modeling Analyzed With a Shallow Water Model.
J Adv Model Earth Syst. 2020 Oct;12(10):e2020MS002246. doi: 10.1029/2020MS002246. Epub 2020 Oct 14.
5
Onsager-regularized lattice Boltzmann method: A nonequilibrium thermodynamics-based regularized lattice Boltzmann method.
Phys Rev E. 2021 Jul;104(1-2):015313. doi: 10.1103/PhysRevE.104.015313.
6
Hybrid Precision Floating-Point (HPFP) Selection to Optimize Hardware-Constrained Accelerator for CNN Training.
Sensors (Basel). 2024 Mar 27;24(7):2145. doi: 10.3390/s24072145.
7
Fast Approximations of Activation Functions in Deep Neural Networks when using Posit Arithmetic.
Sensors (Basel). 2020 Mar 10;20(5):1515. doi: 10.3390/s20051515.
8
Taylor-series expansion and least-squares-based lattice Boltzmann method: Two-dimensional formulation and its applications.
Phys Rev E Stat Nonlin Soft Matter Phys. 2002 Mar;65(3 Pt 2B):036708. doi: 10.1103/PhysRevE.65.036708. Epub 2002 Mar 6.
9
Chebyshev collocation spectral lattice Boltzmann method for simulation of low-speed flows.
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Jan;91(1):013301. doi: 10.1103/PhysRevE.91.013301. Epub 2015 Jan 7.
10
Accuracy of the lattice Boltzmann method for describing the behavior of a gas in the continuum limit.
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Nov;82(5 Pt 2):056709. doi: 10.1103/PhysRevE.82.056709. Epub 2010 Nov 15.