• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPU_PBTE:一种用于在图形处理单元上求解三声子和四声子散射率的高效求解器。

GPU_PBTE: an efficient solver for three and four phonon scattering rates on graphics processing units.

作者信息

Zhang Bo, Fan Zheyong, Zhao C Y, Gu Xiaokun

机构信息

Institute of Engineering Thermophysics, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, People's Republic of China.

School of Mathematics and Physics, Bohai University, Jinzhou, People's Republic of China.

出版信息

J Phys Condens Matter. 2021 Sep 30;33(49). doi: 10.1088/1361-648X/ac268d.

DOI:10.1088/1361-648X/ac268d
PMID:34521073
Abstract

Lattice thermal conductivity (LTC) is a key parameter for many technological applications. Based on the Peierls-Boltzmann transport equation (PBTE), many unique phonon transport properties of various materials were revealed. Accurate calculation of LTC with PBTE, however, is a time-consuming task, especially for compounds with a complex crystal structure or taking high-order phonon scattering into consideration. Graphical processing units (GPUs) have been extensively used to accelerate scientific simulations, making it possible to use a single desktop workstation for calculations that used to require supercomputers. Due to its fundamental differences from traditional processors, GPUs are especially suited for executing a large group of similar tasks with minimal communication, but require completely different algorithm design. In this paper, we provide a new algorithm optimized for GPUs, where a two-kernel method is used to avoid divergent branching. A new open-source code, GPU_PBTE, is developed based on the proposed algorithm. As demonstrations, we investigate the thermal transport properties of silicon and silicon carbide, and find that accurate and reliable LTC can be obtained by our software. GPU_PBTE performed on NVIDIA Tesla V100 can extensively improve double precision performance, making it two to three orders of magnitude faster than our CPU version performed on Intel Xeon CPU Gold 6248 @2.5 GHz. Our work also provides an idea of accelerating calculations with other novel hardware that may come out in the future.

摘要

晶格热导率(LTC)是许多技术应用中的关键参数。基于派尔斯 - 玻尔兹曼输运方程(PBTE),揭示了各种材料许多独特的声子输运特性。然而,用PBTE精确计算LTC是一项耗时的任务,特别是对于具有复杂晶体结构的化合物或考虑高阶声子散射的情况。图形处理单元(GPU)已被广泛用于加速科学模拟,使得使用单个桌面工作站就能进行过去需要超级计算机才能完成的计算。由于其与传统处理器存在根本差异,GPU特别适合执行大量通信最少的类似任务,但需要完全不同的算法设计。在本文中,我们提供了一种针对GPU优化的新算法,其中使用双内核方法来避免发散分支。基于所提出的算法开发了一个新的开源代码GPU_PBTE。作为示例,我们研究了硅和碳化硅的热输运特性,发现通过我们的软件可以获得准确可靠的LTC。在NVIDIA Tesla V100上运行的GPU_PBTE可以大幅提高双精度性能,使其比在英特尔至强CPU Gold 6248 @2.5 GHz上运行的CPU版本快两到三个数量级。我们的工作还为未来可能出现的其他新型硬件加速计算提供了思路。

相似文献

1
GPU_PBTE: an efficient solver for three and four phonon scattering rates on graphics processing units.GPU_PBTE:一种用于在图形处理单元上求解三声子和四声子散射率的高效求解器。
J Phys Condens Matter. 2021 Sep 30;33(49). doi: 10.1088/1361-648X/ac268d.
2
Fast on-site Monte Carlo tool for dose calculations in CT applications.快速现场蒙特卡罗工具,用于 CT 应用中的剂量计算。
Med Phys. 2012 Jun;39(6):2985-96. doi: 10.1118/1.4711748.
3
Parallel beamlet dose calculation via beamlet contexts in a distributed multi-GPU framework.基于分布式多 GPU 框架中的束流子区域进行平行束流子剂量计算。
Med Phys. 2019 Aug;46(8):3719-3733. doi: 10.1002/mp.13651. Epub 2019 Jun 30.
4
On the origin of increased phonon scattering in nanostructured PbTe based thermoelectric materials.在基于 PbTe 的纳米结构热电材料中声子散射增加的起源。
J Am Chem Soc. 2010 Jun 30;132(25):8669-75. doi: 10.1021/ja1010948.
5
Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units-Radial Distribution Function Histogramming.利用图形处理器进行分子动力学轨迹的快速分析——径向分布函数直方图法
J Comput Phys. 2011 May 1;230(9):3556-3569. doi: 10.1016/j.jcp.2011.01.048.
6
Modified Anderson Method for Accelerating 3D-RISM Calculations Using Graphics Processing Unit.使用图形处理单元加速三维反应扩散隐式溶剂模型(3D-RISM)计算的改进安德森方法。
J Chem Theory Comput. 2012 Sep 11;8(9):3015-21. doi: 10.1021/ct300355r. Epub 2012 Aug 7.
7
Novel insights into lattice thermal transport in nanocrystalline MgSb from first principles: the crucial role of higher-order phonon scattering.基于第一性原理对纳米晶MgSb中晶格热输运的新见解:高阶声子散射的关键作用
Phys Chem Chem Phys. 2022 Sep 14;24(35):20891-20900. doi: 10.1039/d2cp01967a.
8
Next-generation acceleration and code optimization for light transport in turbid media using GPUs.利用图形处理器(GPU)实现浑浊介质中光传输的下一代加速与代码优化。
Biomed Opt Express. 2010 Sep 1;1(2):658-75. doi: 10.1364/BOE.1.000658. Epub 2010 Aug 23.
9
Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs.在图形处理器(GPU)和多线程中央处理器(CPU)上实现多级非刚性质量守恒图像配准的高效方法。
Comput Methods Programs Biomed. 2016 Apr;127:290-300. doi: 10.1016/j.cmpb.2015.12.018. Epub 2016 Jan 6.
10
Accelerating epistasis analysis in human genetics with consumer graphics hardware.利用消费级图形硬件加速人类遗传学中的上位性分析。
BMC Res Notes. 2009 Jul 24;2:149. doi: 10.1186/1756-0500-2-149.