• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于强缩放从头算方法的混合CPU/GPU集成引擎。

Hybrid CPU/GPU Integral Engine for Strong-Scaling Ab Initio Methods.

作者信息

Kussmann Jörg, Ochsenfeld Christian

机构信息

Department of Chemistry, University of Munich (LMU) , Butenandtstrasse 7, D-81377 München, Germany.

Center for Integrated Protein Science (CIPSM) at the Department of Chemistry, University of Munich (LMU) , Butenandtstrasse 5-13, D-81377 München, Germany.

出版信息

J Chem Theory Comput. 2017 Jul 11;13(7):3153-3159. doi: 10.1021/acs.jctc.6b01166. Epub 2017 Jun 21.

DOI:10.1021/acs.jctc.6b01166
PMID:28636392
Abstract

We present a parallel integral algorithm for two-electron contributions occurring in Hartree-Fock and hybrid density functional theory that allows for a strong scaling parallelization on inhomogeneous compute clusters. With a particular focus on graphic processing units, we show that our approach allows an efficient use of CPUs and graphics processing units (GPUs) simultaneously, although the different architectures demand conflictive strategies in order to ensure efficient program execution. Furthermore, we present a general strategy to use large basis sets like quadruple-ζ split valence on GPUs and investigate the balance between CPUs and GPUs depending on l-quantum numbers of the corresponding basis functions. Finally, we present first illustrative calculations using a hybrid CPU/GPU environment and demonstrate the strong-scaling performance of our parallelization strategy also for pure CPU-based calculations.

摘要

我们提出了一种用于Hartree-Fock和混合密度泛函理论中两电子贡献的并行积分算法,该算法允许在非均匀计算集群上进行强缩放并行化。特别关注图形处理单元,我们表明我们的方法允许同时高效使用中央处理器(CPU)和图形处理单元(GPU),尽管不同的架构需要相互冲突的策略以确保程序高效执行。此外,我们提出了一种在GPU上使用诸如四重ζ分裂价基组等大基组的通用策略,并根据相应基函数的l量子数研究CPU和GPU之间的平衡。最后,我们展示了使用混合CPU/GPU环境的首次说明性计算,并证明了我们的并行化策略对于基于纯CPU的计算也具有强缩放性能。

相似文献

1
Hybrid CPU/GPU Integral Engine for Strong-Scaling Ab Initio Methods.用于强缩放从头算方法的混合CPU/GPU集成引擎。
J Chem Theory Comput. 2017 Jul 11;13(7):3153-3159. doi: 10.1021/acs.jctc.6b01166. Epub 2017 Jun 21.
2
A hybrid CPU/GPU method for Hartree-Fock self-consistent-field calculation.一种用于哈特里-福克自洽场计算的混合CPU/ GPU方法。
J Chem Phys. 2023 Sep 14;159(10). doi: 10.1063/5.0156934.
3
Employing OpenCL to Accelerate Ab Initio Calculations on Graphics Processing Units.利用 OpenCL 加速图形处理单元上的从头算计算。
J Chem Theory Comput. 2017 Jun 13;13(6):2712-2716. doi: 10.1021/acs.jctc.7b00515. Epub 2017 May 31.
4
Highly Efficient Resolution-of-Identity Density Functional Theory Calculations on Central and Graphics Processing Units.基于中央处理器和图形处理器的高效密度泛函理论中的单位分解计算
J Chem Theory Comput. 2021 Mar 9;17(3):1512-1521. doi: 10.1021/acs.jctc.0c01252. Epub 2021 Feb 22.
5
Accelerating Coupled-Cluster Calculations with GPUs: An Implementation of the Density-Fitted CCSD(T) Approach for Heterogeneous Computing Architectures Using OpenMP Directives.利用GPU加速耦合簇计算:一种使用OpenMP指令在异构计算架构上实现密度拟合CCSD(T)方法的方案
J Chem Theory Comput. 2023 Nov 14;19(21):7640-7657. doi: 10.1021/acs.jctc.3c00876. Epub 2023 Oct 25.
6
Communication: A reduced scaling J-engine based reformulation of SOS-MP2 using graphics processing units.通讯:一种基于减少缩放的J引擎对使用图形处理单元的SOS-MP2的重新表述。
J Chem Phys. 2014 Aug 7;141(5):051106. doi: 10.1063/1.4891797.
7
Faster Self-Consistent Field (SCF) Calculations on GPU Clusters.在GPU集群上更快的自洽场(SCF)计算
J Chem Theory Comput. 2021 Dec 14;17(12):7486-7503. doi: 10.1021/acs.jctc.1c00720. Epub 2021 Nov 15.
8
GPU/CPU Algorithm for Generalized Born/Solvent-Accessible Surface Area Implicit Solvent Calculations.用于广义玻恩/溶剂可及表面积隐式溶剂计算的GPU/CPU算法
J Chem Theory Comput. 2012 Jul 10;8(7):2521-2530. doi: 10.1021/ct3003089. Epub 2012 Jun 15.
9
Screening methods for linear-scaling short-range hybrid calculations on CPU and GPU architectures.在 CPU 和 GPU 架构上进行线性标度短程杂化计算的筛选方法。
J Chem Phys. 2017 Apr 14;146(14):144108. doi: 10.1063/1.4978476.
10
Double-buffered, heterogeneous CPU + GPU integral digestion algorithm for single-excitation calculations involving a large number of excited states.用于涉及大量激发态的单激发计算的双缓冲异构CPU+GPU积分消化算法
J Comput Chem. 2018 Oct 5;39(26):2173-2182. doi: 10.1002/jcc.25531. Epub 2018 Oct 3.

引用本文的文献

1
Efficient Low-Scaling Calculation of THC-SOS-LR-CC2 and THC-SOS-ADC(2) Excitation Energies Through Density-Based Integral-Direct Tensor Hypercontraction.通过基于密度的积分直接张量超收缩高效低尺度计算THC-SOS-LR-CC2和THC-SOS-ADC(2)激发能
J Chem Theory Comput. 2025 May 27;21(10):5083-5102. doi: 10.1021/acs.jctc.5c00230. Epub 2025 May 12.
2
Acceleration of the Relativistic Dirac-Kohn-Sham Method with GPU: A Pre-Exascale Implementation of BERTHA and PyBERTHA.利用GPU加速相对论性狄拉克-科恩-沙姆方法:BERTHA和PyBERTHA的百亿亿次级前实现
J Chem Theory Comput. 2025 Apr 8;21(7):3460-3475. doi: 10.1021/acs.jctc.4c01759. Epub 2025 Mar 21.
3
VeloxChem: GPU-Accelerated Fock Matrix Construction Enabling Complex Polarization Propagator Simulations of Circular Dichroism Spectra of G-Quadruplexes.
VeloxChem:基于GPU加速的福克矩阵构建,实现G-四链体圆二色光谱的复杂极化传播子模拟
J Phys Chem A. 2025 Jan 16;129(2):633-642. doi: 10.1021/acs.jpca.4c07510. Epub 2024 Dec 31.
4
Mechanism of proton release during water oxidation in Photosystem II.光系统II中水氧化过程中质子释放的机制。
Proc Natl Acad Sci U S A. 2024 Dec 24;121(52):e2413396121. doi: 10.1073/pnas.2413396121. Epub 2024 Dec 19.
5
A Constraint-Based Orbital-Optimized Excited State Method (COOX).基于约束的轨道优化激发态方法(COOX)
J Chem Theory Comput. 2024 Oct 8;20(19):8461-8473. doi: 10.1021/acs.jctc.4c00467. Epub 2024 Sep 30.
6
Low-Scaling, Efficient and Memory Optimized Computation of Nuclear Magnetic Resonance Shieldings within the Random Phase Approximation Using Cholesky-Decomposed Densities and an Attenuated Coulomb Metric.利用Cholesky分解密度和衰减库仑度量在随机相位近似内进行低尺度、高效且内存优化的核磁共振屏蔽计算。
J Phys Chem A. 2024 Sep 19;128(37):7950-7965. doi: 10.1021/acs.jpca.4c02773. Epub 2024 Sep 6.
7
Efficient Exploitation of Numerical Quadrature with Distance-Dependent Integral Screening in Explicitly Correlated F12 Theory: Linear Scaling Evaluation of the Most Expensive RI-MP2-F12 Term.显式相关F12理论中基于距离相关积分筛选的数值积分的高效利用:最昂贵的RI-MP2-F12项的线性标度评估
J Chem Theory Comput. 2024 May 14;20(9):3706-3718. doi: 10.1021/acs.jctc.4c00193. Epub 2024 Apr 16.
8
Systematic QM/MM Study for Predicting P NMR Chemical Shifts of Adenosine Nucleotides in Solution and Stages of ATP Hydrolysis in a Protein Environment.系统的QM/MM 研究预测核苷酸碱基在溶液中的 P NMR 化学位移和在蛋白质环境中 ATP 水解的各个阶段。
J Chem Theory Comput. 2024 Mar 26;20(6):2433-2444. doi: 10.1021/acs.jctc.3c01280. Epub 2024 Mar 18.
9
Exploring Chemical Space Using Hyperreactor Dynamics.利用超反应器动力学探索化学空间。
ACS Cent Sci. 2024 Jan 31;10(2):302-314. doi: 10.1021/acscentsci.3c01403. eCollection 2024 Feb 28.
10
Improved Sampling of Adaptive Path Collective Variables by Stabilized Extended-System Dynamics.通过稳定扩展系统动力学改进自适应路径集体变量的采样
J Chem Theory Comput. 2023 Dec 26;19(24):9202-9210. doi: 10.1021/acs.jctc.3c00938. Epub 2023 Dec 11.