Coupled Cluster Theory on Graphics Processing Units I. The Coupled Cluster Doubles Method.

Authors

DePrince A Eugene, Hammond Jeff R

Affiliations

Center for Nanoscale Materials and Leadership Computing Facility, Argonne National Laboratory, 9700 South Cass Avenue, Argonne, Illinois 60439, United States.

Publication information

J Chem Theory Comput. 2011 May 10;7(5):1287-95. doi: 10.1021/ct100584w. Epub 2011 Apr 15.

DOI:10.1021/ct100584w

PMID:26610123

Abstract

The coupled cluster (CC) ansatz is generally recognized as providing one of the best wave function-based descriptions of electronic correlation in small- and medium-sized molecules. The fact that the CC equations with double excitations (CCD) may be expressed as a handful of dense matrix-matrix multiplications makes it an ideal method to be ported to graphics processing units (GPUs). We present our implementation of the spin-free CCD equations in which the entire iterative procedure is evaluated on the GPU. The GPU-accelerated algorithm readily achieves a factor of 4-5 speedup relative to the multithreaded CPU algorithm on same-generation hardware. The GPU-accelerated algorithm is approximately 8-12 times faster than Molpro, 17-22 times faster than NWChem, and 21-29 times faster than GAMESS for each CC iteration. Single-precision GPU-accelerated computations are also performed, leading to an additional doubling of performance. Single-precision errors in the energy are typically on the order of 10^(-6) hartrees and can be reduced by about an order of magnitude by performing one additional iteration in double precision.
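
To make concrete the abstract's remark that the CCD equations reduce to dense matrix-matrix multiplications, the following minimal sketch takes one representative four-index contraction of the kind that appears in CCD, R(ab,ij) = sum over c,d of V(ab,cd) T(cd,ij), and evaluates it twice: once with explicit loops, and once as a single matrix product over flattened index pairs. The tensor names, index layout, toy dimensions, and helper functions are illustrative assumptions and are not taken from the paper; the pure-Python `matmul` merely stands in for the BLAS-style GEMM call that a CPU or GPU implementation would use.

```python
import itertools
import random


def random_tensor(rng, n0, n1, n2, n3):
    """Rank-4 nested list filled with random numbers between -1 and 1."""
    return [[[[rng.uniform(-1.0, 1.0) for _ in range(n3)]
              for _ in range(n2)]
             for _ in range(n1)]
            for _ in range(n0)]


def matmul(A, B):
    """Dense matrix product on nested lists (stand-in for a GEMM call)."""
    inner, cols = len(B), len(B[0])
    return [[sum(row[k] * B[k][c] for k in range(inner)) for c in range(cols)]
            for row in A]


def ladder_loops(V, T, nv, no):
    """R[a][b][i][j] = sum over c, d of V[a][b][c][d] * T[c][d][i][j]."""
    R = [[[[0.0] * no for _ in range(no)] for _ in range(nv)] for _ in range(nv)]
    for a, b, i, j in itertools.product(range(nv), range(nv), range(no), range(no)):
        R[a][b][i][j] = sum(V[a][b][c][d] * T[c][d][i][j]
                            for c in range(nv) for d in range(nv))
    return R


def ladder_gemm(V, T, nv, no):
    """Same contraction as one matrix product over flattened index pairs."""
    # Rows are the compound index (a, b); columns are the compound index (c, d).
    Vm = [[V[a][b][c][d] for c in range(nv) for d in range(nv)]
          for a in range(nv) for b in range(nv)]
    # Rows are (c, d); columns are (i, j).
    Tm = [[T[c][d][i][j] for i in range(no) for j in range(no)]
          for c in range(nv) for d in range(nv)]
    Rm = matmul(Vm, Tm)  # shape (nv*nv) x (no*no)
    # Unflatten the result back to R[a][b][i][j].
    return [[[[Rm[a * nv + b][i * no + j] for j in range(no)]
              for i in range(no)]
             for b in range(nv)]
            for a in range(nv)]


def max_abs_diff(X, Y, nv, no):
    """Largest element-wise difference between two nv x nv x no x no tensors."""
    return max(abs(X[a][b][i][j] - Y[a][b][i][j])
               for a, b, i, j in itertools.product(range(nv), range(nv),
                                                   range(no), range(no)))


if __name__ == "__main__":
    nv, no = 4, 3  # toy virtual/occupied dimensions, chosen arbitrarily
    rng = random.Random(0)
    V = random_tensor(rng, nv, nv, nv, nv)
    T = random_tensor(rng, nv, nv, no, no)
    diff = max_abs_diff(ladder_loops(V, T, nv, no), ladder_gemm(V, T, nv, no), nv, no)
    print("max |loops - gemm| =", diff)
```

Under these assumptions the two routes agree to rounding error. The practical point is that once the tensors are stored in the flattened layout, the whole contraction is a single (nv*nv) x (nv*nv) by (nv*nv) x (no*no) product, which is the kind of operation that vendor GEMM libraries on GPUs execute efficiently.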

Similar articles

Computing the Density Matrix in Electronic Structure Theory on Graphics Processing Units.

J Chem Theory Comput. 2012 Nov 13;8(11):4094-101. doi: 10.1021/ct300442w. Epub 2012 Oct 8.

Grid-based algorithm to search critical points, in the electron density, accelerated by graphics processing units.

J Comput Chem. 2014 Dec 5;35(31):2272-8. doi: 10.1002/jcc.23752.

Accelerating Coupled-Cluster Calculations with GPUs: An Implementation of the Density-Fitted CCSD(T) Approach for Heterogeneous Computing Architectures Using OpenMP Directives.

J Chem Theory Comput. 2023 Nov 14;19(21):7640-7657. doi: 10.1021/acs.jctc.3c00876. Epub 2023 Oct 25.

Quantum supercharger library: hyper-parallelism of the Hartree-Fock method.

J Comput Chem. 2015 Jul 5;36(18):1399-409. doi: 10.1002/jcc.23936. Epub 2015 May 14.

Stacked-Bloch-wave electron diffraction simulations using GPU acceleration.

Ultramicroscopy. 2014 Jun;141:32-7. doi: 10.1016/j.ultramic.2014.03.003. Epub 2014 Mar 17.

Acceleration of the GAMESS-UK electronic structure package on graphical processing units.

J Comput Chem. 2011 Jul 30;32(10):2313-8. doi: 10.1002/jcc.21815. Epub 2011 May 3.

Graphics processing unit accelerated computation of digital holograms.

Appl Opt. 2009 Dec 1;48(34):H137-43. doi: 10.1364/AO.48.00H137.

Mesh-particle interpolations on graphics processing units and multicore central processing units.

Philos Trans A Math Phys Eng Sci. 2011 Jun 13;369(1944):2164-75. doi: 10.1098/rsta.2011.0074.

Single-precision open-shell CCSD and CCSD(T) calculations on graphics processing units.

Phys Chem Chem Phys. 2020 Nov 21;22(43):25103-25111. doi: 10.1039/d0cp03800h. Epub 2020 Oct 29.

Cited by

Frozen Natural Orbitals-Based Coupled-Cluster Singles, Doubles, and (full) Triples - A Computational Study.

Chem Asian J. 2025 Jul;20(14):e00472. doi: 10.1002/asia.202500472. Epub 2025 Jun 6.

Linear-Scaling Local Natural Orbital-Based Full Triples Treatment in Coupled-Cluster Theory.

J Chem Theory Comput. 2025 Mar 11;21(5):2386-2401. doi: 10.1021/acs.jctc.4c01716. Epub 2025 Feb 21.

Accelerating Pythonic Coupled-Cluster Implementations: A Comparison Between CPUs and GPUs.

J Chem Theory Comput. 2024 Feb 13;20(3):1130-1142. doi: 10.1021/acs.jctc.3c01110. Epub 2024 Feb 2.

Tensor Hypercontraction Form of the Perturbative Triples Energy in Coupled-Cluster Theory.

J Chem Theory Comput. 2023 Mar 14;19(5):1476-1486. doi: 10.1021/acs.jctc.2c00996. Epub 2023 Feb 17.

TeraChem: Accelerating electronic structure and ab initio molecular dynamics with graphical processing units.

J Chem Phys. 2020 Jun 14;152(22):224110. doi: 10.1063/5.0007615.
