Steinbach Pit, Bannwarth Christoph
Institute for Physical Chemistry, RWTH Aachen University, Melatener Str. 20, 52074 Aachen, Germany.
Phys Chem Chem Phys. 2024 Jun 12;26(23):16567-16578. doi: 10.1039/d3cp06086a.
The computational efficiency of low-cost electronic structure methods can be further improved by leveraging heterogenous computing architectures. The software package TeraChem has been developed since 2008 to make use of graphical processing units (GPUs), particularly their strong single-precision performance, for the acceleration of quantum chemical calculations. Here, we present the implementation of three low-cost methods, namely HF-3c, PBEh-3c, and the recently introduced ωB97X-3c. We show that these can benefit in terms of performance when combined with "consumer grade" GPUs by leveraging the mixed precision integral handling in TeraChem. The current limitation of the latter's GPU integral library is that Gaussian integrals only for functions with angular momentum < 3 can be computed, which generally restricts the achievable accuracy in terms of the one-particle basis set. Particularly, the implementation of the ωB97X-3c method now enables higher accuracy with this setting which, in turn, provides the most efficient implementation accessible with consumer-grade hardware. We furthermore show that the implemented 3c methods can be combined with the hh-TDA formalism. This gives new and efficient low-cost multi-configurational excited states methods, which are benchmarked for the description of lowest vertical excitation energies in this work. All in all, the combination of these efficient electronic structure theory methods with affordable highly parallelized computing hardware provides an optimal computational and monetary cost to accuracy ratio.
通过利用异构计算架构,可以进一步提高低成本电子结构方法的计算效率。自2008年以来,已开发了软件包TeraChem,以利用图形处理单元(GPU),特别是其强大的单精度性能,来加速量子化学计算。在此,我们展示了三种低成本方法的实现,即HF-3c、PBEh-3c和最近推出的ωB97X-3c。我们表明,通过利用TeraChem中的混合精度积分处理,这些方法与“消费级”GPU结合使用时在性能方面会受益。后者的GPU积分库目前的局限性在于只能计算角动量<3的函数的高斯积分,这通常在单粒子基组方面限制了可达到的精度。特别是,ωB97X-3c方法的实现现在在这种设置下能够实现更高的精度,进而提供了消费级硬件可实现的最有效实现。我们还表明,所实现的3c方法可以与hh-TDA形式主义相结合。这产生了新的高效低成本多组态激发态方法,在本工作中对其描述最低垂直激发能进行了基准测试。总而言之,这些高效的电子结构理论方法与经济实惠的高度并行计算硬件的结合提供了最佳的计算成本与货币成本与精度之比。