重新审视CPU/GPU系统上的分子动力学：水核与SHAKE并行化

Revisiting Molecular Dynamics on a CPU/GPU system: Water Kernel and SHAKE Parallelization.

作者信息

Ruymgaart A Peter, Elber Ron

机构信息

Department of Chemistry and Biochemistry, Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, TX 78712.

出版信息

J Chem Theory Comput. 2012 Nov 13;8(11):4624-4636. doi: 10.1021/ct300324k. Epub 2012 Aug 21.

DOI:10.1021/ct300324k

PMID:23264758

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3524996/

Abstract

We report Graphics Processing Unit (GPU) and Open-MP parallel implementations of water-specific force calculations and of bond constraints for use in Molecular Dynamics simulations. We focus on a typical laboratory computing-environment in which a CPU with a few cores is attached to a GPU. We discuss in detail the design of the code and we illustrate performance comparable to highly optimized codes such as GROMACS. Beside speed our code shows excellent energy conservation. Utilization of water-specific lists allows the efficient calculations of non-bonded interactions that include water molecules and results in a speed-up factor of more than 40 on the GPU compared to code optimized on a single CPU core for systems larger than 20,000 atoms. This is up four-fold from a factor of 10 reported in our initial GPU implementation that did not include a water-specific code. Another optimization is the implementation of constrained dynamics entirely on the GPU. The routine, which enforces constraints of all bonds, runs in parallel on multiple Open-MP cores or entirely on the GPU. It is based on Conjugate Gradient solution of the Lagrange multipliers (CG SHAKE). The GPU implementation is partially in double precision and requires no communication with the CPU during the execution of the SHAKE algorithm. The (parallel) implementation of SHAKE allows an increase of the time step to 2.0fs while maintaining excellent energy conservation. Interestingly, CG SHAKE is faster than the usual bond relaxation algorithm even on a single core if high accuracy is expected. The significant speedup of the optimized components transfers the computational bottleneck of the MD calculation to the reciprocal part of Particle Mesh Ewald (PME).

摘要

我们报告了用于分子动力学模拟的水特异性力计算和键约束的图形处理单元（GPU）及Open-MP并行实现。我们聚焦于一种典型的实验室计算环境，即具有几个核心的CPU连接到一个GPU。我们详细讨论了代码设计，并展示了与诸如GROMACS等高度优化的代码相当的性能。除了速度之外，我们的代码还具有出色的能量守恒特性。利用水特异性列表可以高效计算包含水分子的非键相互作用，对于大于20,000个原子的系统，与在单个CPU核心上优化的代码相比，在GPU上的加速因子超过40。这比我们最初未包含水特异性代码的GPU实现中报告的10倍加速因子提高了四倍。另一个优化是在GPU上完全实现约束动力学。该例程用于强制执行所有键的约束，可在多个Open-MP核心上并行运行，也可完全在GPU上运行。它基于拉格朗日乘子的共轭梯度解（CG SHAKE）。GPU实现部分采用双精度，并且在执行SHAKE算法期间无需与CPU通信。SHAKE的（并行）实现允许将时间步长增加到2.0飞秒，同时保持出色的能量守恒。有趣的是，如果期望高精度，即使在单核上，CG SHAKE也比通常的键松弛算法更快。优化组件的显著加速将分子动力学计算的计算瓶颈转移到了粒子网格埃瓦尔德（PME）的倒数部分。

相似文献

Revisiting Molecular Dynamics on a CPU/GPU system: Water Kernel and SHAKE Parallelization.重新审视CPU/GPU系统上的分子动力学：水核与SHAKE并行化

J Chem Theory Comput. 2012 Nov 13;8(11):4624-4636. doi: 10.1021/ct300324k. Epub 2012 Aug 21.

MOIL-opt: Energy-Conserving Molecular Dynamics on a GPU/CPU system.MOIL-opt：GPU/CPU系统上的节能分子动力学

J Chem Theory Comput. 2011 Aug 26;7(10):3072-3082. doi: 10.1021/ct200360f.

A nonvoxel-based dose convolution/superposition algorithm optimized for scalable GPU architectures.一种针对可扩展GPU架构进行优化的基于非体素的剂量卷积/叠加算法。

Med Phys. 2014 Oct;41(10):101711. doi: 10.1118/1.4895822.

Parallel Implementation of Density Functional Theory Methods in the Quantum Interaction Computational Kernel Program.量子相互作用计算内核程序中密度泛函理论方法的并行实现。

J Chem Theory Comput. 2020 Jul 14;16(7):4315-4326. doi: 10.1021/acs.jctc.0c00290. Epub 2020 Jun 24.

A GPU-Accelerated Fast Multipole Method for GROMACS: Performance and Accuracy.GPU 加速的 GROMACS 快速多极方法：性能与精度。

J Chem Theory Comput. 2020 Nov 10;16(11):6938-6949. doi: 10.1021/acs.jctc.0c00744. Epub 2020 Oct 21.

An Implementation of the Smooth Particle Mesh Ewald Method on GPU Hardware.光滑粒子网格埃瓦尔德方法在GPU硬件上的实现

J Chem Theory Comput. 2009 Sep 8;5(9):2371-7. doi: 10.1021/ct900275y.

High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.用于可变形图像配准的高性能计算：迈向自适应放射治疗的新范式。

Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.

Graphics Processing Unit Acceleration and Parallelization of GENESIS for Large-Scale Molecular Dynamics Simulations.用于大规模分子动力学模拟的GENESIS的图形处理单元加速与并行化

J Chem Theory Comput. 2016 Oct 11;12(10):4947-4958. doi: 10.1021/acs.jctc.6b00241. Epub 2016 Sep 27.

CPU-GPU hybrid accelerating the Zuker algorithm for RNA secondary structure prediction applications.CPU-GPU 混合加速 Zuker 算法在 RNA 二级结构预测中的应用。

BMC Genomics. 2012;13 Suppl 1(Suppl 1):S14. doi: 10.1186/1471-2164-13-S1-S14. Epub 2012 Jan 17.

Open-Source Multi-GPU-Accelerated QM/MM Simulations with AMBER and QUICK.使用 AMBER 和 QUICK 进行开源的多 GPU 加速的 QM/MM 模拟。

J Chem Inf Model. 2021 May 24;61(5):2109-2115. doi: 10.1021/acs.jcim.1c00169. Epub 2021 Apr 29.

引用本文的文献

The EMC acts as a chaperone for membrane proteins.内质网分子伴侣（EMC）作为膜蛋白的伴侣蛋白发挥作用。

Nat Commun. 2025 Aug 2;16(1):7097. doi: 10.1038/s41467-025-62109-x.

A small-molecule Skp1 inhibitor elicits cell death by p53-dependent mechanism.一种小分子Skp1抑制剂通过p53依赖性机制引发细胞死亡。

iScience. 2022 Jun 14;25(7):104591. doi: 10.1016/j.isci.2022.104591. eCollection 2022 Jul 15.

Accelerated Molecular Mechanical and Solvation Energetics on Multicore CPUs and Manycore GPUs.多核CPU和众核GPU上的加速分子力学与溶剂化能量学

ACM BCB. 2015 Sep;2015:222-231. doi: 10.1145/2808719.2808742.

Partition of Positively and Negatively Charged Tryptophan Ions in Membranes with Inverted Phospholipid Heads: Simulations and Experiments.带反式磷脂头的膜中带正电荷和带负电荷色氨酸离子的分区：模拟与实验。

J Phys Chem B. 2019 Apr 18;123(15):3272-3281. doi: 10.1021/acs.jpcb.9b00754. Epub 2019 Apr 9.

Perspective: Computer simulations of long time dynamics.视角：长时间动力学的计算机模拟

J Chem Phys. 2016 Feb 14;144(6):060901. doi: 10.1063/1.4940794.

Extracting intrinsic dynamic parameters of biomolecular folding from single-molecule force spectroscopy experiments.从单分子力谱实验中提取生物分子折叠的内在动力学参数。

Protein Sci. 2016 Jan;25(1):123-34. doi: 10.1002/pro.2727. Epub 2015 Jul 14.

Molecular dynamics studies of modular polyketide synthase ketoreductase stereospecificity.模块化聚酮合酶酮还原酶立体特异性的分子动力学研究

Biochemistry. 2015 Apr 14;54(14):2346-59. doi: 10.1021/bi501401g. Epub 2015 Apr 2.

Automated Optimization of Potential Parameters.潜在参数的自动优化

J Chem Theory Comput. 2013 Aug 13;9(8):3311-3320. doi: 10.1021/ct400313n.

本文引用的文献

GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation.GROMACS 4：高效、负载均衡和可扩展的分子模拟算法。

J Chem Theory Comput. 2008 Mar;4(3):435-47. doi: 10.1021/ct700301q.

P-LINCS: A Parallel Linear Constraint Solver for Molecular Simulation.P-LINCS：一种用于分子模拟的并行线性约束求解器。

J Chem Theory Comput. 2008 Jan;4(1):116-22. doi: 10.1021/ct700200b.

ACEMD: Accelerating Biomolecular Dynamics in the Microsecond Time Scale.ACEMD：在微秒时间尺度上加速生物分子动力学

J Chem Theory Comput. 2009 Jun 9;5(6):1632-9. doi: 10.1021/ct9000685. Epub 2009 May 21.

How conformational dynamics of DNA polymerase select correct substrates: experiments and simulations.DNA 聚合酶如何通过构象动力学选择正确的底物：实验与模拟。

Structure. 2012 Apr 4;20(4):618-27. doi: 10.1016/j.str.2012.02.018. Epub 2012 Apr 3.

SHAKE parallelization.SHAKE并行化

Eur Phys J Spec Top. 2011 Nov 1;200(1):211-223. doi: 10.1140/epjst/e2011-01525-9.

MOIL-opt: Energy-Conserving Molecular Dynamics on a GPU/CPU system.MOIL-opt：GPU/CPU系统上的节能分子动力学

J Chem Theory Comput. 2011 Aug 26;7(10):3072-3082. doi: 10.1021/ct200360f.

Revisiting and computing reaction coordinates with Directional Milestoning.重访并计算具有定向里程碑的反应坐标。

J Phys Chem A. 2011 Jun 16;115(23):6137-48. doi: 10.1021/jp111093c. Epub 2011 Apr 18.

Milestoning without a Reaction Coordinate.无反应坐标的里程碑式标记

J Chem Theory Comput. 2010;6(6):1805-1817. doi: 10.1021/ct100114j.

CCMA: A Robust, Parallelizable Constraint Method for Molecular Simulations.CCMA：一种用于分子模拟的稳健、可并行化的约束方法。

J Chem Theory Comput. 2010 Feb 9;6(2):434-437. doi: 10.1021/ct900463w.

Efficient nonbonded interactions for molecular dynamics on a graphics processing unit.高效的非键相互作用在图形处理单元上的分子动力学。

J Comput Chem. 2010 Apr 30;31(6):1268-72. doi: 10.1002/jcc.21413.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验