• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于混合CPU-GPU架构的大规模并行张量网络状态算法

Massively Parallel Tensor Network State Algorithms on Hybrid CPU-GPU Based Architectures.

作者信息

Menczer Andor, Legeza Örs

机构信息

Strongly Correlated Systems "Lendület" Research Group, Wigner Research Centre for Physics, H-1525 Budapest, Hungary.

Eötvös Loránd University, Pázmány Péter Sétány 1/C, 1117 Budapest, Hungary.

出版信息

J Chem Theory Comput. 2025 Feb 25;21(4):1572-1587. doi: 10.1021/acs.jctc.4c00661. Epub 2025 Feb 4.

DOI:10.1021/acs.jctc.4c00661
PMID:39902559
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11866756/
Abstract

The interplay of quantum and classical simulation and the delicate divide between them is in the focus of massively parallelized tensor network state (TNS) algorithms designed for high performance computing (HPC). In this contribution, we present novel algorithmic solutions together with implementation details to extend current limits of TNS algorithms on HPC infrastructure building on state-of-the-art hardware and software technologies. Benchmark results obtained via large-scale density matrix renormalization group (DMRG) simulations on single node multiGPU NVIDIA A100 system are presented for selected strongly correlated molecular systems addressing problems on Hilbert space dimensions up to 4.17 × 10.

摘要

量子模拟与经典模拟之间的相互作用以及它们之间的微妙界限,是为高性能计算(HPC)设计的大规模并行张量网络态(TNS)算法的核心关注点。在本论文中,我们基于最先进的硬件和软件技术,提出了新颖的算法解决方案以及实现细节,以扩展TNS算法在HPC基础设施上的当前限制。通过在单节点多GPU NVIDIA A100系统上进行大规模密度矩阵重整化群(DMRG)模拟,针对选定的强关联分子系统给出了基准测试结果,这些系统解决了高达4.17×10的希尔伯特空间维度问题。

相似文献

1
Massively Parallel Tensor Network State Algorithms on Hybrid CPU-GPU Based Architectures.基于混合CPU-GPU架构的大规模并行张量网络状态算法
J Chem Theory Comput. 2025 Feb 25;21(4):1572-1587. doi: 10.1021/acs.jctc.4c00661. Epub 2025 Feb 4.
2
Parallel Implementation of the Density Matrix Renormalization Group Method Achieving a Quarter petaFLOPS Performance on a Single DGX-H100 GPU Node.密度矩阵重整化群方法的并行实现,在单个DGX-H100 GPU节点上实现了四分之一petaFLOPS的性能。
J Chem Theory Comput. 2024 Oct 8;20(19):8397-8404. doi: 10.1021/acs.jctc.4c00903. Epub 2024 Sep 19.
3
Tensor Network State Algorithms on AI Accelerators.人工智能加速器上的张量网络状态算法
J Chem Theory Comput. 2024 Oct 22;20(20):8897-8910. doi: 10.1021/acs.jctc.4c00800. Epub 2024 Oct 14.
4
Distributed Multi-GPU Density Matrix Renormalization Group Algorithm with Applications to the P-Cluster of Nitrogenase.分布式多GPU密度矩阵重整化群算法及其在固氮酶P簇中的应用
J Chem Theory Comput. 2024 Jan 23;20(2):775-786. doi: 10.1021/acs.jctc.3c01228. Epub 2024 Jan 10.
5
GPUs Outperform Current HPC and Neuromorphic Solutions in Terms of Speed and Energy When Simulating a Highly-Connected Cortical Model.在模拟高度连接的皮质模型时,图形处理器(GPU)在速度和能源方面优于当前的高性能计算(HPC)和神经形态解决方案。
Front Neurosci. 2018 Dec 12;12:941. doi: 10.3389/fnins.2018.00941. eCollection 2018.
6
WFA-GPU: gap-affine pairwise read-alignment using GPUs.WFA-GPU:基于 GPU 的缺口仿射两两序列比对
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad701.
7
High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.用于可变形图像配准的高性能计算:迈向自适应放射治疗的新范式。
Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.
8
Novel hybrid GPU-CPU implementation of parallelized Monte Carlo parametric expectation maximization estimation method for population pharmacokinetic data analysis.新型 GPU-CPU 混合并行化 Monte Carlo 参数期望极大化估计方法在群体药代动力学数据分析中的应用。
AAPS J. 2013 Oct;15(4):1212-21. doi: 10.1208/s12248-013-9524-0. Epub 2013 Sep 4.
9
Accelerating Coupled-Cluster Calculations with GPUs: An Implementation of the Density-Fitted CCSD(T) Approach for Heterogeneous Computing Architectures Using OpenMP Directives.利用GPU加速耦合簇计算:一种使用OpenMP指令在异构计算架构上实现密度拟合CCSD(T)方法的方案
J Chem Theory Comput. 2023 Nov 14;19(21):7640-7657. doi: 10.1021/acs.jctc.3c00876. Epub 2023 Oct 25.
10
Enabling Large-Scale Condensed-Phase Hybrid Density Functional Theory-Based Molecular Dynamics II: Extensions to the Isobaric-Isoenthalpic and Isobaric-Isothermal Ensembles.实现大规模凝聚相杂化密度泛函理论分子动力学Ⅱ:等压等焓和等压恒温系综的扩展。
J Chem Theory Comput. 2021 Dec 14;17(12):7789-7813. doi: 10.1021/acs.jctc.0c01194. Epub 2021 Nov 14.

引用本文的文献

1
Orbital Optimization of Large Active Spaces via AI-Accelerators.通过人工智能加速器对大型活性空间进行轨道优化
J Chem Theory Comput. 2025 Jul 8;21(13):6545-6558. doi: 10.1021/acs.jctc.5c00571. Epub 2025 Jun 13.

本文引用的文献

1
Tensor Network State Algorithms on AI Accelerators.人工智能加速器上的张量网络状态算法
J Chem Theory Comput. 2024 Oct 22;20(20):8897-8910. doi: 10.1021/acs.jctc.4c00800. Epub 2024 Oct 14.
2
Parallel Implementation of the Density Matrix Renormalization Group Method Achieving a Quarter petaFLOPS Performance on a Single DGX-H100 GPU Node.密度矩阵重整化群方法的并行实现,在单个DGX-H100 GPU节点上实现了四分之一petaFLOPS的性能。
J Chem Theory Comput. 2024 Oct 8;20(19):8397-8404. doi: 10.1021/acs.jctc.4c00903. Epub 2024 Sep 19.
3
A comprehensive electron wavefunction analysis toolbox for chemists, Multiwfn.
一款面向化学家的综合电子波函数分析工具箱——Multiwfn。
J Chem Phys. 2024 Aug 28;161(8). doi: 10.1063/5.0216272.
4
Renormalized-Residue-Based Multireference Configuration Interaction Method for Strongly Correlated Systems.用于强关联系统的基于重归一化残差的多参考组态相互作用方法
J Chem Theory Comput. 2024 Mar 12;20(5):1988-2009. doi: 10.1021/acs.jctc.3c01247. Epub 2024 Feb 21.
5
Distributed Implementation of Full Configuration Interaction for One Trillion Determinants.万亿行列式全组态相互作用的分布式实现
J Chem Theory Comput. 2024 Feb 13;20(3):1185-1192. doi: 10.1021/acs.jctc.3c01190. Epub 2024 Feb 5.
6
Distributed Multi-GPU Density Matrix Renormalization Group Algorithm with Applications to the P-Cluster of Nitrogenase.分布式多GPU密度矩阵重整化群算法及其在固氮酶P簇中的应用
J Chem Theory Comput. 2024 Jan 23;20(2):775-786. doi: 10.1021/acs.jctc.3c01228. Epub 2024 Jan 10.
7
Predicting the FCI Energy of Large Systems to Chemical Accuracy from Restricted Active Space Density Matrix Renormalization Group Calculations.通过受限活性空间密度矩阵重整化群计算将大体系的FCI能量预测至化学精度。
J Chem Theory Comput. 2024 Jan 9;20(1):87-102. doi: 10.1021/acs.jctc.3c01001. Epub 2023 Dec 18.
8
Block2: A comprehensive open source framework to develop and apply state-of-the-art DMRG algorithms in electronic structure and beyond.模块2:一个全面的开源框架,用于在电子结构及其他领域开发和应用最先进的密度矩阵重整化群(DMRG)算法。
J Chem Phys. 2023 Dec 21;159(23). doi: 10.1063/5.0180424.
9
Toward Large-Scale Restricted Active Space Calculations Inspired by the Schmidt Decomposition.受施密特分解启发的大规模受限活性空间计算
J Phys Chem A. 2022 Dec 29;126(51):9709-9718. doi: 10.1021/acs.jpca.2c05952. Epub 2022 Dec 15.
10
Matrix Product States with Large Sites.具有大格点的矩阵乘积态
J Chem Theory Comput. 2022 Feb 8;18(2):749-762. doi: 10.1021/acs.jctc.1c00957. Epub 2022 Jan 21.