Hu Wei, Qin Xinming, Jiang Qingcai, Chen Junshi, An Hong, Jia Weile, Li Fang, Liu Xin, Chen Dexun, Liu Fangfang, Zhao Yuwen, Yang Jinlong
Hefei National Laboratory for Physical Sciences at the Microscale, Department of Chemical Physics, and Synergetic Innovation Center of Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei 230026, China.
School of Computer Science and Technology, University of Science and Technology of China, Hefei 230026, China.
Sci Bull (Beijing). 2021 Jan 30;66(2):111-119. doi: 10.1016/j.scib.2020.06.025. Epub 2020 Jun 23.
High performance computing (HPC) is a powerful tool to accelerate the Kohn-Sham density functional theory (KS-DFT) calculations on modern heterogeneous supercomputers. Here, we describe a massively parallel implementation of discontinuous Galerkin density functional theory (DGDFT) method on the Sunway TaihuLight supercomputer. The DGDFT method uses the adaptive local basis (ALB) functions generated on-the-fly during the self-consistent field (SCF) iteration to solve the KS equations with high precision comparable to plane-wave basis set. In particular, the DGDFT method adopts a two-level parallelization strategy that deals with various types of data distribution, task scheduling, and data communication schemes, and combines with the master-slave multi-thread heterogeneous parallelism of SW26010 processor, resulting in large-scale HPC KS-DFT calculations on the Sunway TaihuLight supercomputer. We show that the DGDFT method can scale up to 8,519,680 processing cores (131,072 core groups) on the Sunway TaihuLight supercomputer for studying the electronic structures of two-dimensional (2D) metallic graphene systems that contain tens of thousands of carbon atoms.
高性能计算(HPC)是一种强大的工具,可加速在现代异构超级计算机上进行的Kohn-Sham密度泛函理论(KS-DFT)计算。在此,我们描述了在神威·太湖之光超级计算机上对间断伽辽金密度泛函理论(DGDFT)方法的大规模并行实现。DGDFT方法使用在自洽场(SCF)迭代过程中即时生成的自适应局部基(ALB)函数,以高精度求解KS方程,其精度可与平面波基组相媲美。特别地,DGDFT方法采用了两级并行化策略,该策略处理各种类型的数据分布、任务调度和数据通信方案,并与SW26010处理器的主从多线程异构并行相结合,从而在神威·太湖之光超级计算机上实现大规模HPC KS-DFT计算。我们表明,DGDFT方法在神威·太湖之光超级计算机上可扩展至8519680个处理核心(131072个核心组),用于研究包含数万个碳原子的二维(2D)金属石墨烯系统的电子结构。