Limas Nidia Gabaldon, Manz Thomas A
Department of Chemical & Materials Engineering, New Mexico State University Las Cruces New Mexico 88003-8001 USA
RSC Adv. 2018 Jan 11;8(5):2678-2707. doi: 10.1039/c7ra11829e. eCollection 2018 Jan 9.
The DDEC6 method is one of the most accurate and broadly applicable atomic population analysis methods. It works for a broad range of periodic and non-periodic materials with no magnetism, collinear magnetism, and non-collinear magnetism irrespective of the basis set type. First, we show DDEC6 charge partitioning to assign net atomic charges corresponds to solving a series of 14 Lagrangians in order. Then, we provide flow diagrams for overall DDEC6 analysis, spin partitioning, and bond order calculations. We wrote an OpenMP parallelized Fortran code to provide efficient computations. We show that by storing large arrays as shared variables in cache line friendly order, memory requirements are independent of the number of parallel computing cores and false sharing is minimized. We show that both total memory required and the computational time scale linearly with increasing numbers of atoms in the unit cell. Using the presently chosen uniform grids, computational times of ∼9 to 94 seconds per atom were required to perform DDEC6 analysis on a single computing core in an Intel Xeon E5 multi-processor unit. Parallelization efficiencies were usually >50% for computations performed on 2 to 16 cores of a cache coherent node. As examples we study a B-DNA decamer, nickel metal, supercells of hexagonal ice crystals, six X@C endohedral fullerene complexes, a water dimer, a Mn-acetate single molecule magnet exhibiting collinear magnetism, a FeONCH single molecule magnet exhibiting non-collinear magnetism, and several spin states of an ozone molecule. Efficient parallel computation was achieved for systems containing as few as one and as many as >8000 atoms in a unit cell. We varied many calculation factors (, grid spacing, code design, thread arrangement, ) and report their effects on calculation speed and precision. We make recommendations for excellent performance.
DDEC6方法是最准确且应用广泛的原子布居分析方法之一。它适用于多种周期性和非周期性材料,包括无磁性、共线磁性和非共线磁性的材料,且与基组类型无关。首先,我们展示了DDEC6电荷划分以分配净原子电荷,这对应于依次求解一系列14个拉格朗日方程。然后,我们提供了整体DDEC6分析、自旋划分和键级计算的流程图。我们编写了一个OpenMP并行化的Fortran代码以实现高效计算。我们表明,通过以缓存行友好顺序将大型数组存储为共享变量,内存需求与并行计算核心的数量无关,并且虚假共享被最小化。我们表明,所需的总内存和计算时间都随晶胞中原子数量的增加而呈线性变化。使用目前选择的均匀网格,在英特尔至强E5多处理器单元的单个计算核心上对每个原子执行DDEC6分析需要约9至94秒的计算时间。对于在缓存一致节点的2至16个核心上执行的计算,并行化效率通常>50%。作为示例,我们研究了一个B - DNA十聚体、镍金属、六方冰晶的超胞、六个X@C内嵌富勒烯配合物、一个水二聚体、一个表现出共线磁性的Mn - 乙酸盐单分子磁体、一个表现出非共线磁性的FeONCH单分子磁体以及臭氧分子的几种自旋态。对于晶胞中原子数少至1个和多至>8000个的系统都实现了高效并行计算。我们改变了许多计算因素(如网格间距、代码设计、线程安排等)并报告了它们对计算速度和精度的影响。我们给出了实现卓越性能的建议。