Suppr超能文献

使用GPU加速的堆叠布洛赫波电子衍射模拟。

Stacked-Bloch-wave electron diffraction simulations using GPU acceleration.

作者信息

Pennington Robert S, Wang Feng, Koch Christoph T

机构信息

ELIM, Institut für Experimentelle Physik, Universität Ulm, Albert-Einstein-Allee 11, 89081 Ulm, Germany.

ELIM, Institut für Experimentelle Physik, Universität Ulm, Albert-Einstein-Allee 11, 89081 Ulm, Germany.

出版信息

Ultramicroscopy. 2014 Jun;141:32-7. doi: 10.1016/j.ultramic.2014.03.003. Epub 2014 Mar 17.

Abstract

In this paper, we discuss the advantages for Bloch-wave simulations performed using graphics processing units (GPUs), based on approximating the matrix exponential directly instead of performing a matrix diagonalization. Our direct matrix-exponential algorithm yields a functionally identical electron scattering matrix to that generated with matrix diagonalization. Using the matrix-exponential scaling-and-squaring method with a Padé approximation, direct GPU-based matrix-exponential double-precision calculations are up to 20× faster than CPU-based calculations and up to approximately 70× faster than matrix diagonalization. We compare precision and runtime of scaling and squaring methods with either the Padé approximation or a Taylor expansion. We also discuss the stacked-Bloch-wave method, and show that our stacked-Bloch-wave implementation yields the same electron scattering matrix as traditional Bloch-wave matrix diagonalization.

摘要

在本文中,我们讨论了使用图形处理单元(GPU)进行布洛赫波模拟的优势,该方法基于直接逼近矩阵指数而非进行矩阵对角化。我们的直接矩阵指数算法产生的电子散射矩阵在功能上与通过矩阵对角化生成的矩阵相同。使用带有帕德近似的矩阵指数缩放平方方法,基于GPU的直接矩阵指数双精度计算比基于CPU的计算快20倍,比矩阵对角化快约70倍。我们比较了使用帕德近似或泰勒展开的缩放平方方法的精度和运行时间。我们还讨论了堆叠布洛赫波方法,并表明我们的堆叠布洛赫波实现产生的电子散射矩阵与传统的布洛赫波矩阵对角化相同。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验