Hesford Andrew J, Waag Robert C
Department of Electrical and Computer Engineering, University of Rochester, Rochester NY 14642-8648 USA.
J Comput Phys. 2011 May 10;230(10):3656-3667. doi: 10.1016/j.jcp.2011.02.016.
The fast multipole method (FMM) has been shown to have a reduced computational dependence on the size of finest-level groups of elements when the elements are positioned on a regular grid and FFT convolution is used to represent neighboring interactions. However, transformations between plane-wave expansions used for FMM interactions and pressure distributions used for neighboring interactions remain significant contributors to the cost of FMM computations when finest-level groups are large. The transformation operators, which are forward and inverse Fourier transforms with the wave space confined to the unit sphere, are smooth and well approximated using reduced-rank decompositions that further reduce the computational dependence of the FMM on finest-level group size. The adaptive cross approximation (ACA) is selected to represent the forward and adjoint far-field transformation operators required by the FMM. However, the actual error of the ACA is found to be greater than that predicted using traditional estimates, and the ACA generally performs worse than the approximation resulting from a truncated singular-value decomposition (SVD). To overcome these issues while avoiding the cost of a full-scale SVD, the ACA is employed with more stringent accuracy demands and recompressed using a reduced, truncated SVD. The results show a greatly reduced approximation error that performs comparably to the full-scale truncated SVD without degrading the asymptotic computational efficiency associated with ACA matrix assembly.
当元素位于规则网格上且使用快速傅里叶变换(FFT)卷积来表示相邻相互作用时,快速多极子方法(FMM)已被证明在计算上对最细级别元素组大小的依赖性降低。然而,当最细级别组很大时,用于FMM相互作用的平面波展开与用于相邻相互作用的压力分布之间的变换仍然是FMM计算成本的重要组成部分。变换算子是波空间局限于单位球的正向和反向傅里叶变换,它们是光滑的,并且可以使用降秩分解进行很好的近似,这进一步降低了FMM对最细级别组大小的计算依赖性。选择自适应交叉近似(ACA)来表示FMM所需的正向和伴随远场变换算子。然而,发现ACA的实际误差大于使用传统估计预测的误差,并且ACA通常比截断奇异值分解(SVD)产生的近似效果更差。为了在避免全规模SVD成本的同时克服这些问题,采用了具有更严格精度要求的ACA,并使用简化的截断SVD进行重新压缩。结果表明,近似误差大大降低,其性能与全规模截断SVD相当,同时不会降低与ACA矩阵组装相关的渐近计算效率。