Suppr超能文献

迭代主成分分析算法的并行GPU实现

Parallel GPU implementation of iterative PCA algorithms.

作者信息

Andrecut M

机构信息

Institute for Biocomplexity and Informatics, University of Calgary, Calgary, Alberta, Canada.

出版信息

J Comput Biol. 2009 Nov;16(11):1593-9. doi: 10.1089/cmb.2008.0221.

Abstract

Principal component analysis (PCA) is a key statistical technique for multivariate data analysis. For large data sets, the common approach to PCA computation is based on the standard NIPALS-PCA algorithm, which unfortunately suffers from loss of orthogonality, and therefore its applicability is usually limited to the estimation of the first few components. Here we present an algorithm based on Gram-Schmidt orthogonalization (called GS-PCA), which eliminates this shortcoming of NIPALS-PCA. Also, we discuss the GPU (Graphics Processing Unit) parallel implementation of both NIPALS-PCA and GS-PCA algorithms. The numerical results show that the GPU parallel optimized versions, based on CUBLAS (NVIDIA), are substantially faster (up to 12 times) than the CPU optimized versions based on CBLAS (GNU Scientific Library).

摘要

主成分分析(PCA)是多元数据分析的关键统计技术。对于大型数据集,PCA计算的常用方法基于标准的非线性迭代偏最小二乘法主成分分析(NIPALS-PCA)算法,但遗憾的是该算法存在正交性损失问题,因此其适用性通常仅限于前几个成分的估计。在此,我们提出一种基于格拉姆-施密特正交化的算法(称为GS-PCA),它消除了NIPALS-PCA的这一缺点。此外,我们还讨论了NIPALS-PCA算法和GS-PCA算法的图形处理器(GPU)并行实现。数值结果表明,基于英伟达CUDA基础线性代数子程序库(CUBLAS)的GPU并行优化版本比基于GNU科学库的CBLAS的中央处理器(CPU)优化版本快得多(高达12倍)。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验