用于全基因组上位性计算的经济高效的GPU网格

Pütz B, Kam-Thong T, Karbalai N, Altmann A, Müller-Myhsok B

MPI of Psychiatry, Statistical Genetics,Munich, Germany.

Methods Inf Med. 2013;52(1):91-5. doi: 10.3414/ME11-02-0049. Epub 2012 Dec 7.

BACKGROUND

Until recently, genotype studies were limited to the investigation of single SNP effects due to the computational burden incurred when studying pairwise interactions of SNPs. However, some genetic effects as simple as coloring (in plants and animals) cannot be ascribed to a single locus but only understood when epistasis is taken into account [1]. It is expected that such effects are also found in complex diseases where many genes contribute to the clinical outcome of affected individuals. Only recently have such problems become feasible computationally.

OBJECTIVES

The inherently parallel structure of the problem makes it a perfect candidate for massive parallelization on either grid or cloud architectures. Since we are also dealing with confidential patient data, we were not able to consider a cloud-based solution but had to find a way to process the data in-house and aimed to build a local GPU-based grid structure.

METHODS

Sequential epistatsis calculations were ported to GPU using CUDA at various levels. Parallelization on the CPU was compared to corresponding GPU counterparts with regards to performance and cost.

RESULTS

A cost-effective solution was created by combining custom-built nodes equipped with relatively inexpensive consumer-level graphics cards with highly parallel GPUs in a local grid. The GPU method outperforms current cluster-based systems on a price/performance criterion, as a single GPU shows speed performance comparable up to 200 CPU cores.

CONCLUSION

The outlined approach will work for problems that easily lend themselves to massive parallelization. Code for various tasks has been made available and ongoing development of tools will further ease the transition from sequential to parallel algorithms.

背景

直到最近，由于研究单核苷酸多态性（SNP）的成对相互作用时会产生计算负担，基因型研究仍局限于对单个SNP效应的调查。然而，一些像（动植物的）着色这样简单的遗传效应不能归因于单个基因座，而只有在考虑上位性时才能理解[1]。预计在复杂疾病中也会发现此类效应，在复杂疾病中许多基因对受影响个体的临床结果都有作用。直到最近，这类问题在计算上才变得可行。

目的

该问题固有的并行结构使其成为在网格或云架构上进行大规模并行化的理想候选对象。由于我们还处理机密的患者数据，所以无法考虑基于云的解决方案，而是必须找到一种内部处理数据的方法，并旨在构建基于本地GPU的网格结构。

方法

使用统一计算设备架构（CUDA）在不同级别将顺序上位性计算移植到GPU上。在性能和成本方面，将CPU上的并行化与相应的GPU并行化进行了比较。

结果

通过在本地网格中将配备相对便宜的消费级显卡的定制节点与高度并行的GPU相结合，创建了一种经济高效的解决方案。在性价比标准方面，GPU方法优于当前基于集群的系统，因为单个GPU的速度性能可与多达200个CPU核心相媲美。

结论

所概述的方法适用于易于进行大规模并行化的问题。已提供了各种任务的代码，并且工具的持续开发将进一步简化从顺序算法到并行算法的过渡。

相似文献

Cost-effective GPU-grid for genome-wide epistasis calculations.

Methods Inf Med. 2013;52(1):91-5. doi: 10.3414/ME11-02-0049. Epub 2012 Dec 7.

NMF-mGPU: non-negative matrix factorization on multi-GPU systems.

BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.

Accelerating epistasis analysis in human genetics with consumer graphics hardware.

BMC Res Notes. 2009 Jul 24;2:149. doi: 10.1186/1756-0500-2-149.

EpiGPU: exhaustive pairwise epistasis scans parallelized on consumer level graphics cards.

Bioinformatics. 2011 Jun 1;27(11):1462-5. doi: 10.1093/bioinformatics/btr172. Epub 2011 Apr 6.

High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.

Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

Front Genet. 2013 Dec 3;4:266. doi: 10.3389/fgene.2013.00266.

GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering.

PLoS One. 2016 Aug 2;11(8):e0157338. doi: 10.1371/journal.pone.0157338. eCollection 2016.

Best bang for your buck: GPU nodes for GROMACS biomolecular simulations.

J Comput Chem. 2015 Oct 5;36(26):1990-2008. doi: 10.1002/jcc.24030. Epub 2015 Aug 4.

WFA-GPU: gap-affine pairwise read-alignment using GPUs.

Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad701.

A nonvoxel-based dose convolution/superposition algorithm optimized for scalable GPU architectures.

Med Phys. 2014 Oct;41(10):101711. doi: 10.1118/1.4895822.

引用本文的文献

How to increase our belief in discovered statistical interactions via large-scale association studies?

Hum Genet. 2019 Apr;138(4):293-305. doi: 10.1007/s00439-019-01987-w. Epub 2019 Mar 6.

From bed to bench: bridging from informatics practice to theory: an exploratory analysis.

Appl Clin Inform. 2014 Oct 29;5(4):907-15. doi: 10.4338/ACI-2014-10-RA-0095. eCollection 2014.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Cost-effective GPU-grid for genome-wide epistasis calculations.

Methods Inf Med. 2013;52(1):91-5. doi: 10.3414/ME11-02-0049. Epub 2012 Dec 7.

NMF-mGPU: non-negative matrix factorization on multi-GPU systems.

BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.

Accelerating epistasis analysis in human genetics with consumer graphics hardware.

BMC Res Notes. 2009 Jul 24;2:149. doi: 10.1186/1756-0500-2-149.

EpiGPU: exhaustive pairwise epistasis scans parallelized on consumer level graphics cards.

Bioinformatics. 2011 Jun 1;27(11):1462-5. doi: 10.1093/bioinformatics/btr172. Epub 2011 Apr 6.

High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.

Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

Front Genet. 2013 Dec 3;4:266. doi: 10.3389/fgene.2013.00266.

GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering.

PLoS One. 2016 Aug 2;11(8):e0157338. doi: 10.1371/journal.pone.0157338. eCollection 2016.

Best bang for your buck: GPU nodes for GROMACS biomolecular simulations.

J Comput Chem. 2015 Oct 5;36(26):1990-2008. doi: 10.1002/jcc.24030. Epub 2015 Aug 4.

WFA-GPU: gap-affine pairwise read-alignment using GPUs.

Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad701.

A nonvoxel-based dose convolution/superposition algorithm optimized for scalable GPU architectures.

Med Phys. 2014 Oct;41(10):101711. doi: 10.1118/1.4895822.

引用本文的文献

How to increase our belief in discovered statistical interactions via large-scale association studies?

Hum Genet. 2019 Apr;138(4):293-305. doi: 10.1007/s00439-019-01987-w. Epub 2019 Mar 6.

From bed to bench: bridging from informatics practice to theory: an exploratory analysis.

Appl Clin Inform. 2014 Oct 29;5(4):907-15. doi: 10.4338/ACI-2014-10-RA-0095. eCollection 2014.

Cost-effective GPU-grid for genome-wide epistasis calculations.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVES

METHODS

RESULTS

CONCLUSION

背景

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献