通过利用图形处理器在大规模基因关联研究中发现上位性。

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

作者信息

Chen Gary K, Guo Yunfei

机构信息

Division of Biostatics, Department of Preventive Medicine, University of Southern California Los Angeles, CA, USA.

Division of Biostatics, Department of Preventive Medicine, University of Southern California Los Angeles, CA, USA ; Zilkha Neurogenetic Institute, University of Southern California Los Angeles, CA, USA.

出版信息

Front Genet. 2013 Dec 3;4:266. doi: 10.3389/fgene.2013.00266.

DOI:10.3389/fgene.2013.00266

PMID:24348518

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3848199/

Abstract

Despite the enormous investments made in collecting DNA samples and generating germline variation data across thousands of individuals in modern genome-wide association studies (GWAS), progress has been frustratingly slow in explaining much of the heritability in common disease. Today's paradigm of testing independent hypotheses on each single nucleotide polymorphism (SNP) marker is unlikely to adequately reflect the complex biological processes in disease risk. Alternatively, modeling risk as an ensemble of SNPs that act in concert in a pathway, and/or interact non-additively on log risk for example, may be a more sensible way to approach gene mapping in modern studies. Implementing such analyzes genome-wide can quickly become intractable due to the fact that even modest size SNP panels on modern genotype arrays (500k markers) pose a combinatorial nightmare, require tens of billions of models to be tested for evidence of interaction. In this article, we provide an in-depth analysis of programs that have been developed to explicitly overcome these enormous computational barriers through the use of processors on graphics cards known as Graphics Processing Units (GPU). We include tutorials on GPU technology, which will convey why they are growing in appeal with today's numerical scientists. One obvious advantage is the impressive density of microprocessor cores that are available on only a single GPU. Whereas high end servers feature up to 24 Intel or AMD CPU cores, the latest GPU offerings from nVidia feature over 2600 cores. Each compute node may be outfitted with up to 4 GPU devices. Success on GPUs varies across problems. However, epistasis screens fare well due to the high degree of parallelism exposed in these problems. Papers that we review routinely report GPU speedups of over two orders of magnitude (>100x) over standard CPU implementations.

摘要

尽管在现代全基因组关联研究（GWAS）中投入了巨额资金来收集数千人的DNA样本并生成种系变异数据，但在解释常见疾病的大部分遗传力方面，进展一直令人沮丧地缓慢。当今对每个单核苷酸多态性（SNP）标记进行独立假设检验的范式不太可能充分反映疾病风险中的复杂生物学过程。相反，例如将风险建模为在一条通路中协同作用和/或对对数风险进行非加性相互作用的SNP集合，可能是现代研究中进行基因定位的更明智方法。由于现代基因型阵列上即使是适度规模的SNP面板（50万个标记）也会带来组合难题，需要测试数百亿个模型以寻找相互作用的证据，因此在全基因组范围内实施此类分析很快就会变得难以处理。在本文中，我们深入分析了为通过使用称为图形处理单元（GPU）的显卡上的处理器来明确克服这些巨大计算障碍而开发的程序。我们提供了关于GPU技术的教程，这将说明它们为何在当今的数值科学家当中越来越受欢迎。一个明显的优势是单个GPU上可用的微处理器核心的惊人密度。高端服务器最多有24个英特尔或AMD CPU核心，而英伟达最新的GPU产品有超过2600个核心。每个计算节点最多可配备4个GPU设备。在GPU上的成功因问题而异。然而，由于这些问题中存在高度并行性，上位性筛选表现良好。我们所综述的论文经常报告GPU比标准CPU实现的加速超过两个数量级（>100倍）。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5da3/3848199/5c993c7a07b5/fgene-04-00266-g0001.jpg

相似文献

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

Front Genet. 2013 Dec 3;4:266. doi: 10.3389/fgene.2013.00266.

GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores.

BMC Res Notes. 2011 May 26;4:158. doi: 10.1186/1756-0500-4-158.

High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.

Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.

Heterogeneous computing architecture for fast detection of SNP-SNP interactions.

BMC Bioinformatics. 2014 Jun 25;15:216. doi: 10.1186/1471-2105-15-216.

Accelerating epistasis analysis in human genetics with consumer graphics hardware.

BMC Res Notes. 2009 Jul 24;2:149. doi: 10.1186/1756-0500-2-149.

Cost-effective GPU-grid for genome-wide epistasis calculations.

Methods Inf Med. 2013;52(1):91-5. doi: 10.3414/ME11-02-0049. Epub 2012 Dec 7.

NMF-mGPU: non-negative matrix factorization on multi-GPU systems.

BMC Bioinformatics. 2015 Feb 13;16:43. doi: 10.1186/s12859-015-0485-4.

A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction.

Med Phys. 2011 Jul;38(7):4052-65. doi: 10.1118/1.3591994.

Parallelizing Epistasis Detection in GWAS on FPGA and GPU-Accelerated Computing Systems.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Sep-Oct;12(5):982-94. doi: 10.1109/TCBB.2015.2389958.

Exploiting graphics processing units for computational biology and bioinformatics.

Interdiscip Sci. 2010 Sep;2(3):213-20. doi: 10.1007/s12539-010-0002-4. Epub 2010 Jul 25.

引用本文的文献

Review on GPU accelerated methods for genome-wide SNP-SNP interactions.

Mol Genet Genomics. 2024 Dec 29;300(1):10. doi: 10.1007/s00438-024-02214-6.

Accelerating Wright-Fisher Forward Simulations on the Graphics Processing Unit.

G3 (Bethesda). 2017 Sep 7;7(9):3229-3236. doi: 10.1534/g3.117.300103.

Assessing the effects of multiple markers in genetic association studies.

Front Genet. 2015 Feb 24;6:66. doi: 10.3389/fgene.2015.00066. eCollection 2015.

本文引用的文献

eQTL Epistasis - Challenges and Computational Approaches.

Front Genet. 2013 May 31;4:51. doi: 10.3389/fgene.2013.00051. eCollection 2013.

cuGWAM: Genome-wide association multifactor dimensionality reduction using CUDA-enabled high-performance graphics processing unit.

Int J Data Min Bioinform. 2012;6(5):471-81. doi: 10.1504/ijdmb.2012.049301.

GLIDE: GPU-based linear regression for detection of epistasis.

Hum Hered. 2012;73(4):220-36. doi: 10.1159/000341885. Epub 2012 Sep 4.

Mendel-GPU: haplotyping and genotype imputation on graphics processing units.

Bioinformatics. 2012 Nov 15;28(22):2979-80. doi: 10.1093/bioinformatics/bts536. Epub 2012 Sep 5.

Approximate probabilistic analysis of biopathway dynamics.

Bioinformatics. 2012 Jun 1;28(11):1508-16. doi: 10.1093/bioinformatics/bts166. Epub 2012 Apr 5.

Multifactor dimensionality reduction as a filter-based approach for genome wide association studies.

Front Genet. 2011 Nov 21;2:80. doi: 10.3389/fgene.2011.00080. eCollection 2011.

SOAP3: ultra-fast GPU-based parallel alignment tool for short reads.

Bioinformatics. 2012 Mar 15;28(6):878-9. doi: 10.1093/bioinformatics/bts061. Epub 2012 Jan 28.

A scalable and portable framework for massively parallel variable selection in genetic association studies.

Bioinformatics. 2012 Mar 1;28(5):719-20. doi: 10.1093/bioinformatics/bts015. Epub 2012 Jan 11.

Graphics Processing Units and High-Dimensional Optimization.

Stat Sci. 2010 Aug 1;25(3):311-324. doi: 10.1214/10-STS336.

CAMPAIGN: an open-source library of GPU-accelerated data clustering algorithms.

Bioinformatics. 2011 Aug 15;27(16):2322-3. doi: 10.1093/bioinformatics/btr386. Epub 2011 Jun 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过利用图形处理器在大规模基因关联研究中发现上位性。

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

作者信息

Chen Gary K, Guo Yunfei

机构信息

Division of Biostatics, Department of Preventive Medicine, University of Southern California Los Angeles, CA, USA.

出版信息

Front Genet. 2013 Dec 3;4:266. doi: 10.3389/fgene.2013.00266.

DOI:10.3389/fgene.2013.00266

PMID:24348518

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3848199/

Abstract

摘要

通过利用图形处理器在大规模基因关联研究中发现上位性。

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

通过利用图形处理器在大规模基因关联研究中发现上位性。

Discovering epistasis in large scale genetic association studies by exploiting graphics cards.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献