SwiftLink: parallel MCMC linkage analysis using multicore CPU and GPU.

Author affiliations

Division of Medicine, University College London, London WC1E 6BT, UK; Institute of Biotechnology, University of Helsinki, Helsinki 00014, Finland.

Publication information

Bioinformatics. 2013 Feb 15;29(4):413-9. doi: 10.1093/bioinformatics/bts704. Epub 2012 Dec 13.

DOI: 10.1093/bioinformatics/bts704
PMID: 23239673
Abstract

MOTIVATION

Linkage analysis remains an important tool in elucidating the genetic component of disease and has become even more important with the advent of whole exome sequencing, enabling the user to focus on only those genomic regions co-segregating with Mendelian traits. Unfortunately, methods to perform multipoint linkage analysis scale poorly with either the number of markers or with the size of the pedigree. Large pedigrees with many markers can only be evaluated with Markov chain Monte Carlo (MCMC) methods that are slow to converge and, as no attempts have been made to exploit parallelism, massively underuse available processing power. Here, we describe SWIFTLINK, a novel application that performs MCMC linkage analysis by spreading the computational burden between multiple processor cores and a graphics processing unit (GPU) simultaneously. SWIFTLINK was designed around the concept of explicitly matching the characteristics of an algorithm with the underlying computer architecture to maximize performance.
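As a rough illustration of what exploiting parallelism inside an MCMC sampler can look like, the sketch below runs an odd-even Gibbs sweep on a toy chain of +1/-1 variables. The toy model, the function names, and the `coupling` parameter are all assumptions made for this example; it is not SWIFTLINK's pedigree model or its actual scheduling across CPU cores and GPU. The point it shows is that when each variable's conditional distribution depends only on its immediate neighbours, all even-indexed sites can be resampled independently of one another, and likewise all odd-indexed sites.

```python
import math
import random


def conditional_p_plus(left, right, coupling):
    """P(x[i] = +1 | neighbours) for the toy chain model.

    Toy model (an assumption, not SWIFTLINK's model):
    P(x) is proportional to exp(coupling * sum_i x[i] * x[i+1]).
    """
    return 1.0 / (1.0 + math.exp(-2.0 * coupling * (left + right)))


def odd_even_gibbs_sweep(x, coupling, rng):
    """One Gibbs sweep, done as two half-steps (even sites, then odd sites)."""
    n = len(x)
    for parity in (0, 1):
        sites = range(parity, n, 2)
        # Every new value here depends only on sites of the other parity,
        # so the iterations are independent and could in principle be
        # dispatched to separate threads or GPU lanes.
        new_values = []
        for i in sites:
            left = x[i - 1] if i > 0 else 0       # missing neighbour counts as 0
            right = x[i + 1] if i < n - 1 else 0
            p = conditional_p_plus(left, right, coupling)
            new_values.append(1 if rng.random() < p else -1)
        for i, value in zip(sites, new_values):
            x[i] = value
    return x


rng = random.Random(0)
state = [rng.choice((-1, 1)) for _ in range(50)]
for _ in range(100):
    odd_even_gibbs_sweep(state, coupling=0.5, rng=rng)
```

The inner loop is written sequentially for clarity; the half-step structure is what would make a parallel mapping valid in a model with this neighbour-only dependence.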

RESULTS

We implement our approach using existing Gibbs samplers redesigned for parallel hardware. We applied SWIFTLINK to a real-world dataset, performing parametric multipoint linkage analysis on a highly consanguineous pedigree with EAST syndrome, containing 28 members, where a subset of individuals were genotyped with single nucleotide polymorphisms (SNPs). In our experiments with a four-core CPU and GPU, SWIFTLINK achieves an 8.5× speed-up over the single-threaded version and a 109× speed-up over the popular linkage analysis program SIMWALK.
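The two reported speed-ups can be combined with a little arithmetic. Assuming both figures were measured on the same dataset and hardware (the abstract suggests this but does not state it outright), their ratio gives a rough estimate of how the single-threaded SWIFTLINK build compares with SIMWALK. The variable names below are illustrative only.

```python
# Speed-ups quoted in the abstract.
speedup_vs_single_thread = 8.5   # parallel SWIFTLINK vs. single-threaded SWIFTLINK
speedup_vs_simwalk = 109.0       # parallel SWIFTLINK vs. SIMWALK

# Assumption: both ratios share the same parallel run time in the numerator,
# so dividing them cancels it out.
implied_single_thread_vs_simwalk = speedup_vs_simwalk / speedup_vs_single_thread

print(f"Implied single-threaded speed-up over SIMWALK: "
      f"{implied_single_thread_vs_simwalk:.1f}x")  # about 12.8x
```

Under that assumption, roughly a factor of 12.8 of the overall gain would come from the implementation itself and a further factor of 8.5 from parallel execution.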

AVAILABILITY

SWIFTLINK is available at https://github.com/ajm/swiftlink. All source code is licensed under GPLv3.
