• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ParaHaplo 3.0:一个用于通过混合并行计算进行插补和基于单倍型的全基因组关联研究的程序包。

ParaHaplo 3.0: A program package for imputation and a haplotype-based whole-genome association study using hybrid parallel computing.

作者信息

Misawa Kazuharu, Kamatani Naoyuki

机构信息

Research Program for Computational Science, Research and Development Group for Next-Generation Integrated Living Matter Simulation, and Fusion of Data and Analysis Research and Development Team, RIKEN, 4-6-1 Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan.

出版信息

Source Code Biol Med. 2011 May 24;6(1):10. doi: 10.1186/1751-0473-6-10.

DOI:10.1186/1751-0473-6-10
PMID:21609440
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3123186/
Abstract

BACKGROUND

Use of missing genotype imputations and haplotype reconstructions are valuable in genome-wide association studies (GWASs). By modeling the patterns of linkage disequilibrium in a reference panel, genotypes not directly measured in the study samples can be imputed and used for GWASs. Since millions of single nucleotide polymorphisms need to be imputed in a GWAS, faster methods for genotype imputation and haplotype reconstruction are required.

RESULTS

We developed a program package for parallel computation of genotype imputation and haplotype reconstruction. Our program package, ParaHaplo 3.0, is intended for use in workstation clusters using the Intel Message Passing Interface. We compared the performance of ParaHaplo 3.0 on the Japanese in Tokyo, Japan and Han Chinese in Beijing, and Chinese in the HapMap dataset. A parallel version of ParaHaplo 3.0 can conduct genotype imputation 20 times faster than a non-parallel version of ParaHaplo.

CONCLUSIONS

ParaHaplo 3.0 is an invaluable tool for conducting haplotype-based GWASs. The need for faster genotype imputation and haplotype reconstruction using parallel computing will become increasingly important as the data sizes of such projects continue to increase. ParaHaplo executable binaries and program sources are available at http://en.sourceforge.jp/projects/parallelgwas/releases/.

摘要

背景

在全基因组关联研究(GWAS)中,使用缺失基因型推断和单倍型重建很有价值。通过对参考面板中的连锁不平衡模式进行建模,可以推断出研究样本中未直接测量的基因型,并将其用于GWAS。由于在GWAS中需要推断数百万个单核苷酸多态性,因此需要更快的基因型推断和单倍型重建方法。

结果

我们开发了一个用于基因型推断和单倍型重建并行计算的程序包。我们的程序包ParaHaplo 3.0旨在用于使用英特尔消息传递接口的工作站集群。我们比较了ParaHaplo 3.0在日本东京的日本人、北京的汉族人和HapMap数据集中的中国人中的性能。ParaHaplo 3.0的并行版本进行基因型推断的速度比ParaHaplo的非并行版本快20倍。

结论

ParaHaplo 3.0是进行基于单倍型的GWAS的宝贵工具。随着此类项目数据量的不断增加,使用并行计算更快地进行基因型推断和单倍型重建的需求将变得越来越重要。ParaHaplo可执行二进制文件和程序源代码可在http://en.sourceforge.jp/projects/parallelgwas/releases/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a630/3123186/e202a3373036/1751-0473-6-10-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a630/3123186/e202a3373036/1751-0473-6-10-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a630/3123186/e202a3373036/1751-0473-6-10-1.jpg

相似文献

1
ParaHaplo 3.0: A program package for imputation and a haplotype-based whole-genome association study using hybrid parallel computing.ParaHaplo 3.0:一个用于通过混合并行计算进行插补和基于单倍型的全基因组关联研究的程序包。
Source Code Biol Med. 2011 May 24;6(1):10. doi: 10.1186/1751-0473-6-10.
2
ParaHaplo 2.0: a program package for haplotype-estimation and haplotype-based whole-genome association study using parallel computing.ParaHaplo 2.0:一个用于单倍型估计和基于单倍型的全基因组关联研究的程序包,采用并行计算。
Source Code Biol Med. 2010 Jun 4;5:5. doi: 10.1186/1751-0473-5-5.
3
ParaHaplo: A program package for haplotype-based whole-genome association study using parallel computing.ParaHaplo:一个用于基于单倍型的全基因组关联研究的程序包,采用并行计算。
Source Code Biol Med. 2009 Oct 21;4:7. doi: 10.1186/1751-0473-4-7.
4
imputeqc: an R package for assessing imputation quality of genotypes and optimizing imputation parameters.imputeqc:一个用于评估基因型插补质量和优化插补参数的 R 包。
BMC Bioinformatics. 2020 Jul 24;21(Suppl 12):304. doi: 10.1186/s12859-020-03589-0.
5
Genotype Imputation in Genome-Wide Association Studies.全基因组关联研究中的基因型填充
Curr Protoc Hum Genet. 2019 Jun;102(1):e84. doi: 10.1002/cphg.84.
6
Genotype imputation and reference panel: a systematic evaluation on haplotype size and diversity.基因型填充与参考面板:关于单倍型大小和多样性的系统评估
Brief Bioinform. 2019 Nov 6. doi: 10.1093/bib/bbz108.
7
Genotype imputation with thousands of genomes.使用数千份基因组进行基因型推断。
G3 (Bethesda). 2011 Nov;1(6):457-70. doi: 10.1534/g3.111.001198. Epub 2011 Nov 1.
8
Strategies for single nucleotide polymorphism (SNP) genotyping to enhance genotype imputation in Gyr (Bos indicus) dairy cattle: Comparison of commercially available SNP chips.提高吉尔(印度瘤牛)奶牛单核苷酸多态性(SNP)基因分型以增强基因型填充的策略:市售SNP芯片的比较
J Dairy Sci. 2015 Jul;98(7):4969-89. doi: 10.3168/jds.2014-9213. Epub 2015 May 7.
9
A combined reference panel from the 1000 Genomes and UK10K projects improved rare variant imputation in European and Chinese samples.1000 基因组计划和 UK10K 项目的联合参考面板提高了欧洲和中国样本中罕见变异体的推断准确性。
Sci Rep. 2016 Dec 22;6:39313. doi: 10.1038/srep39313.
10
Genotype Imputation from Large Reference Panels.基于大型参考面板的基因型推断。
Annu Rev Genomics Hum Genet. 2018 Aug 31;19:73-96. doi: 10.1146/annurev-genom-083117-021602. Epub 2018 May 23.

本文引用的文献

1
Integrating common and rare genetic variation in diverse human populations.整合不同人类群体中的常见和罕见遗传变异。
Nature. 2010 Sep 2;467(7311):52-8. doi: 10.1038/nature09298.
2
ParaHaplo 2.0: a program package for haplotype-estimation and haplotype-based whole-genome association study using parallel computing.ParaHaplo 2.0:一个用于单倍型估计和基于单倍型的全基因组关联研究的程序包,采用并行计算。
Source Code Biol Med. 2010 Jun 4;5:5. doi: 10.1186/1751-0473-5-5.
3
Genome-wide association study of hematological and biochemical traits in a Japanese population.
全基因组关联研究在一个日本人群中的血液学和生物化学特征。
Nat Genet. 2010 Mar;42(3):210-5. doi: 10.1038/ng.531. Epub 2010 Feb 7.
4
ParaHaplo: A program package for haplotype-based whole-genome association study using parallel computing.ParaHaplo:一个用于基于单倍型的全基因组关联研究的程序包,采用并行计算。
Source Code Biol Med. 2009 Oct 21;4:7. doi: 10.1186/1751-0473-4-7.
5
Genotype imputation.基因型推算
Annu Rev Genomics Hum Genet. 2009;10:387-406. doi: 10.1146/annurev.genom.9.081307.164242.
6
A comprehensive evaluation of SNP genotype imputation.单核苷酸多态性(SNP)基因型填充的综合评估。
Hum Genet. 2009 Mar;125(2):163-71. doi: 10.1007/s00439-008-0606-5. Epub 2008 Dec 17.
7
Complete sequence of the duckweed (Lemna minor) chloroplast genome: structural organization and phylogenetic relationships to other angiosperms.浮萍(青萍)叶绿体基因组的完整序列:结构组织及与其他被子植物的系统发育关系
J Mol Evol. 2008 Jun;66(6):555-64. doi: 10.1007/s00239-008-9091-7. Epub 2008 May 8.
8
ITPKC functional polymorphism associated with Kawasaki disease susceptibility and formation of coronary artery aneurysms.ITPKC功能多态性与川崎病易感性及冠状动脉瘤形成相关。
Nat Genet. 2008 Jan;40(1):35-42. doi: 10.1038/ng.2007.59. Epub 2007 Dec 16.
9
The BioBank Japan Project.日本生物银行项目。
Clin Adv Hematol Oncol. 2007 Sep;5(9):696-7.
10
A new multipoint method for genome-wide association studies by imputation of genotypes.一种通过基因型插补进行全基因组关联研究的新的多点方法。
Nat Genet. 2007 Jul;39(7):906-13. doi: 10.1038/ng2088. Epub 2007 Jun 17.