Suppr超能文献

使用Crossbow在云端进行基因分型。

Genotyping in the cloud with Crossbow.

作者信息

Gurtowski James, Schatz Michael C, Langmead Ben

机构信息

Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York.

Department of Computer Science, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland.

出版信息

Curr Protoc Bioinformatics. 2012 Sep;Chapter 15:15.3.1-15.3.15. doi: 10.1002/0471250953.bi1503s39.

Abstract

Crossbow is a scalable, portable, and automatic cloud computing tool for identifying SNPs from high-coverage, short-read resequencing data. It is built on Apache Hadoop, an implementation of the MapReduce software framework. Hadoop allows Crossbow to distribute read alignment and SNP calling subtasks over a cluster of commodity computers. Two robust tools, Bowtie and SOAPsnp, implement the fundamental alignment and variant calling operations respectively, and have demonstrated capabilities within Crossbow of analyzing approximately one billion short reads per hour on a commodity Hadoop cluster with 320 cores. Through protocol examples, this unit will demonstrate the use of Crossbow for identifying variations in three different operating modes: on a Hadoop cluster, on a single computer, and on the Amazon Elastic MapReduce cloud computing service.

摘要

Crossbow是一款可扩展、便携且自动化的云计算工具,用于从高覆盖度、短读长重测序数据中识别单核苷酸多态性(SNP)。它基于Apache Hadoop构建,后者是MapReduce软件框架的一种实现。Hadoop使Crossbow能够在一组商用计算机上分布式执行读段比对和SNP检测子任务。两个强大的工具Bowtie和SOAPsnp分别实现基本的比对和变异检测操作,并且在一个拥有320个核心的商用Hadoop集群上,已证明它们在Crossbow中具备每小时分析约10亿条短读段的能力。通过协议示例,本单元将演示如何在三种不同操作模式下使用Crossbow识别变异:在Hadoop集群上、在单台计算机上以及在亚马逊弹性MapReduce云计算服务上。

相似文献

1
Genotyping in the cloud with Crossbow.使用Crossbow在云端进行基因分型。
Curr Protoc Bioinformatics. 2012 Sep;Chapter 15:15.3.1-15.3.15. doi: 10.1002/0471250953.bi1503s39.
2
Searching for SNPs with cloud computing.利用云计算搜索 SNP。
Genome Biol. 2009;10(11):R134. doi: 10.1186/gb-2009-10-11-r134. Epub 2009 Nov 20.
9
Long Read Alignment with Parallel MapReduce Cloud Platform.使用并行MapReduce云平台进行长读段比对
Biomed Res Int. 2015;2015:807407. doi: 10.1155/2015/807407. Epub 2015 Dec 29.
10
Cloud-BS: A MapReduce-based bisulfite sequencing aligner on cloud.Cloud-BS:一种基于MapReduce的云端亚硫酸氢盐测序比对器。
J Bioinform Comput Biol. 2018 Dec;16(6):1840028. doi: 10.1142/S0219720018400280. Epub 2018 Oct 30.

引用本文的文献

5
Survey of gene splicing algorithms based on reads.基于读取的基因剪接算法调查。
Bioengineered. 2017 Nov 2;8(6):750-758. doi: 10.1080/21655979.2017.1373538. Epub 2017 Sep 21.
6
Novel bioinformatic developments for exome sequencing.外显子组测序的新型生物信息学进展
Hum Genet. 2016 Jun;135(6):603-14. doi: 10.1007/s00439-016-1658-6. Epub 2016 Apr 13.
8
Parallel computing in genomic research: advances and applications.基因组研究中的并行计算:进展与应用
Adv Appl Bioinform Chem. 2015 Nov 13;8:23-35. doi: 10.2147/AABC.S64482. eCollection 2015.
10
Next generation distributed computing for cancer research.用于癌症研究的下一代分布式计算。
Cancer Inform. 2015 Apr 27;13(Suppl 7):97-109. doi: 10.4137/CIN.S16344. eCollection 2014.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验