Suppr超能文献

快速准确的重测序读对齐。

Fast and accurate read alignment for resequencing.

机构信息

Department of Electrical Engineering, Stanford University, Stanford, CA 94305, USA.

出版信息

Bioinformatics. 2012 Sep 15;28(18):2366-73. doi: 10.1093/bioinformatics/bts450. Epub 2012 Jul 18.

Abstract

MOTIVATION

Next-generation sequence analysis has become an important task both in laboratory and clinical settings. A key stage in the majority sequence analysis workflows, such as resequencing, is the alignment of genomic reads to a reference genome. The accurate alignment of reads with large indels is a computationally challenging task for researchers.

RESULTS

We introduce SeqAlto as a new algorithm for read alignment. For reads longer than or equal to 100 bp, SeqAlto is up to 10 × faster than existing algorithms, while retaining high accuracy and the ability to align reads with large (up to 50 bp) indels. This improvement in efficiency is particularly important in the analysis of future sequencing data where the number of reads approaches many billions. Furthermore, SeqAlto uses less than 8 GB of memory to align against the human genome. SeqAlto is benchmarked against several existing tools with both real and simulated data.

AVAILABILITY

Linux and Mac OS X binaries free for academic use are available at http://www.stanford.edu/group/wonglab/seqalto

CONTACT

whwong@stanford.edu.

摘要

动机

下一代测序分析在实验室和临床环境中都已成为一项重要任务。在大多数测序工作流程(如重测序)中,一个关键步骤是将基因组读取与参考基因组进行比对。对于研究人员来说,准确比对具有较大插入/缺失(indels)的读取是一项具有挑战性的计算任务。

结果

我们引入了 SeqAlto 作为一种新的读取对齐算法。对于长度等于或大于 100 bp 的读取,SeqAlto 的速度比现有算法快 10 倍,同时保持了高精度和对齐具有较大(高达 50 bp)插入/缺失的读取的能力。这种效率的提高在未来测序数据分析中尤为重要,因为读取数量接近数十亿。此外,SeqAlto 在对齐人类基因组时使用的内存少于 8GB。我们使用真实数据和模拟数据对 SeqAlto 进行了基准测试,并与几个现有工具进行了比较。

可用性

可在 http://www.stanford.edu/group/wonglab/seqalto 上免费获取适用于学术用途的 Linux 和 Mac OS X 二进制文件。

联系方式

whwong@stanford.edu

相似文献

1
Fast and accurate read alignment for resequencing.快速准确的重测序读对齐。
Bioinformatics. 2012 Sep 15;28(18):2366-73. doi: 10.1093/bioinformatics/bts450. Epub 2012 Jul 18.
2
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
3
SRmapper: a fast and sensitive genome-hashing alignment tool.SRmapper:一种快速且灵敏的基因组哈希比对工具。
Bioinformatics. 2013 Feb 1;29(3):316-21. doi: 10.1093/bioinformatics/bts712. Epub 2012 Dec 24.
4
Comparative analysis of algorithms for next-generation sequencing read alignment.下一代测序读段比对算法的比较分析。
Bioinformatics. 2011 Oct 15;27(20):2790-6. doi: 10.1093/bioinformatics/btr477. Epub 2011 Aug 19.
5
Ψ-RA: a parallel sparse index for genomic read alignment.Ψ-RA:一种用于基因组读取比对的并行稀疏索引。
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2164-12-S2-S7. Epub 2011 Jul 27.
6
ARYANA: Aligning Reads by Yet Another Approach.ARYANA:另一种方法进行读段对齐。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S12. doi: 10.1186/1471-2105-15-S9-S12. Epub 2014 Sep 10.
9
Fast and accurate long-read alignment with Burrows-Wheeler transform.基于 Burrows-Wheeler 变换的快速准确长读比对。
Bioinformatics. 2010 Mar 1;26(5):589-95. doi: 10.1093/bioinformatics/btp698. Epub 2010 Jan 15.

引用本文的文献

本文引用的文献

1
Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。
Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.
2
ART: a next-generation sequencing read simulator.ART:一种新一代测序读模拟程序。
Bioinformatics. 2012 Feb 15;28(4):593-4. doi: 10.1093/bioinformatics/btr708. Epub 2011 Dec 23.
5
SHRiMP2: sensitive yet practical SHort Read Mapping.SHRiMP2:敏感而实用的短读序列比对。
Bioinformatics. 2011 Apr 1;27(7):1011-2. doi: 10.1093/bioinformatics/btr046. Epub 2011 Jan 28.
8
GASSST: global alignment short sequence search tool.GASSST:全局比对短序列搜索工具。
Bioinformatics. 2010 Oct 15;26(20):2534-40. doi: 10.1093/bioinformatics/btq485. Epub 2010 Aug 24.
9
Fast and accurate long-read alignment with Burrows-Wheeler transform.基于 Burrows-Wheeler 变换的快速准确长读比对。
Bioinformatics. 2010 Mar 1;26(5):589-95. doi: 10.1093/bioinformatics/btp698. Epub 2010 Jan 15.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验