Suppr
超能文献

PASS：一个用于比对短序列的程序。

PASS: a program to align short sequences.

作者信息

Campagna Davide, Albiero Alessandro, Bilardi Alessandra, Caniato Elisa, Forcato Claudio, Manavski Svetlin, Vitulo Nicola, Valle Giorgio

机构信息

CRIBI Biotechnology Centre, University of Padua, Padova, Italy.

出版信息

Bioinformatics. 2009 Apr 1;25(7):967-8. doi: 10.1093/bioinformatics/btp087. Epub 2009 Feb 13.

DOI:10.1093/bioinformatics/btp087

PMID:19218350

Abstract

SUMMARY

Standard DNA alignment programs are inadequate to manage the data produced by new generation DNA sequencers. To answer this problem, we developed PASS with the objective of improving execution time and sensitivity when compared with other available programs. PASS performs fast gapped and ungapped alignments of short DNA sequences onto a reference DNA, typically a genomic sequence. It is designed to handle a huge amount of reads such as those generated by Solexa, SOLiD or 454 technologies. The algorithm is based on a data structure that holds in RAM the index of the genomic positions of 'seed' words (typically 11 and 12 bases) as well as an index of the precomputed scores of short words (typically seven and eight bases) aligned against each other. After building the genomic index, the program scans every query sequence performing three steps: (1) it finds matching seed words in the genome; (2) for every match checks the precomputed alignment of the short flanking regions; (3) if passes step 2, then it performs an exact dynamic alignment of a narrow region around the match. The performance of the program is very striking both for sensitivity and speed. For instance, gap alignment is achieved hundreds of times faster than BLAST and several times faster than SOAP, especially when gaps are allowed. Furthermore, PASS has a higher sensitivity when compared with the other available programs.

AVAILABILITY AND IMPLEMENTATION

Source code and binaries are freely available for download at http://pass.cribi.unipd.it, implemented in C++and supported on Linux and Windows.

摘要

标准的DNA比对程序不足以处理新一代DNA测序仪产生的数据。为了解决这个问题，我们开发了PASS，目的是与其他现有程序相比，提高执行时间和灵敏度。PASS能将短DNA序列与参考DNA（通常是基因组序列）进行快速的带空位和不带空位的比对。它旨在处理大量的读段，比如由Solexa、SOLiD或454技术产生的读段。该算法基于一种数据结构，这种数据结构在随机存取存储器（RAM）中保存“种子”词（通常为11和12个碱基）的基因组位置索引，以及相互比对的短词（通常为7和8个碱基）的预计算得分索引。构建基因组索引后，程序扫描每个查询序列，执行三个步骤：（1）在基因组中找到匹配的种子词；（2）对于每个匹配项，检查侧翼短区域的预计算比对；（3）如果通过步骤2，那么它会对匹配周围的一个狭窄区域进行精确的动态比对。该程序在灵敏度和速度方面的表现都非常出色。例如，带空位比对的速度比BLAST快数百倍，比SOAP快几倍，尤其是在允许有空位的情况下。此外，与其他现有程序相比，PASS具有更高的灵敏度。

可用性和实现方式

源代码和二进制文件可在http://pass.cribi.unipd.it免费下载，用C++实现，支持Linux和Windows系统。

相似文献

PASS: a program to align short sequences.

Bioinformatics. 2009 Apr 1;25(7):967-8. doi: 10.1093/bioinformatics/btp087. Epub 2009 Feb 13.

GASSST: global alignment short sequence search tool.

Bioinformatics. 2010 Oct 15;26(20):2534-40. doi: 10.1093/bioinformatics/btq485. Epub 2010 Aug 24.

Exact and complete short-read alignment to microbial genomes using Graphics Processing Unit programming.

Bioinformatics. 2011 May 15;27(10):1351-8. doi: 10.1093/bioinformatics/btr151. Epub 2011 Mar 30.

PASS-bis: a bisulfite aligner suitable for whole methylome analysis of Illumina and SOLiD reads.

Bioinformatics. 2013 Jan 15;29(2):268-70. doi: 10.1093/bioinformatics/bts675. Epub 2012 Nov 17.

ProbeMatch: rapid alignment of oligonucleotides to genome allowing both gaps and mismatches.

Bioinformatics. 2009 Jun 1;25(11):1424-5. doi: 10.1093/bioinformatics/btp178. Epub 2009 Apr 7.

SOAP: short oligonucleotide alignment program.

Bioinformatics. 2008 Mar 1;24(5):713-4. doi: 10.1093/bioinformatics/btn025. Epub 2008 Jan 28.

RAP: a new computer program for de novo identification of repeated sequences in whole genomes.

Bioinformatics. 2005 Mar 1;21(5):582-8. doi: 10.1093/bioinformatics/bti039. Epub 2004 Sep 16.

Fast and accurate short read alignment with Burrows-Wheeler transform.

Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.

transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.

BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.

Ψ-RA: a parallel sparse index for genomic read alignment.

BMC Genomics. 2011;12 Suppl 2(Suppl 2):S7. doi: 10.1186/1471-2164-12-S2-S7. Epub 2011 Jul 27.

引用本文的文献

mebipred: identifying metal-binding potential in protein sequence.

Bioinformatics. 2022 Jul 11;38(14):3532-3540. doi: 10.1093/bioinformatics/btac358.

Identification of Known and Novel L. MicroRNAs and Their Targets Using High-Throughput Sequencing and Degradome Analysis.

Life (Basel). 2022 Apr 27;12(5):651. doi: 10.3390/life12050651.

Technology dictates algorithms: recent developments in read alignment.

Genome Biol. 2021 Aug 26;22(1):249. doi: 10.1186/s13059-021-02443-7.

Drought stress modulates cuticular wax composition of the grape berry.

J Exp Bot. 2020 May 30;71(10):3126-3141. doi: 10.1093/jxb/eraa046.

High-throughput sequencing of the chloroplast and mitochondrion of Chlamydomonas reinhardtii to generate improved de novo assemblies, analyze expression patterns and transcript speciation, and evaluate diversity among laboratory strains and wild isolates.

Plant J. 2018 Feb;93(3):545-565. doi: 10.1111/tpj.13788. Epub 2018 Jan 7.

Brain RNA-Seq Profiling of the Mucopolysaccharidosis Type II Mouse Model.

Int J Mol Sci. 2017 May 17;18(5):1072. doi: 10.3390/ijms18051072.

Short Read Mapping: An Algorithmic Tour.

Proc IEEE Inst Electr Electron Eng. 2017 Mar;105(3):436-458. doi: 10.1109/JPROC.2015.2455551. Epub 2015 Sep 7.

Transcriptional Characterization of a Widely-Used Grapevine Rootstock Genotype under Different Iron-Limited Conditions.

Front Plant Sci. 2017 Jan 5;7:1994. doi: 10.3389/fpls.2016.01994. eCollection 2016.

From next-generation resequencing reads to a high-quality variant data set.

Heredity (Edinb). 2017 Feb;118(2):111-124. doi: 10.1038/hdy.2016.102. Epub 2016 Oct 19.

Direct 16S rRNA-seq from bacterial communities: a PCR-independent approach to simultaneously assess microbial diversity and functional activity potential of each taxon.

Sci Rep. 2016 Aug 31;6:32165. doi: 10.1038/srep32165.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

PASS：一个用于比对短序列的程序。

PASS: a program to align short sequences.

作者信息

机构信息

出版信息

SUMMARY

AVAILABILITY AND IMPLEMENTATION

摘要

可用性和实现方式

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译