基于随机排列的快速读取对齐方法。

A random-permutations-based approach to fast read alignment.

机构信息

Applied Mathematics Program, Yale University, 51 Prospect St., New Haven, CT 06511, USA.

出版信息

BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S8. doi: 10.1186/1471-2105-14-S5-S8. Epub 2013 Apr 10.

DOI:10.1186/1471-2105-14-S5-S8

PMID:23734846

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3622637/

Abstract

BACKGROUND

Read alignment is a computational bottleneck in some sequencing projects. Most of the existing software packages for read alignment are based on two algorithmic approaches: prefix-trees and hash-tables. We propose a new approach to read alignment using random permutations of strings.

RESULTS

We present a prototype implementation and experiments performed with simulated and real reads of human DNA. Our experiments indicate that this permutations-based prototype is several times faster than comparable programs for fast read alignment and that it aligns more reads correctly.

CONCLUSIONS

This approach may lead to improved speed, sensitivity, and accuracy in read alignment. The algorithm can also be used for specialized alignment applications and it can be extended to other related problems, such as assembly.More information: http://alignment.commons.yale.edu.

摘要

背景

在某些测序项目中，读取比对是一个计算瓶颈。大多数现有的读取比对软件包都基于两种算法方法：前缀树和哈希表。我们提出了一种使用字符串随机排列进行读取比对的新方法。

结果

我们提出了一个原型实现，并使用人类 DNA 的模拟和真实读取进行了实验。我们的实验表明，这种基于排列的原型比用于快速读取比对的可比程序快几倍，并且它可以正确对齐更多的读取。

结论

这种方法可能会提高读取比对的速度、灵敏度和准确性。该算法还可用于专门的对齐应用程序，并且可以扩展到其他相关问题，例如组装。更多信息：http://alignment.commons.yale.edu。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6bf/3622637/8eb2d45d80f9/1471-2105-14-S5-S8-1.jpg

相似文献

A random-permutations-based approach to fast read alignment.基于随机排列的快速读取对齐方法。

BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S8. doi: 10.1186/1471-2105-14-S5-S8. Epub 2013 Apr 10.

Fast and accurate read alignment for resequencing.快速准确的重测序读对齐。

Bioinformatics. 2012 Sep 15;28(18):2366-73. doi: 10.1093/bioinformatics/bts450. Epub 2012 Jul 18.

ARYANA: Aligning Reads by Yet Another Approach.ARYANA：另一种方法进行读段对齐。

BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S12. doi: 10.1186/1471-2105-15-S9-S12. Epub 2014 Sep 10.

Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。

Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.

Comparative analysis of algorithms for next-generation sequencing read alignment.下一代测序读段比对算法的比较分析。

Bioinformatics. 2011 Oct 15;27(20):2790-6. doi: 10.1093/bioinformatics/btr477. Epub 2011 Aug 19.

A fast read alignment method based on seed-and-vote for next generation sequencing.一种基于种子与投票的用于下一代测序的快速读段比对方法。

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):466. doi: 10.1186/s12859-016-1329-6.

SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees.SRPRISM（单读配对读插入缺失替换最小化器）：具有明确保证的组装的高效对齐器。

Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa023.

Improving read mapping using additional prefix grams.利用附加前缀词提高读段匹配度。

BMC Bioinformatics. 2014 Feb 5;15:42. doi: 10.1186/1471-2105-15-42.

Mapping RNA-seq reads to transcriptomes efficiently based on learning to hash method.基于学习哈希方法的高效 RNA-seq reads 转录组映射。

Comput Biol Med. 2020 Jan;116:103539. doi: 10.1016/j.compbiomed.2019.103539. Epub 2019 Nov 13.

A consistency-based consensus algorithm for de novo and reference-guided sequence assembly of short reads.一种基于一致性的从头和参考引导短读段序列组装的共识算法。

Bioinformatics. 2009 May 1;25(9):1118-24. doi: 10.1093/bioinformatics/btp131. Epub 2009 Mar 5.

引用本文的文献

A survey of mapping algorithms in the long-reads era.长读时代的图谱算法研究综述。

Genome Biol. 2023 Jun 1;24(1):133. doi: 10.1186/s13059-023-02972-3.

Entropy predicts sensitivity of pseudorandom seeds.熵预测伪随机种子的敏感性。

Genome Res. 2023 Jul;33(7):1162-1174. doi: 10.1101/gr.277645.123. Epub 2023 May 22.

BLEND: a fast, memory-efficient and accurate mechanism to find fuzzy seed matches in genome analysis.BLEND：一种在基因组分析中快速、节省内存且准确地查找模糊种子匹配项的机制。

NAR Genom Bioinform. 2023 Jan 20;5(1):lqad004. doi: 10.1093/nargab/lqad004. eCollection 2023 Mar.

Strobealign: flexible seed size enables ultra-fast and accurate read alignment.Strobealign：灵活的种子大小可实现超快速和准确的读取对齐。

Genome Biol. 2022 Dec 15;23(1):260. doi: 10.1186/s13059-022-02831-7.

Effective sequence similarity detection with strobemers.利用频闪体进行有效的序列相似性检测。

Genome Res. 2021 Nov;31(11):2080-2094. doi: 10.1101/gr.275648.121. Epub 2021 Oct 19.

Ritornello: high fidelity control-free chromatin immunoprecipitation peak calling.利托内洛：高保真无对照染色质免疫沉淀峰检测

Nucleic Acids Res. 2017 Dec 1;45(21):e173. doi: 10.1093/nar/gkx799.

本文引用的文献

Tools for mapping high-throughput sequencing data.高通量测序数据映射工具。

Bioinformatics. 2012 Dec 15;28(24):3169-77. doi: 10.1093/bioinformatics/bts605. Epub 2012 Oct 11.

Fast gapped-read alignment with Bowtie 2.快速缺口读对准与 Bowtie 2。

Nat Methods. 2012 Mar 4;9(4):357-9. doi: 10.1038/nmeth.1923.

Hobbes: optimized gram-based methods for efficient read alignment.霍布斯：基于优化格的方法，用于高效的读对齐。

Nucleic Acids Res. 2012 Mar;40(6):e41. doi: 10.1093/nar/gkr1246. Epub 2011 Dec 22.

Randomized approximate nearest neighbors algorithm.随机近邻算法。

Proc Natl Acad Sci U S A. 2011 Sep 20;108(38):15679-86. doi: 10.1073/pnas.1107769108. Epub 2011 Sep 1.

A map of human genome variation from population-scale sequencing.人类基因组变异的图谱来自于基于人群的测序。

Nature. 2010 Oct 28;467(7319):1061-73. doi: 10.1038/nature09534.

mrsFAST: a cache-oblivious algorithm for short-read mapping.mrsFAST：一种用于短读段映射的缓存无关算法。

Nat Methods. 2010 Aug;7(8):576-7. doi: 10.1038/nmeth0810-576.

A survey of sequence alignment algorithms for next-generation sequencing.下一代测序序列比对算法综述。

Brief Bioinform. 2010 Sep;11(5):473-83. doi: 10.1093/bib/bbq015. Epub 2010 May 11.

Sense from sequence reads: methods for alignment and assembly.从序列读取中获取意义：比对和组装方法

Nat Methods. 2009 Nov;6(11 Suppl):S6-S12. doi: 10.1038/nmeth.1376.

Personalized copy number and segmental duplication maps using next-generation sequencing.使用下一代测序技术构建个性化拷贝数和片段重复图谱。

Nat Genet. 2009 Oct;41(10):1061-7. doi: 10.1038/ng.437. Epub 2009 Aug 30.

RazerS--fast read mapping with sensitivity control.RazerS——具有灵敏度控制的快速读取映射。

Genome Res. 2009 Sep;19(9):1646-54. doi: 10.1101/gr.088823.108. Epub 2009 Jul 10.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于随机排列的快速读取对齐方法。

A random-permutations-based approach to fast read alignment.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献