Suppr超能文献

YAHA:快速灵活的长读比对,具有最佳断点检测功能。

YAHA: fast and flexible long-read alignment with optimal breakpoint detection.

机构信息

Department of Computer Science, University of Virginia, Charlottesville, VA 22908, USA.

出版信息

Bioinformatics. 2012 Oct 1;28(19):2417-24. doi: 10.1093/bioinformatics/bts456. Epub 2012 Jul 24.

Abstract

MOTIVATION

With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this.

RESULTS

We present YAHA, a fast and flexible hash-based aligner. YAHA is as fast and accurate as BWA-SW at finding the single best alignment per query and is dramatically faster and more sensitive than both SSAHA2 and MegaBLAST at finding all possible alignments. Unlike other aligners that report all, or one, alignment per query, or that use simple heuristics to select alignments, YAHA uses a directed acyclic graph to find the optimal set of alignments that cover a query using a biologically relevant breakpoint penalty. YAHA can also report multiple mappings per defined segment of the query. We show that YAHA detects more breakpoints in less time than BWA-SW across all SV classes, and especially excels at complex SVs comprising multiple breakpoints.

AVAILABILITY

YAHA is currently supported on 64-bit Linux systems. Binaries and sample data are freely available for download from http://faculty.virginia.edu/irahall/YAHA.

CONTACT

imh4y@virginia.edu.

摘要

动机

随着短读长序列组装算法的改进和长读测序仪的最新发展,拆分映射将很快成为结构变异 (SV) 检测的首选方法。然而,当前的比对工具对此并不完全适用。

结果

我们提出了 YAHA,一种快速灵活的基于哈希的比对工具。YAHA 在寻找每个查询的最佳单一对齐方面与 BWA-SW 一样快速和准确,并且在寻找所有可能对齐方面比 SSAHA2 和 MegaBLAST 都快得多,也更敏感。与其他报告每个查询的所有或一个对齐,或使用简单启发式选择对齐的比对工具不同,YAHA 使用有向无环图来找到使用生物学相关断点惩罚覆盖查询的最佳对齐集。YAHA 还可以为查询的定义段报告多个映射。我们表明,在所有 SV 类别中,YAHA 比 BWA-SW 更快地检测到更多的断点,特别是在包含多个断点的复杂 SV 方面表现出色。

可用性

YAHA 目前支持 64 位 Linux 系统。二进制文件和示例数据可从 http://faculty.virginia.edu/irahall/YAHA 免费下载。

联系方式

imh4y@virginia.edu

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/daad/3463118/82c5a0cff7d5/bts456f1p.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验