YAHA: fast and flexible long-read alignment with optimal breakpoint detection.

Affiliation

Department of Computer Science, University of Virginia, Charlottesville, VA 22908, USA.

Publication information

Bioinformatics. 2012 Oct 1;28(19):2417-24. doi: 10.1093/bioinformatics/bts456. Epub 2012 Jul 24.

Abstract

MOTIVATION

With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this.

RESULTS

We present YAHA, a fast and flexible hash-based aligner. YAHA is as fast and accurate as BWA-SW at finding the single best alignment per query and is dramatically faster and more sensitive than both SSAHA2 and MegaBLAST at finding all possible alignments. Unlike other aligners that report all, or one, alignment per query, or that use simple heuristics to select alignments, YAHA uses a directed acyclic graph to find the optimal set of alignments that cover a query using a biologically relevant breakpoint penalty. YAHA can also report multiple mappings per defined segment of the query. We show that YAHA detects more breakpoints in less time than BWA-SW across all SV classes, and especially excels at complex SVs comprising multiple breakpoints.
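The idea of choosing an optimal set of alignments with a directed acyclic graph can be illustrated with a small sketch. This is not YAHA's implementation: the `Alignment` record, the `best_alignment_set` function, the flat `breakpoint_penalty` value, and the no-overlap rule are all assumptions made for illustration. Each alignment is a node weighted by its score, an edge joins two alignments when the second begins at or after the end of the first on the query, and each edge costs one breakpoint penalty. The highest-scoring path is then found by dynamic programming over the nodes in query order.

```python
from typing import List, NamedTuple, Tuple


class Alignment(NamedTuple):
    """Hypothetical record: a local alignment's span on the query and its score."""
    q_start: int  # query start, inclusive
    q_end: int    # query end, exclusive
    score: int


def best_alignment_set(
    alns: List[Alignment], breakpoint_penalty: int
) -> Tuple[int, List[Alignment]]:
    """Return the best total score and the chain of alignments that achieves it.

    Assumes non-empty, non-overlapping chaining: alignment b may follow a
    only if b.q_start >= a.q_end. Every junction costs breakpoint_penalty.
    """
    if not alns:
        return 0, []

    # Sorting by query start gives a topological order of the DAG,
    # because an edge a -> b requires a.q_start < a.q_end <= b.q_start.
    order = sorted(alns, key=lambda a: (a.q_start, a.q_end))
    n = len(order)
    best = [a.score for a in order]  # best path score ending at each node
    prev = [-1] * n                  # predecessor on that best path

    for j in range(n):
        for i in range(j):
            if order[j].q_start >= order[i].q_end:
                cand = best[i] + order[j].score - breakpoint_penalty
                if cand > best[j]:
                    best[j] = cand
                    prev[j] = i

    # Walk back from the best-scoring end node.
    end = max(range(n), key=best.__getitem__)
    chain = []
    k = end
    while k != -1:
        chain.append(order[k])
        k = prev[k]
    chain.reverse()
    return best[end], chain


if __name__ == "__main__":
    candidates = [
        Alignment(0, 100, 100),
        Alignment(100, 200, 90),
        Alignment(0, 200, 150),
        Alignment(40, 60, 10),
    ]
    print(best_alignment_set(candidates, breakpoint_penalty=20))
    print(best_alignment_set(candidates, breakpoint_penalty=50))
```

In this toy input, a low penalty favors the two-piece split (one breakpoint), while a high penalty favors the single full-length alignment, which is the trade-off a breakpoint penalty is meant to control.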

AVAILABILITY

YAHA is currently supported on 64-bit Linux systems. Binaries and sample data are freely available for download from http://faculty.virginia.edu/irahall/YAHA.

CONTACT

imh4y@virginia.edu.
