使用HISAT-3N对核苷酸转换测序读数进行快速准确的比对。

Rapid and accurate alignment of nucleotide conversion sequencing reads with HISAT-3N.

作者信息

Zhang Yun, Park Chanhee, Bennett Christopher, Thornton Micah, Kim Daehwan

机构信息

Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA.

出版信息

Genome Res. 2021 Jul;31(7):1290-1295. doi: 10.1101/gr.275193.120. Epub 2021 Jun 8.

Abstract

Sequencing technologies using nucleotide conversion techniques such as cytosine to thymine in bisulfite-seq and thymine to cytosine in SLAM seq are powerful tools to explore the chemical intricacies of cellular processes. To date, no one has developed a unified methodology for aligning converted sequences and consolidating alignment of these technologies in one package. In this paper, we describe hierarchical indexing for spliced alignment of transcripts-3 nucleotides (HISAT-3N), which can rapidly and accurately align sequences consisting of any nucleotide conversion by leveraging the powerful hierarchical index and repeat index algorithms originally developed for the HISAT software. Tests on real and simulated data sets show that HISAT-3N is faster than other modern systems, with greater alignment accuracy, higher scalability, and smaller memory requirements. HISAT-3N therefore becomes an ideal aligner when used with converted sequence technologies.

摘要

使用核苷酸转换技术的测序技术,如亚硫酸氢盐测序中胞嘧啶到胸腺嘧啶的转换以及SLAM测序中胸腺嘧啶到胞嘧啶的转换,是探索细胞过程化学复杂性的强大工具。迄今为止,还没有人开发出一种统一的方法来比对转换后的序列,并将这些技术的比对整合到一个软件包中。在本文中,我们描述了用于转录本-3核苷酸剪接比对的分层索引(HISAT-3N),它可以通过利用最初为HISAT软件开发的强大分层索引和重复索引算法,快速准确地比对由任何核苷酸转换组成的序列。对真实和模拟数据集的测试表明,HISAT-3N比其他现代系统更快,具有更高的比对准确性、更高的可扩展性和更小的内存需求。因此,当与转换序列技术一起使用时,HISAT-3N成为理想的比对器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5758/8256862/ae1a6ffff78c/1290f01.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索