
FastRemap: a tool for quickly remapping reads between genome assemblies.

Affiliations

Department of Computer Engineering, ETH Zurich, D-ITET, Zurich 8006, Switzerland.

Department of Computer Engineering, Bionano Genomics, San Diego, CA 92121, USA.

Publication information

Bioinformatics. 2022 Sep 30;38(19):4633-4635. doi: 10.1093/bioinformatics/btac554.

DOI:10.1093/bioinformatics/btac554
PMID:35976109
Abstract

MOTIVATION

A genome read dataset can be quickly and efficiently remapped from one reference to another similar reference (e.g., between two reference versions or two similar species) using a variety of tools, e.g., the commonly used CrossMap tool. With the explosion of available genomic datasets and references, high-performance remapping tools will be even more important for keeping up with the computational demands of genome assembly and analysis.

RESULTS

We provide FastRemap, a fast and efficient tool for remapping reads between genome assemblies. FastRemap achieves up to a 7.82× speedup (6.47× on average) and uses as little as 61.7% (80.7% on average) of the peak memory of the state-of-the-art remapping tool, CrossMap.

AVAILABILITY AND IMPLEMENTATION

FastRemap is written in C++. Source code and user manual are freely available at: github.com/CMU-SAFARI/FastRemap. Docker image available at: https://hub.docker.com/r/alkanlab/fastremap. Also available in Bioconda at: https://anaconda.org/bioconda/fastremap-bio.
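
To make the notion of remapping between two similar references concrete, the following is a minimal Python sketch of block-based coordinate conversion. It is not FastRemap's or CrossMap's implementation: the `Block` type, the `BLOCKS` table, and the `lift` and `remap_read` helpers are hypothetical names introduced for illustration, and the block coordinates are made up. It assumes the correspondence between assemblies is given as sorted, non-overlapping, ungapped blocks, and it simply reports a read as unmapped when it does not fit inside one block.

```python
from bisect import bisect_right
from typing import NamedTuple, Optional, Tuple


class Block(NamedTuple):
    """One ungapped aligned block between two assemblies (hypothetical type)."""
    src_start: int  # 0-based start on the source assembly
    dst_start: int  # 0-based start on the target assembly
    length: int     # number of bases covered by the block


# Made-up example blocks, sorted by src_start and non-overlapping.
BLOCKS = [
    Block(0, 0, 100),      # source 0..99    -> target 0..99
    Block(100, 150, 50),   # source 100..149 -> target 150..199
    Block(200, 200, 300),  # source 200..499 -> target 200..499
]
STARTS = [b.src_start for b in BLOCKS]


def find_block(pos: int) -> Optional[int]:
    """Return the index of the block containing pos, or None if uncovered."""
    i = bisect_right(STARTS, pos) - 1
    if i < 0:
        return None
    block = BLOCKS[i]
    if pos >= block.src_start + block.length:
        return None  # pos falls in a gap between blocks
    return i


def lift(pos: int) -> Optional[int]:
    """Convert a single source position to the target assembly."""
    i = find_block(pos)
    if i is None:
        return None
    block = BLOCKS[i]
    return block.dst_start + (pos - block.src_start)


def remap_read(start: int, end: int) -> Optional[Tuple[int, int]]:
    """Remap a half-open read interval [start, end).

    Simplifying assumption: the read is remapped only if it lies entirely
    within one block; otherwise it is reported as unmapped (None).
    """
    first = find_block(start)
    last = find_block(end - 1)
    if first is None or first != last:
        return None
    offset = BLOCKS[first].dst_start - BLOCKS[first].src_start
    return (start + offset, end + offset)


if __name__ == "__main__":
    print(lift(120))            # position inside the second block
    print(remap_read(110, 140)) # read fully inside the second block
    print(remap_read(90, 110))  # read spanning two blocks
```

The binary search over block starts keeps each lookup logarithmic in the number of blocks. How reads that span several blocks are handled is a design choice this sketch leaves out by treating them as unmapped.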

Similar articles

1. FastRemap: a tool for quickly remapping reads between genome assemblies.
   Bioinformatics. 2022 Sep 30;38(19):4633-4635. doi: 10.1093/bioinformatics/btac554.
2. CrossMap: a versatile tool for coordinate conversion between genome assemblies.
   Bioinformatics. 2014 Apr 1;30(7):1006-7. doi: 10.1093/bioinformatics/btt730. Epub 2013 Dec 18.
3. SQUAT: a Sequencing Quality Assessment Tool for data quality assessments of genome assemblies.
   BMC Genomics. 2019 Apr 18;19(Suppl 9):238. doi: 10.1186/s12864-019-5445-3.
4. HiCLift: a fast and efficient tool for converting chromatin interaction data between genome assemblies.
   Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad389.
5. Plassembler: an automated bacterial plasmid assembly tool.
   Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad409.
6. Nubeam-dedup: a fast and RAM-efficient tool to de-duplicate sequencing reads without mapping.
   Bioinformatics. 2020 May 1;36(10):3254-3256. doi: 10.1093/bioinformatics/btaa112.
7. Subset selection of high-depth next generation sequencing reads for de novo genome assembly using MapReduce framework.
   BMC Genomics. 2015;16 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2164-16-S12-S9. Epub 2015 Dec 9.
8. NucBreak: location of structural errors in a genome assembly by using paired-end Illumina reads.
   BMC Bioinformatics. 2020 Feb 21;21(1):66. doi: 10.1186/s12859-020-3414-0.
9. WGA-LP: a pipeline for whole genome assembly of contaminated reads.
   Bioinformatics. 2022 Jan 12;38(3):846-848. doi: 10.1093/bioinformatics/btab719.
10. Ragout-a reference-assisted assembly tool for bacterial genomes.
    Bioinformatics. 2014 Jun 15;30(12):i302-9. doi: 10.1093/bioinformatics/btu280.

Cited by

1. BLEND: a fast, memory-efficient and accurate mechanism to find fuzzy seed matches in genome analysis.
   NAR Genom Bioinform. 2023 Jan 20;5(1):lqad004. doi: 10.1093/nargab/lqad004. eCollection 2023 Mar.
2. From molecules to genomic variations: Accelerating genome analysis via intelligent algorithms and architectures.
   Comput Struct Biotechnol J. 2022 Aug 18;20:4579-4599. doi: 10.1016/j.csbj.2022.08.019. eCollection 2022.