• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

路径图:一种基于路径的长噪声读取高灵敏度映射工具。

pathMap: a path-based mapping tool for long noisy reads with high sensitivity.

机构信息

School of Physics and Opto-Electronics Technology, Baoji University of Arts and Sciences, Baoji, 721016, China.

Division of Biomedical Engineering, Department of Computer Science and Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada.

出版信息

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae107.

DOI:10.1093/bib/bbae107
PMID:38517696
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10959152/
Abstract

With the rapid development of single-molecule sequencing (SMS) technologies, the output read length is continuously increasing. Mapping such reads onto a reference genome is one of the most fundamental tasks in sequence analysis. Mapping sensitivity is becoming a major concern since high sensitivity can detect more aligned regions on the reference and obtain more aligned bases, which are useful for downstream analysis. In this study, we present pathMap, a novel k-mer graph-based mapper that is specifically designed for mapping SMS reads with high sensitivity. By viewing the alignment chain as a path containing as many anchors as possible in the matched k-mer graph, pathMap treats chaining as a path selection problem in the directed graph. pathMap iteratively searches the longest path in the remaining nodes; more candidate chains with high quality can be effectively detected and aligned. Compared to other state-of-the-art mapping methods such as minimap2 and Winnowmap2, experiment results on simulated and real-life datasets demonstrate that pathMap obtains the number of mapped chains at least 11.50% more than its closest competitor and increases the mapping sensitivity by 17.28% and 13.84% of bases over the next-best mapper for Pacific Biosciences and Oxford Nanopore sequencing data, respectively. In addition, pathMap is more robust to sequence errors and more sensitive to species- and strain-specific identification of pathogens using MinION reads.

摘要

随着单分子测序 (SMS) 技术的快速发展,输出的读取长度不断增加。将这些读取映射到参考基因组上是序列分析中最基本的任务之一。由于高灵敏度可以检测到参考基因组上更多的对齐区域并获得更多的对齐碱基,这对于下游分析很有用,因此映射灵敏度成为一个主要关注点。在这项研究中,我们提出了 pathMap,这是一种专门为高灵敏度映射 SMS 读取而设计的新型基于 k-mer 图的映射器。通过将对齐链视为在匹配的 k-mer 图中包含尽可能多锚点的路径,pathMap 将链处理为有向图中的路径选择问题。pathMap 迭代地搜索剩余节点中的最长路径;可以有效地检测和对齐更多具有高质量的候选链。与 minimap2 和 Winnowmap2 等其他最先进的映射方法相比,在模拟和真实数据集上的实验结果表明,pathMap 获得的映射链数量至少比其最接近的竞争对手多 11.50%,并且分别将 Pacific Biosciences 和 Oxford Nanopore 测序数据的映射灵敏度提高了 17.28%和 13.84%。此外,pathMap 对序列错误更稳健,并且对 MinION 读取中病原体的种和菌株特异性识别更敏感。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/ab585acaa7d9/bbae107f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/548c96171955/bbae107f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/8c3c4482b2af/bbae107f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/55a284ecb4dd/bbae107f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/ab585acaa7d9/bbae107f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/548c96171955/bbae107f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/8c3c4482b2af/bbae107f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/55a284ecb4dd/bbae107f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f687/10959152/ab585acaa7d9/bbae107f4.jpg

相似文献

1
pathMap: a path-based mapping tool for long noisy reads with high sensitivity.路径图:一种基于路径的长噪声读取高灵敏度映射工具。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae107.
2
kngMap: Sensitive and Fast Mapping Algorithm for Noisy Long Reads Based on the -Mer Neighborhood Graph.kngMap:基于-mer邻域图的针对噪声长读段的灵敏且快速的映射算法
Front Genet. 2022 May 5;13:890651. doi: 10.3389/fgene.2022.890651. eCollection 2022.
3
smsMap: mapping single molecule sequencing reads by locating the alignment starting positions.smsMap:通过定位比对起始位置来对单分子测序reads 进行映射。
BMC Bioinformatics. 2020 Aug 4;21(1):341. doi: 10.1186/s12859-020-03698-w.
4
Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.利用直系同源序列变异进行敏感比对可提高大片段重复区域的长读长序列比对和变异calling 效率。
Nucleic Acids Res. 2020 Nov 4;48(19):e114. doi: 10.1093/nar/gkaa829.
5
invMap: a sensitive mapping tool for long noisy reads with inversion structural variants.invMap:一种用于具有反转结构变体的长噪声读取的敏感映射工具。
Bioinformatics. 2023 Dec 1;39(12). doi: 10.1093/bioinformatics/btad726.
6
lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.lordFAST:用于长噪声测序数据的敏感快速比对搜索工具。
Bioinformatics. 2019 Jan 1;35(1):20-27. doi: 10.1093/bioinformatics/bty544.
7
A hybrid and scalable error correction algorithm for indel and substitution errors of long reads.一种用于长读段插入/缺失和替换错误的混合可扩展纠错算法。
BMC Genomics. 2019 Dec 20;20(Suppl 11):948. doi: 10.1186/s12864-019-6286-9.
8
Improving the sensitivity of long read overlap detection using grouped short k-mer matches.利用分组短 k-mer 匹配提高长读重叠检测的灵敏度。
BMC Genomics. 2019 Apr 4;20(Suppl 2):190. doi: 10.1186/s12864-019-5475-x.
9
S-conLSH: alignment-free gapped mapping of noisy long reads.S-conLSH:无比对的含噪长读段映射
BMC Bioinformatics. 2021 Feb 11;22(1):64. doi: 10.1186/s12859-020-03918-3.
10
Fast and sensitive mapping of nanopore sequencing reads with GraphMap.使用GraphMap对纳米孔测序读数进行快速灵敏的映射
Nat Commun. 2016 Apr 15;7:11307. doi: 10.1038/ncomms11307.

本文引用的文献

1
A survey of mapping algorithms in the long-reads era.长读时代的图谱算法研究综述。
Genome Biol. 2023 Jun 1;24(1):133. doi: 10.1186/s13059-023-02972-3.
2
A Two-Branch Neural Network for Short-Axis PET Image Quality Enhancement.用于短轴 PET 图像质量增强的双分支神经网络。
IEEE J Biomed Health Inform. 2023 Jun;27(6):2864-2875. doi: 10.1109/JBHI.2023.3260180. Epub 2023 Jun 5.
3
Comparison of Methods for Biological Sequence Clustering.生物序列聚类方法的比较
IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2874-2888. doi: 10.1109/TCBB.2023.3253138. Epub 2023 Oct 9.
4
kngMap: Sensitive and Fast Mapping Algorithm for Noisy Long Reads Based on the -Mer Neighborhood Graph.kngMap:基于-mer邻域图的针对噪声长读段的灵敏且快速的映射算法
Front Genet. 2022 May 5;13:890651. doi: 10.3389/fgene.2022.890651. eCollection 2022.
5
Long-read mapping to repetitive reference sequences using Winnowmap2.使用Winnowmap2将长读段映射到重复参考序列。
Nat Methods. 2022 Jun;19(6):705-710. doi: 10.1038/s41592-022-01457-8. Epub 2022 Apr 1.
6
A survey on the algorithm and development of multiple sequence alignment.多序列比对算法与发展研究综述。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac069.
7
A novel fast multiple nucleotide sequence alignment method based on FM-index.基于 FM-index 的新型快速多核苷酸序列比对方法。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab519.
8
Technology dictates algorithms: recent developments in read alignment.技术决定算法:读段比对的最新进展。
Genome Biol. 2021 Aug 26;22(1):249. doi: 10.1186/s13059-021-02443-7.
9
Genome Sequence Resource of from China Using Nanopore Sequencing Technology.中国利用纳米孔测序技术获得的 基因组序列资源。
Plant Dis. 2021 Dec;105(12):4141-4145. doi: 10.1094/PDIS-11-20-2327-A. Epub 2021 Dec 3.
10
Comparison of Methods for Picking the Operational Taxonomic Units From Amplicon Sequences.从扩增子序列中挑选操作分类单元的方法比较
Front Microbiol. 2021 Mar 24;12:644012. doi: 10.3389/fmicb.2021.644012. eCollection 2021.