• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估基于图的读映射器相对于基线方法,可以突出当前方法的优缺点。

Assessing graph-based read mappers against a baseline approach highlights strengths and weaknesses of current methods.

机构信息

Department of informatics, University of Oslo, Gaustadalleen 23 B, Oslo, 0371, Norway.

Department of Mathematics, University of Oslo, Moltke Moes vei 35, Oslo, 0851, Norway.

出版信息

BMC Genomics. 2020 Apr 6;21(1):282. doi: 10.1186/s12864-020-6685-y.

DOI:10.1186/s12864-020-6685-y
PMID:32252628
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7132971/
Abstract

BACKGROUND

Graph-based reference genomes have become popular as they allow read mapping and follow-up analyses in settings where the exact haplotypes underlying a high-throughput sequencing experiment are not precisely known. Two recent papers show that mapping to graph-based reference genomes can improve accuracy as compared to methods using linear references. Both of these methods index the sequences for most paths up to a certain length in the graph in order to enable direct mapping of reads containing common variants. However, the combinatorial explosion of possible paths through nearby variants also leads to a huge search space and an increased chance of false positive alignments to highly variable regions.

RESULTS

We here assess three prominent graph-based read mappers against a hybrid baseline approach that combines an initial path determination with a tuned linear read mapping method. We show, using a previously proposed benchmark, that this simple approach is able to improve overall accuracy of read-mapping to graph-based reference genomes.

CONCLUSIONS

Our method is implemented in a tool Two-step Graph Mapper, which is available at https://github.com/uio-bmi/two_step_graph_mapperalong with data and scripts for reproducing the experiments. Our method highlights characteristics of the current generation of graph-based read mappers and shows potential for improvement for future graph-based read mappers.

摘要

背景

基于图的参考基因组已经变得流行,因为它们允许在无法准确了解高通量测序实验背后的确切单倍型的情况下进行读映射和后续分析。最近的两篇论文表明,与使用线性参考的方法相比,映射到基于图的参考基因组可以提高准确性。这两种方法都对图中特定长度内的大多数路径的序列进行索引,以便能够直接映射包含常见变体的读段。然而,通过附近变体的可能路径的组合爆炸也导致了巨大的搜索空间和高变异性区域的假阳性比对的机会增加。

结果

我们在这里评估了三种著名的基于图的读映射器与混合基线方法的对比,该混合基线方法结合了初始路径确定和经过调整的线性读映射方法。我们使用之前提出的基准测试表明,这种简单的方法能够提高基于图的参考基因组的读映射整体准确性。

结论

我们的方法在 Two-step Graph Mapper 工具中实现,该工具可在 https://github.com/uio-bmi/two_step_graph_mapper 上获得,同时还提供用于重现实验的数据和脚本。我们的方法突出了当前一代基于图的读映射器的特点,并展示了未来基于图的读映射器改进的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/09583fe081fa/12864_2020_6685_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/79f4429295e8/12864_2020_6685_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/43e345d360a5/12864_2020_6685_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/293cf2e1b876/12864_2020_6685_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/a6284df04f47/12864_2020_6685_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/e60832fbfa40/12864_2020_6685_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/e840e2435245/12864_2020_6685_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/09583fe081fa/12864_2020_6685_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/79f4429295e8/12864_2020_6685_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/43e345d360a5/12864_2020_6685_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/293cf2e1b876/12864_2020_6685_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/a6284df04f47/12864_2020_6685_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/e60832fbfa40/12864_2020_6685_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/e840e2435245/12864_2020_6685_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80b7/7132971/09583fe081fa/12864_2020_6685_Fig7_HTML.jpg

相似文献

1
Assessing graph-based read mappers against a baseline approach highlights strengths and weaknesses of current methods.评估基于图的读映射器相对于基线方法,可以突出当前方法的优缺点。
BMC Genomics. 2020 Apr 6;21(1):282. doi: 10.1186/s12864-020-6685-y.
2
Assessing the impact of exact reads on reducing the error rate of read mapping.评估精确读取对降低读取映射错误率的影响。
BMC Bioinformatics. 2018 Nov 6;19(1):406. doi: 10.1186/s12859-018-2432-7.
3
Fast and sensitive mapping of nanopore sequencing reads with GraphMap.使用GraphMap对纳米孔测序读数进行快速灵敏的映射
Nat Commun. 2016 Apr 15;7:11307. doi: 10.1038/ncomms11307.
4
Improving read mapping using additional prefix grams.利用附加前缀词提高读段匹配度。
BMC Bioinformatics. 2014 Feb 5;15:42. doi: 10.1186/1471-2105-15-42.
5
Comparison of mapping algorithms used in high-throughput sequencing: application to Ion Torrent data.高通量测序中使用的映射算法比较:应用于Ion Torrent数据
BMC Genomics. 2014 Apr 5;15:264. doi: 10.1186/1471-2164-15-264.
6
Fast and SNP-aware short read alignment with SALT.基于 SALT 的快速 SNP 感知短读序列比对。
BMC Bioinformatics. 2021 Aug 25;22(Suppl 9):172. doi: 10.1186/s12859-021-04088-6.
7
NovoGraph: Human genome graph construction from multiple long-read assemblies.NovoGraph:基于多个长读长组装构建人类基因组图谱。
F1000Res. 2018 Sep 3;7:1391. doi: 10.12688/f1000research.15895.2. eCollection 2018.
8
Vulcan: Improved long-read mapping and structural variant calling via dual-mode alignment.火神:通过双模比对提高长读段比对和结构变异 calling。
Gigascience. 2021 Sep 24;10(9). doi: 10.1093/gigascience/giab063.
9
Systematic benchmark of ancient DNA read mapping.系统评估古 DNA 读段映射。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab076.
10
Efficient short read mapping to a pangenome that is represented by a graph of ED strings.高效的短读映射到由 ED 字符串图表示的泛基因组。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad320.

引用本文的文献

1
Gastric cancer genomics study using reference human pangenomes.使用参考人类泛基因组的胃癌基因组学研究。
Life Sci Alliance. 2025 Jan 27;8(4). doi: 10.26508/lsa.202402977. Print 2025 Apr.
2
Applications of advanced technologies for detecting genomic structural variation.先进技术在检测基因组结构变异中的应用。
Mutat Res Rev Mutat Res. 2023 Jul-Dec;792:108475. doi: 10.1016/j.mrrev.2023.108475. Epub 2023 Nov 4.
3
Inherited and De Novo Variation in Lithuanian Genomes: Introduction to the Analysis of the Generational Shift.

本文引用的文献

1
CHOP: haplotype-aware path indexing in population graphs.CHOP:群体图中的单倍型感知路径索引。
Genome Biol. 2020 Mar 11;21(1):65. doi: 10.1186/s13059-020-01963-y.
2
Genotyping structural variants in pangenome graphs using the vg toolkit.使用vg工具包对泛基因组图谱中的结构变异进行基因分型。
Genome Biol. 2020 Feb 12;21(1):35. doi: 10.1186/s13059-020-1941-7.
3
Pan-genomics in the human genome era.人类基因组时代的泛基因组学。
立陶宛基因组中的遗传和新生变异:代际转变分析简介。
Genes (Basel). 2022 Mar 23;13(4):569. doi: 10.3390/genes13040569.
4
Methods and Developments in Graphical Pangenomics.图形泛基因组学的方法与进展
J Indian Inst Sci. 2021;101(3):485-498. doi: 10.1007/s41745-021-00255-z. Epub 2021 Aug 24.
5
How the pan-genome is changing crop genomics and improvement.泛基因组如何改变作物基因组学和改良。
Genome Biol. 2021 Jan 4;22(1):3. doi: 10.1186/s13059-020-02224-8.
6
Reference flow: reducing reference bias using multiple population genomes.参考文献流向:利用多个群体基因组减少参考文献偏差。
Genome Biol. 2021 Jan 4;22(1):8. doi: 10.1186/s13059-020-02229-3.
7
Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.牛种特异性增强参考图谱有助于准确的序列读取映射和无偏的变异发现。
Genome Biol. 2020 Jul 27;21(1):184. doi: 10.1186/s13059-020-02105-0.
8
Pan-genomics in the human genome era.人类基因组时代的泛基因组学。
Nat Rev Genet. 2020 Apr;21(4):243-254. doi: 10.1038/s41576-020-0210-7. Epub 2020 Feb 7.
Nat Rev Genet. 2020 Apr;21(4):243-254. doi: 10.1038/s41576-020-0210-7. Epub 2020 Feb 7.
4
Paragraph: a graph-based structural variant genotyper for short-read sequence data.段落:基于图的短读序列数据结构变异基因分型器。
Genome Biol. 2019 Dec 19;20(1):291. doi: 10.1186/s13059-019-1909-7.
5
GraphTyper2 enables population-scale genotyping of structural variation using pangenome graphs.GraphTyper2 可使用泛基因组图谱实现基于群体的结构变异基因分型。
Nat Commun. 2019 Nov 27;10(1):5402. doi: 10.1038/s41467-019-13341-9.
6
Is it time to change the reference genome?是否到了改变参考基因组的时候了?
Genome Biol. 2019 Aug 9;20(1):159. doi: 10.1186/s13059-019-1774-4.
7
Graph Peak Caller: Calling ChIP-seq peaks on graph-based reference genomes.图峰调用器:基于图的参考基因组上的 ChIP-seq 峰调用。
PLoS Comput Biol. 2019 Feb 19;15(2):e1006731. doi: 10.1371/journal.pcbi.1006731. eCollection 2019 Feb.
8
Fast and accurate genomic analyses using genome graphs.利用基因组图谱进行快速准确的基因组分析。
Nat Genet. 2019 Feb;51(2):354-362. doi: 10.1038/s41588-018-0316-4. Epub 2019 Jan 14.
9
FORGe: prioritizing variants for graph genomes.FORGe:对图基因组中的变体进行优先级排序。
Genome Biol. 2018 Dec 17;19(1):220. doi: 10.1186/s13059-018-1595-x.
10
Variation graph toolkit improves read mapping by representing genetic variation in the reference.变异图谱工具包通过表示参考中的遗传变异来提高读映射质量。
Nat Biotechnol. 2018 Oct;36(9):875-879. doi: 10.1038/nbt.4227. Epub 2018 Aug 20.