• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

WGT:用于识别、可视化和生成惠勒图的工具与算法。

WGT: Tools and algorithms for recognizing, visualizing, and generating Wheeler graphs.

作者信息

Chao Kuan-Hao, Chen Pei-Wei, Seshia Sanjit A, Langmead Ben

机构信息

Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA.

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA 94720, USA.

出版信息

iScience. 2023 Jul 14;26(8):107402. doi: 10.1016/j.isci.2023.107402. eCollection 2023 Aug 18.

DOI:10.1016/j.isci.2023.107402
PMID:37575187
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10415921/
Abstract

A Wheeler graph represents a collection of strings in a way that is particularly easy to index and query. Such a graph is a practical choice for representing a graph-shaped pangenome, and it is the foundation for current graph-based pangenome indexes. However, there are no practical tools to visualize or to check graphs that may have the Wheeler properties. Here, we present Wheelie, an algorithm that combines a with a permutation solver (Wheelie-PR) or a Satisfiability Modulo Theory (SMT) solver (Wheelie-SMT) to check whether a given graph has the Wheeler properties, a problem that is NP-complete in general. Wheelie can check a variety of random and real-world graphs in far less time than any algorithm proposed to date. It can check a graph with 1,000s of nodes in seconds. We implement these algorithms together with complementary visualization tools in the WGT toolkit, available as open source software at https://github.com/Kuanhao-Chao/Wheeler_Graph_Toolkit.

摘要

惠勒图以一种特别易于索引和查询的方式表示字符串集合。这样的图是表示图状泛基因组的实际选择,并且是当前基于图的泛基因组索引的基础。然而,目前没有实用的工具来可视化或检查可能具有惠勒属性的图。在这里,我们提出了Wheelie算法,它结合了一个[此处原文缺失内容]与一个排列求解器(Wheelie-PR)或一个可满足性模理论(SMT)求解器(Wheelie-SMT),以检查给定的图是否具有惠勒属性,该问题一般来说是NP完全问题。Wheelie能够在比迄今为止提出的任何算法都少得多的时间内检查各种随机和现实世界的图。它可以在几秒钟内检查一个具有数千个节点的图。我们将这些算法与互补的可视化工具一起实现在WGT工具包中,该工具包可作为开源软件在https://github.com/Kuanhao-Chao/Wheeler_Graph_Toolkit上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/a8dfb1f2cf6d/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/1640fe9c414c/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/8ea067dbd6ff/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/c4870cf3af1c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/d24633acd4df/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/494bad97675b/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/f6e42d2ecd30/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/9f7ce263949d/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/a8dfb1f2cf6d/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/1640fe9c414c/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/8ea067dbd6ff/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/c4870cf3af1c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/d24633acd4df/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/494bad97675b/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/f6e42d2ecd30/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/9f7ce263949d/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8287/10415921/a8dfb1f2cf6d/gr7.jpg

相似文献

1
WGT: Tools and algorithms for recognizing, visualizing, and generating Wheeler graphs.WGT:用于识别、可视化和生成惠勒图的工具与算法。
iScience. 2023 Jul 14;26(8):107402. doi: 10.1016/j.isci.2023.107402. eCollection 2023 Aug 18.
2
Haplotype-aware graph indexes.单体型感知图索引。
Bioinformatics. 2020 Jan 15;36(2):400-407. doi: 10.1093/bioinformatics/btz575.
3
Unbiased pangenome graphs.无偏泛基因组图。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac743.
4
ODGI: understanding pangenome graphs.ODGI:理解泛基因组图谱。
Bioinformatics. 2022 Jun 27;38(13):3319-3326. doi: 10.1093/bioinformatics/btac308.
5
Pangenome graph layout by Path-Guided Stochastic Gradient Descent.基于路径引导随机梯度下降的泛基因组图谱布局。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae363.
6
Co-linear chaining on pangenome graphs.泛基因组图谱上的共线性连锁
Algorithms Mol Biol. 2024 Jan 27;19(1):4. doi: 10.1186/s13015-024-00250-w.
7
Efficient short read mapping to a pangenome that is represented by a graph of ED strings.高效的短读映射到由 ED 字符串图表示的泛基因组。
Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad320.
8
Investigating Pangenome Graphs Using Wheat Panache.利用小麦全景图研究泛基因组图谱
Methods Mol Biol. 2023;2703:23-29. doi: 10.1007/978-1-0716-3389-2_2.
9
Comparing methods for constructing and representing human pangenome graphs.比较构建和表示人类泛基因组图的方法。
Genome Biol. 2023 Nov 30;24(1):274. doi: 10.1186/s13059-023-03098-2.
10
Pangenome graph layout by Path-Guided Stochastic Gradient Descent.基于路径引导随机梯度下降的泛基因组图布局
bioRxiv. 2023 Oct 17:2023.09.22.558964. doi: 10.1101/2023.09.22.558964.

引用本文的文献

1
GIN-TONIC: non-hierarchical full-text indexing for graph genomes.GIN-TONIC:用于图基因组的非分层全文索引
NAR Genom Bioinform. 2024 Dec 11;6(4):lqae159. doi: 10.1093/nargab/lqae159. eCollection 2024 Dec.

本文引用的文献

1
Pangenomics enables genotyping of known structural variants in 5202 diverse genomes.泛基因组学能够对 5202 个不同基因组中的已知结构变异进行基因分型。
Science. 2021 Dec 17;374(6574):abg8871. doi: 10.1126/science.abg8871.
2
Ensembl 2022.Ensembl 2022.
Nucleic Acids Res. 2022 Jan 7;50(D1):D988-D995. doi: 10.1093/nar/gkab1049.
3
Prefix-free parsing for building big BWTs.用于构建大型Burrows-Wheeler变换(BWT)的无前缀解析
Algorithms Mol Biol. 2019 May 24;14:13. doi: 10.1186/s13015-019-0148-5. eCollection 2019.
4
Variation graph toolkit improves read mapping by representing genetic variation in the reference.变异图谱工具包通过表示参考中的遗传变异来提高读映射质量。
Nat Biotechnol. 2018 Oct;36(9):875-879. doi: 10.1038/nbt.4227. Epub 2018 Aug 20.
5
Wheeler graphs: A framework for BWT-based data structures.惠勒图:基于Burrows-Wheeler变换的数据结构框架。
Theor Comput Sci. 2017 Oct 25;698:67-78. doi: 10.1016/j.tcs.2017.06.016.
6
Succinct colored de Bruijn graphs.简明彩色 de Bruijn 图。
Bioinformatics. 2017 Oct 15;33(20):3181-3187. doi: 10.1093/bioinformatics/btx067.
7
Indexing Graphs for Path Queries with Applications in Genome Research.用于路径查询的图索引及其在基因组研究中的应用
IEEE/ACM Trans Comput Biol Bioinform. 2014 Mar-Apr;11(2):375-88. doi: 10.1109/TCBB.2013.2297101.
8
HISAT: a fast spliced aligner with low memory requirements.HISAT:一种内存需求低的快速剪接比对器。
Nat Methods. 2015 Apr;12(4):357-60. doi: 10.1038/nmeth.3317. Epub 2015 Mar 9.