• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

格蕾特变体图形评估工具包

Gretl-variation GRaph Evaluation TooLkit.

作者信息

Vorbrugg Sebastian, Bezrukov Ilja, Bao Zhigui, Weigel Detlef

机构信息

Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany.

出版信息

Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae755.

DOI:10.1093/bioinformatics/btae755
PMID:39719064
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11729725/
Abstract

MOTIVATION

As genome graphs are powerful data structures for representing the genetic diversity within populations, they can help identify genomic variations that traditional linear references miss, but their complexity and size makes the analysis of genome graphs challenging. We sought to develop a genome graph analysis tool that helps these analyses to become more accessible by addressing the limitations of existing tools. Specifically, we improve scalability and user-friendliness, and we provide many new statistics tailored to variation graphs for graph evaluation, including sample-specific features.

RESULTS

We developed an efficient, comprehensive, and integrated tool, gretl, to analyze genome graphs and gain insights into their structure and composition by providing a wide range of statistics. gretl can be utilized to evaluate different graphs, compare the output of graph construction pipelines with different parameters, as well as perform an in-depth analysis of individual graphs, including sample-specific analysis. With the assistance of gretl, novel patterns of genetic variation and potential regions of interest can be identified, for later, more detailed inspection. We demonstrate that gretl outperforms other tools in terms of speed, particularly for larger genome graphs.

AVAILABILITY AND IMPLEMENTATION

Commented Rust source code and documentation is available under MIT license at https://github.com/MoinSebi/gretl together with Python scripts and step-by-step usage examples. The package is available at Bioconda for easy installation.

摘要

动机

由于基因组图是用于表示群体内遗传多样性的强大数据结构,它们有助于识别传统线性参考遗漏的基因组变异,但其复杂性和规模使得基因组图的分析具有挑战性。我们试图开发一种基因组图分析工具,通过解决现有工具的局限性,使这些分析更容易进行。具体而言,我们提高了可扩展性和用户友好性,并提供了许多针对变异图量身定制的新统计数据,用于图评估,包括样本特异性特征。

结果

我们开发了一个高效、全面且集成的工具gretl,通过提供广泛的统计数据来分析基因组图,并深入了解其结构和组成。gretl可用于评估不同的图,比较具有不同参数的图构建管道的输出,以及对单个图进行深入分析,包括样本特异性分析。在gretl的帮助下,可以识别新的遗传变异模式和潜在的感兴趣区域,以便稍后进行更详细的检查。我们证明,gretl在速度方面优于其他工具,特别是对于更大的基因组图。

可用性和实现方式

带有注释的Rust源代码和文档在MIT许可下可从https://github.com/MoinSebi/gretl获得,同时还有Python脚本和逐步使用示例。该软件包可在Bioconda上获取,便于安装。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eba/11729725/f400b0bd0e1f/btae755f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eba/11729725/f400b0bd0e1f/btae755f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eba/11729725/f400b0bd0e1f/btae755f1.jpg

相似文献

1
Gretl-variation GRaph Evaluation TooLkit.格蕾特变体图形评估工具包
Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btae755.
2
Unbiased pangenome graphs.无偏泛基因组图。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac743.
3
ODGI: understanding pangenome graphs.ODGI:理解泛基因组图谱。
Bioinformatics. 2022 Jun 27;38(13):3319-3326. doi: 10.1093/bioinformatics/btac308.
4
Efficient dynamic variation graphs.高效动态变化图。
Bioinformatics. 2021 Jan 29;36(21):5139-5144. doi: 10.1093/bioinformatics/btaa640.
5
Panacus: fast and exact pangenome growth and core size estimation.人参属:快速准确的泛基因组增长及核心大小估计
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae720.
6
BubbleGun: enumerating bubbles and superbubbles in genome graphs.BubbleGun:基因组图中的泡和超泡枚举。
Bioinformatics. 2022 Sep 2;38(17):4217-4219. doi: 10.1093/bioinformatics/btac448.
7
GraphAligner: rapid and versatile sequence-to-graph alignment.GraphAligner:快速且通用的序列到图的对齐方法。
Genome Biol. 2020 Sep 24;21(1):253. doi: 10.1186/s13059-020-02157-2.
8
CoMut: visualizing integrated molecular information with comutation plots.CoMut:通过共突变图可视化整合的分子信息。
Bioinformatics. 2020 Aug 1;36(15):4348-4349. doi: 10.1093/bioinformatics/btaa554.
9
Pangenome graph layout by Path-Guided Stochastic Gradient Descent.基于路径引导随机梯度下降的泛基因组图谱布局。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae363.
10
MoMI-G: modular multi-scale integrated genome graph browser.MoMI-G:模块化多尺度综合基因组图谱浏览器。
BMC Bioinformatics. 2019 Nov 5;20(1):548. doi: 10.1186/s12859-019-3145-2.

本文引用的文献

1
Taurine pangenome uncovers a segmental duplication upstream of associated with depigmentation in white-headed cattle.牛磺酸泛基因组揭示了与白头牛色素脱失相关的上游片段重复。
Genome Res. 2025 Apr 14;35(4):1041-1052. doi: 10.1101/gr.279064.124.
2
Building pangenome graphs.构建泛基因组图谱。
Nat Methods. 2024 Nov;21(11):2008-2012. doi: 10.1038/s41592-024-02430-3. Epub 2024 Oct 21.
3
Pangenome graph layout by Path-Guided Stochastic Gradient Descent.基于路径引导随机梯度下降的泛基因组图谱布局。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae363.
4
Telomere-to-telomere assemblies of 142 strains characterize the genome structural landscape in Saccharomyces cerevisiae.142 株酿酒酵母的端粒到端粒组装描绘了基因组结构景观。
Nat Genet. 2023 Aug;55(8):1390-1399. doi: 10.1038/s41588-023-01459-y. Epub 2023 Jul 31.
5
Graph construction method impacts variation representation and analyses in a bovine super-pangenome.图构建方法影响牛超级泛基因组中的变异表示和分析。
Genome Biol. 2023 May 22;24(1):124. doi: 10.1186/s13059-023-02969-y.
6
Cycles of satellite and transposon evolution in Arabidopsis centromeres.拟南芥着丝粒卫星和转座子的演化循环。
Nature. 2023 Jun;618(7965):557-565. doi: 10.1038/s41586-023-06062-z. Epub 2023 May 17.
7
A draft human pangenome reference.人类泛基因组参考草图。
Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.
8
Pangenome graph construction from genome alignments with Minigraph-Cactus.基于 Minigraph-Cactus 的基因组比对构建泛基因组图谱。
Nat Biotechnol. 2024 Apr;42(4):663-673. doi: 10.1038/s41587-023-01793-w. Epub 2023 May 10.
9
A super pan-genomic landscape of rice.水稻的超级泛基因组景观。
Cell Res. 2022 Oct;32(10):878-896. doi: 10.1038/s41422-022-00685-z. Epub 2022 Jul 12.
10
Gfastats: conversion, evaluation and manipulation of genome sequences using assembly graphs.Gfastats:使用组装图转换、评估和操作基因组序列。
Bioinformatics. 2022 Sep 2;38(17):4214-4216. doi: 10.1093/bioinformatics/btac460.