• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

KNeXT:一种基于NetworkX的与拓扑相关的KEGG解析器。

KNeXT: a NetworkX-based topologically relevant KEGG parser.

作者信息

Castaneda Everest Uriel, Baker Erich J

机构信息

Department of Biology, Baylor University, Waco, TX, United States.

School of Engineering and Computer Science, Baylor University, Waco, TX, United States.

出版信息

Front Genet. 2024 Feb 13;15:1292394. doi: 10.3389/fgene.2024.1292394. eCollection 2024.

DOI:10.3389/fgene.2024.1292394
PMID:38415058
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10896898/
Abstract

Automating the recreation of gene and mixed gene-compound networks from Kyoto Encyclopedia of Genes and Genomes (KEGG) Markup Language (KGML) files is challenging because the data structure does not preserve the independent or loosely connected neighborhoods in which they were originally derived, referred to here as its topological environment. Identical accession numbers may overlap, causing neighborhoods to artificially collapse based on duplicated identifiers. This causes current parsers to create misleading or erroneous graphical representations when mixed gene networks are converted to gene-only networks. To overcome these challenges we created a python-based KEGG NetworkX Topological (KNeXT) parser that allows users to accurately recapitulate genetic networks and mixed networks from KGML map data. The software, archived as a python package index (PyPI) file to ensure broad application, is designed to ingest KGML files through built-in APIs and dynamically create high-fidelity topological representations. The utilization of NetworkX's framework to generate tab-separated files additionally ensures that KNeXT results may be imported into other graph frameworks and maintain programmatic access to the original - axis positions to each node in the KEGG pathway. KNeXT is a well-described Python 3 package that allows users to rapidly download and aggregate specific KGML files and recreate KEGG pathways based on a range of user-defined settings. KNeXT is platform-independent, distinctive, and it is not written on top of other Python parsers. Furthermore, KNeXT enables users to parse entire local folders or single files through command line scripts and convert the output into NCBI or UniProt IDs. KNeXT provides an ability for researchers to generate pathway visualizations while persevering the original context of a KEGG pathway. Source code is freely available at https://github.com/everest-castaneda/knext.

摘要

从京都基因与基因组百科全书(KEGG)标记语言(KGML)文件中自动重建基因网络和混合基因-化合物网络具有挑战性,因为数据结构无法保留其最初派生时的独立或松散连接的邻域,在此称为其拓扑环境。相同的登录号可能会重叠,导致邻域基于重复的标识符而人为地合并。这使得当前的解析器在将混合基因网络转换为仅基因网络时会创建误导性或错误的图形表示。为了克服这些挑战,我们创建了一个基于Python的KEGG NetworkX拓扑(KNeXT)解析器,该解析器允许用户从KGML图谱数据中准确地重现遗传网络和混合网络。该软件作为Python包索引(PyPI)文件存档以确保广泛应用,旨在通过内置API摄取KGML文件并动态创建高保真拓扑表示。利用NetworkX框架生成制表符分隔的文件还可确保KNeXT结果可以导入到其他图形框架中,并保持对KEGG通路中每个节点的原始轴位置的编程访问。KNeXT是一个描述详尽的Python 3包,允许用户快速下载和汇总特定的KGML文件,并根据一系列用户定义的设置重新创建KEGG通路。KNeXT是独立于平台的,具有独特性,并且不是在其他Python解析器之上编写的。此外,KNeXT允许用户通过命令行脚本解析整个本地文件夹或单个文件,并将输出转换为NCBI或UniProt ID。KNeXT使研究人员能够生成通路可视化,同时保留KEGG通路的原始上下文。源代码可在https://github.com/everest-castaneda/knext上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/9486617fcc8e/fgene-15-1292394-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/2fc3e3c32a36/fgene-15-1292394-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/6250ed99d0a5/fgene-15-1292394-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/4e19aa43f5d1/fgene-15-1292394-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/586757e84f6c/fgene-15-1292394-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/9486617fcc8e/fgene-15-1292394-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/2fc3e3c32a36/fgene-15-1292394-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/6250ed99d0a5/fgene-15-1292394-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/4e19aa43f5d1/fgene-15-1292394-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/586757e84f6c/fgene-15-1292394-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fb9/10896898/9486617fcc8e/fgene-15-1292394-g005.jpg

相似文献

1
KNeXT: a NetworkX-based topologically relevant KEGG parser.KNeXT:一种基于NetworkX的与拓扑相关的KEGG解析器。
Front Genet. 2024 Feb 13;15:1292394. doi: 10.3389/fgene.2024.1292394. eCollection 2024.
2
A Python library for FAIRer access and deposition to the Metabolomics Workbench Data Repository.一个用于更公平地访问和存入代谢组学工作台数据存储库的Python库。
Metabolomics. 2018;14(5):64. doi: 10.1007/s11306-018-1356-6. Epub 2018 Apr 20.
3
KEGGgraph: a graph approach to KEGG PATHWAY in R and bioconductor.KEGGgraph:R语言和生物导体中KEGG通路的图形化方法
Bioinformatics. 2009 Jun 1;25(11):1470-1. doi: 10.1093/bioinformatics/btp167. Epub 2009 Mar 23.
4
kegg_pull: a software package for the RESTful access and pulling from the Kyoto Encyclopedia of Gene and Genomes.KEGG_PULL:一个用于通过 RESTful 访问和从京都基因与基因组百科全书(KEGG)中提取数据的软件包。
BMC Bioinformatics. 2023 Mar 4;24(1):78. doi: 10.1186/s12859-023-05208-0.
5
KEGGParser: parsing and editing KEGG pathway maps in Matlab.KEGGParser:在 Matlab 中解析和编辑 KEGG 途径图谱。
Bioinformatics. 2013 Feb 15;29(4):518-9. doi: 10.1093/bioinformatics/bts730. Epub 2013 Jan 3.
6
KEGGconverter: a tool for the in-silico modelling of metabolic networks of the KEGG Pathways database.KEGGconverter:用于 KEGG 通路数据库中代谢网络的计算机模拟的工具。
BMC Bioinformatics. 2009 Oct 8;10:324. doi: 10.1186/1471-2105-10-324.
7
NeuroPycon: An open-source python toolbox for fast multi-modal and reproducible brain connectivity pipelines.NeuroPycon:一个开源的 Python 工具包,用于快速进行多模态和可重复的脑连接管道。
Neuroimage. 2020 Oct 1;219:117020. doi: 10.1016/j.neuroimage.2020.117020. Epub 2020 Jun 6.
8
Dynamic exploration and editing of KEGG pathway diagrams.KEGG通路图的动态探索与编辑
Bioinformatics. 2007 Feb 1;23(3):344-50. doi: 10.1093/bioinformatics/btl611. Epub 2006 Dec 1.
9
KEGGtranslator: visualizing and converting the KEGG PATHWAY database to various formats.KEGGtranslator:可视化和转换 KEGG PATHWAY 数据库到各种格式。
Bioinformatics. 2011 Aug 15;27(16):2314-5. doi: 10.1093/bioinformatics/btr377. Epub 2011 Jun 23.
10
A fast and efficient python library for interfacing with the Biological Magnetic Resonance Data Bank.一个用于与生物磁共振数据库接口的快速高效的Python库。
BMC Bioinformatics. 2017 Mar 17;18(1):175. doi: 10.1186/s12859-017-1580-5.

引用本文的文献

1
Spectral divergence prioritizes key classes, genes, and pathways shared between substance use disorders and cardiovascular disease.光谱散度对物质使用障碍和心血管疾病之间共有的关键类别、基因和通路进行了优先排序。
Front Neurosci. 2025 Jul 22;19:1572243. doi: 10.3389/fnins.2025.1572243. eCollection 2025.
2
Influence of multi-species data on gene-disease associations in substance use disorder using random walk with restart models.使用带重启的随机游走模型的多物种数据对物质使用障碍中基因-疾病关联的影响
PLoS One. 2025 Jun 16;20(6):e0325201. doi: 10.1371/journal.pone.0325201. eCollection 2025.
3
Integrative single-cell and multi-omics analyses reveal ferroptosis-associated gene expression and immune microenvironment heterogeneity in gastric cancer.

本文引用的文献

1
ggkegg: analysis and visualization of KEGG data utilizing the grammar of graphics.ggkegg:利用图形语法分析和可视化 KEGG 数据。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad622.
2
Graph Autoencoder with Preserving Node Attribute Similarity.具有保留节点属性相似性的图自动编码器
Entropy (Basel). 2023 Mar 26;25(4):567. doi: 10.3390/e25040567.
3
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
整合单细胞和多组学分析揭示胃癌中铁死亡相关基因表达及免疫微环境异质性
Discov Oncol. 2025 Jan 17;16(1):57. doi: 10.1007/s12672-025-01798-8.
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
4
Risk stratification and pathway analysis based on graph neural network and interpretable algorithm.基于图神经网络和可解释算法的风险分层和路径分析。
BMC Bioinformatics. 2022 Sep 27;23(1):394. doi: 10.1186/s12859-022-04950-1.
5
Modularity-aware graph autoencoders for joint community detection and link prediction.模块感知图自动编码器用于联合社区检测和链路预测。
Neural Netw. 2022 Sep;153:474-495. doi: 10.1016/j.neunet.2022.06.021. Epub 2022 Jun 22.
6
Database resources of the national center for biotechnology information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D20-D26. doi: 10.1093/nar/gkab1112.
7
netgsa: Fast computation and interactive visualization for topology-based pathway enrichment analysis.netgsa:基于拓扑的通路富集分析的快速计算和交互式可视化。
PLoS Comput Biol. 2021 Jun 11;17(6):e1008979. doi: 10.1371/journal.pcbi.1008979. eCollection 2021 Jun.
8
KEGG2Net: Deducing gene interaction networks and acyclic graphs from KEGG pathways.KEGG2Net:从KEGG通路推导基因相互作用网络和无环图。
EMBnet J. 2021;26. doi: 10.14806/ej.26.0.949. Epub 2021 Mar 5.
9
Addressing uncertainty in genome-scale metabolic model reconstruction and analysis.解决基因组规模代谢模型重建与分析中的不确定性问题。
Genome Biol. 2021 Feb 18;22(1):64. doi: 10.1186/s13059-021-02289-z.
10
A strategy to incorporate prior knowledge into correlation network cutoff selection.将先验知识纳入相关网络截断选择的策略。
Nat Commun. 2020 Oct 14;11(1):5153. doi: 10.1038/s41467-020-18675-3.