• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于用户提交的多样化实验,对途径数据库进行系统评估。

Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments.

机构信息

Department of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland.

Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland.

出版信息

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac355.

DOI:10.1093/bib/bbac355
PMID:36088548
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9487593/
Abstract

A knowledge-based grouping of genes into pathways or functional units is essential for describing and understanding cellular complexity. However, it is not always clear a priori how and at what level of specificity functionally interconnected genes should be partitioned into pathways, for a given application. Here, we assess and compare nine existing and two conceptually novel functional classification systems, with respect to their discovery power and generality in gene set enrichment testing. We base our assessment on a collection of nearly 2000 functional genomics datasets provided by users of the STRING database. With these real-life and diverse queries, we assess which systems typically provide the most specific and complete enrichment results. We find many structural and performance differences between classification systems. Overall, the well-established, hierarchically organized pathway annotation systems yield the best enrichment performance, despite covering substantial parts of the human genome in general terms only. On the other hand, the more recent unsupervised annotation systems perform strongest in understudied areas and organisms, and in detecting more specific pathways, albeit with less informative labels.

摘要

基于知识的基因分组为途径或功能单元对于描述和理解细胞复杂性至关重要。然而,对于给定的应用程序,如何以及在什么特异性水平上将功能上相互关联的基因划分到途径中,这在事先并不总是清楚的。在这里,我们评估和比较了九种现有的和两种概念新颖的功能分类系统,就其在基因集富集测试中的发现能力和通用性而言。我们的评估基于 STRING 数据库用户提供的近 2000 个功能基因组数据集。通过这些现实生活和多样化的查询,我们评估了哪些系统通常提供最具体和完整的富集结果。我们发现分类系统之间存在许多结构和性能差异。总体而言,尽管通常仅从总体上涵盖人类基因组的大部分内容,但经过良好验证的层次组织途径注释系统可提供最佳的富集性能。另一方面,最近的无监督注释系统在研究较少的领域和生物体中表现最强,并能检测到更具体的途径,尽管标签的信息量较少。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/4e044ec03bbf/bbac355f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/936edf57eda1/bbac355f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/d0f4bc79c046/bbac355f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/34dd38561ee4/bbac355f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/ab2635dd988e/bbac355f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/4e044ec03bbf/bbac355f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/936edf57eda1/bbac355f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/d0f4bc79c046/bbac355f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/34dd38561ee4/bbac355f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/ab2635dd988e/bbac355f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5713/9487593/4e044ec03bbf/bbac355f5.jpg

相似文献

1
Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments.基于用户提交的多样化实验,对途径数据库进行系统评估。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac355.
2
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.
3
IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis.IPAD:系统富集分析的综合途径分析数据库。
BMC Bioinformatics. 2012;13 Suppl 15(Suppl 15):S7. doi: 10.1186/1471-2105-13-S15-S7. Epub 2012 Sep 11.
4
Algal Functional Annotation Tool: a web-based analysis suite to functionally interpret large gene lists using integrated annotation and expression data.藻种功能注释工具:一个基于网络的分析套件,可使用集成注释和表达数据来对大型基因列表进行功能解释。
BMC Bioinformatics. 2011 Jul 12;12:282. doi: 10.1186/1471-2105-12-282.
5
6
ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding.ODG:组学数据库生成器——一种用于生成、查询和分析多组学比较数据库以促进生物学理解的工具。
BMC Bioinformatics. 2017 Aug 10;18(1):367. doi: 10.1186/s12859-017-1777-7.
7
MILANO--custom annotation of microarray results using automatic literature searches.米兰——使用自动文献检索对微阵列结果进行定制注释。
BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12.
8
FunMappOne: a tool to hierarchically organize and visually navigate functional gene annotations in multiple experiments.FunMappOne:一种用于在多个实验中对功能基因注释进行层次组织和可视化导航的工具。
BMC Bioinformatics. 2019 Feb 15;20(1):79. doi: 10.1186/s12859-019-2639-2.
9
Visual annotation display (VLAD): a tool for finding functional themes in lists of genes.视觉注释显示(VLAD):一种在基因列表中寻找功能主题的工具。
Mamm Genome. 2015 Oct;26(9-10):567-73. doi: 10.1007/s00335-015-9570-2. Epub 2015 Jun 6.
10
TOA: A software package for automated functional annotation in non-model plant species.TOA:用于非模式植物物种自动功能注释的软件包。
Mol Ecol Resour. 2021 Feb;21(2):621-636. doi: 10.1111/1755-0998.13285. Epub 2020 Nov 18.

引用本文的文献

1
Pathway Analysis Interpretation in the Multi-Omic Era.多组学时代的通路分析解读
BioTech (Basel). 2025 Jul 29;14(3):58. doi: 10.3390/biotech14030058.
2
Spectral divergence prioritizes key classes, genes, and pathways shared between substance use disorders and cardiovascular disease.光谱散度对物质使用障碍和心血管疾病之间共有的关键类别、基因和通路进行了优先排序。
Front Neurosci. 2025 Jul 22;19:1572243. doi: 10.3389/fnins.2025.1572243. eCollection 2025.
3
Current Bioinformatics Tools in Precision Oncology.精准肿瘤学中的当前生物信息学工具

本文引用的文献

1
Pathway size matters: the influence of pathway granularity on over-representation (enrichment analysis) statistics.通路大小很重要:通路粒度对过代表(富集分析)统计的影响。
BMC Genomics. 2021 Mar 16;22(1):191. doi: 10.1186/s12864-021-07502-8.
2
PANTHER version 16: a revised family classification, tree-based classification tool, enhancer regions and extensive API.PANTHER 版本 16:修订后的家族分类、基于树的分类工具、增强子区域和广泛的 API。
Nucleic Acids Res. 2021 Jan 8;49(D1):D394-D403. doi: 10.1093/nar/gkaa1106.
3
The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.
MedComm (2020). 2025 Jul 9;6(7):e70243. doi: 10.1002/mco2.70243. eCollection 2025 Jul.
4
Transcriptomic profiling of pancreatic neuroendocrine tumors: dysregulation of WNT, MAPK, PI3K, neddylation pathways and potential non-invasive biomarkers.胰腺神经内分泌肿瘤的转录组分析:WNT、MAPK、PI3K、NEDDylation信号通路失调及潜在的非侵入性生物标志物
PLoS One. 2025 Jun 16;20(6):e0325672. doi: 10.1371/journal.pone.0325672. eCollection 2025.
5
Mapping growth differentiation factor-15 (GDF15)-mediated signaling pathways in cancer: insights into its role across different cancer types.绘制癌症中生长分化因子15(GDF15)介导的信号通路:洞察其在不同癌症类型中的作用
Discov Oncol. 2025 Mar 25;16(1):386. doi: 10.1007/s12672-025-02121-1.
6
A workflow for human health hazard evaluation using transcriptomic data and Key Characteristics-based gene sets.一种使用转录组数据和基于关键特征的基因集进行人类健康危害评估的工作流程。
Toxicol Sci. 2025 Jun 1;205(2):310-325. doi: 10.1093/toxsci/kfaf036.
7
Exploring Synaptic Pathways in Traumatic Brain Injury: A Cross-Phenotype Genomics Approach.探索创伤性脑损伤中的突触通路:一种跨表型基因组学方法。
J Neurotrauma. 2025 Jan;42(1-2):131-142. doi: 10.1089/neu.2024.0153. Epub 2024 Oct 7.
8
Benchmarking enrichment analysis methods with the disease pathway network.使用疾病通路网络对富集分析方法进行基准测试。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae069.
9
In Silico Models to Validate Novel Blood-Based Biomarkers.基于计算机的模型用于验证新型血液生物标志物。
Methods Mol Biol. 2024;2785:321-344. doi: 10.1007/978-1-0716-3774-6_20.
10
HGCA2.0: An RNA-Seq Based Webtool for Gene Coexpression Analysis in .HGCA2.0:基于 RNA-Seq 的. 基因共表达分析网络工具
Cells. 2023 Jan 21;12(3):388. doi: 10.3390/cells12030388.
2021 年的 STRING 数据库:可定制的蛋白质-蛋白质网络,以及用户上传的基因/测量集的功能特征分析。
Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.
4
WikiPathways: connecting communities.维基路径:连接社区。
Nucleic Acids Res. 2021 Jan 8;49(D1):D613-D621. doi: 10.1093/nar/gkaa1024.
5
Pathway information extracted from 25 years of pathway figures.从 25 年的通路图中提取的通路信息。
Genome Biol. 2020 Nov 9;21(1):273. doi: 10.1186/s13059-020-02181-2.
6
G-Protein-Coupled Receptor Expression and Purification.G 蛋白偶联受体的表达和纯化。
Methods Mol Biol. 2021;2178:439-467. doi: 10.1007/978-1-0716-0775-6_28.
7
Functionally Enigmatic Genes in Cancer: Using TCGA Data to Map the Limitations of Annotations.癌症中功能神秘的基因:利用 TCGA 数据绘制注释的局限性图谱。
Sci Rep. 2020 Mar 5;10(1):4106. doi: 10.1038/s41598-020-60456-x.
8
The reactome pathway knowledgebase.Reactome 通路知识库。
Nucleic Acids Res. 2020 Jan 8;48(D1):D498-D503. doi: 10.1093/nar/gkz1031.
9
Involving community in genes and pathway curation.让社区参与基因和途径的整理。
Database (Oxford). 2019 Jan 1;2019:bay146. doi: 10.1093/database/bay146.
10
STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.STRING v11:具有增强覆盖范围的蛋白质-蛋白质相互作用网络,支持在全基因组实验数据集的功能发现。
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613. doi: 10.1093/nar/gky1131.