• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

信号通路数据库的可用性:经验教训。

Signalling pathway database usability: lessons learned.

作者信息

Tieri Paolo, Nardini Christine

机构信息

Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Yue Yang Road 320, Shanghai, P. R. China.

出版信息

Mol Biosyst. 2013 Oct;9(10):2401-7. doi: 10.1039/c3mb70242a.

DOI:10.1039/c3mb70242a
PMID:23942525
Abstract

BACKGROUND

issues and limitations related to accessibility, understandability and ease of use of signalling pathway databases may hamper or divert research workflow, leading, in the worst case, to the generation of confusing reference frameworks and misinterpretation of experimental results. In an attempt to retrieve signalling pathway data related to a specific set of test genes, we queried and analysed the results from six of the major curated signalling pathway databases: Reactome, PathwayCommons, KEGG, InnateDB, PID, and Wikipathways.

FINDINGS

although we expected differences - often a desirable feature for the integration of each individual query, we observed variations of exceptional magnitude, with disproportionate quality and quantity of the results. Some of the more remarkable differences can be explained by the diverse conceptual designs and purposes of the databases, the types of data stored and the structure of the query, as well as by missing or erroneous descriptions of the search procedure. To go beyond the mere enumeration of these problems, we identified a number of operational features, in particular inner and cross coherence, which, once quantified, offer objective criteria to choose the best source of information.

CONCLUSIONS

in silico biology heavily relies on the information stored in databases. To ensure that computational biology mirrors biological reality and offers focused hypotheses to be experimentally validated, coherence of data codification is crucial and yet highly underestimated. We make practical recommendations for the end-user to cope with the current state of the databases as well as for the maintainers of those databases to contribute to the goal of the full enactment of the open data paradigm.

摘要

背景

信号通路数据库在可访问性、可理解性和易用性方面存在的问题与局限,可能会妨碍或扰乱研究流程,在最坏的情况下,会导致产生令人困惑的参考框架并对实验结果产生误解。为了检索与一组特定测试基因相关的信号通路数据,我们查询并分析了六个主要的经过整理的信号通路数据库的结果:Reactome、PathwayCommons、KEGG、InnateDB、PID和Wikipathways。

研究结果

尽管我们预期会存在差异——这通常是整合每个单独查询时所期望的特征,但我们观察到了异常巨大的变化,结果的质量和数量不成比例。一些更显著的差异可以通过数据库的不同概念设计和目的、所存储数据的类型、查询结构以及搜索过程中缺失或错误的描述来解释。为了不仅仅列举这些问题,我们确定了一些操作特征,特别是内部和交叉一致性,一旦对其进行量化,就可以提供选择最佳信息来源的客观标准。

结论

计算机生物学严重依赖于存储在数据库中的信息。为确保计算生物学反映生物学现实并提供有待实验验证的重点假设,数据编码的一致性至关重要,但却被严重低估。我们为终端用户应对数据库的当前状态提出了实际建议,也为这些数据库的维护者提出建议,以助力实现开放数据范式全面实施的目标。

相似文献

1
Signalling pathway database usability: lessons learned.信号通路数据库的可用性:经验教训。
Mol Biosyst. 2013 Oct;9(10):2401-7. doi: 10.1039/c3mb70242a.
2
IntPath--an integrated pathway gene relationship database for model organisms and important pathogens.IntPath——一个针对模式生物和重要病原体的综合通路基因关系数据库。
BMC Syst Biol. 2012;6 Suppl 2(Suppl 2):S2. doi: 10.1186/1752-0509-6-S2-S2. Epub 2012 Dec 12.
3
Development of SRS.php, a Simple Object Access Protocol-based library for data acquisition from integrated biological databases.SRS.php的开发,一个基于简单对象访问协议的用于从综合生物数据库获取数据的库。
Genet Mol Res. 2007 Dec 11;6(4):1142-50.
4
Molecular network analysis of diseases and drugs in KEGG.KEGG中疾病与药物的分子网络分析
Methods Mol Biol. 2013;939:263-75. doi: 10.1007/978-1-62703-107-3_17.
5
Quality controls in integrative approaches to detect errors and inconsistencies in biological databases.整合方法中的质量控制,用于检测生物数据库中的错误和不一致性。
J Integr Bioinform. 2010 Mar 25;7(3):454. doi: 10.2390/biecoll-jib-2010-119.
6
MIPS: analysis and annotation of proteins from whole genomes in 2005.MIPS:2005年全基因组蛋白质分析与注释
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D169-72. doi: 10.1093/nar/gkj148.
7
cPath: open source software for collecting, storing, and querying biological pathways.cPath:用于收集、存储和查询生物途径的开源软件。
BMC Bioinformatics. 2006 Nov 13;7:497. doi: 10.1186/1471-2105-7-497.
8
Better bioinformatics through usability analysis.通过可用性分析实现更好的生物信息学。
Bioinformatics. 2009 Feb 1;25(3):406-12. doi: 10.1093/bioinformatics/btn633. Epub 2008 Dec 9.
9
CARGO: a web portal to integrate customized biological information.CARGO:一个整合定制生物信息的网络门户。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W16-20. doi: 10.1093/nar/gkm280. Epub 2007 May 5.
10
JUICE: a data management system that facilitates the analysis of large volumes of information in an EST project workflow.JUICE:一个数据管理系统,可在EST项目工作流程中促进对大量信息的分析。
BMC Bioinformatics. 2006 Nov 23;7:513. doi: 10.1186/1471-2105-7-513.

引用本文的文献

1
Testing the Significance of Ranked Gene Sets in Genome-wide Transcriptome .测试全基因组转录组中排名基因集的显著性
Curr Genomics. 2024 May 31;25(3):202-211. doi: 10.2174/0113892029280470240306044159. Epub 2024 Mar 11.
2
Systematic assessment of pathway databases, based on a diverse collection of user-submitted experiments.基于用户提交的多样化实验,对途径数据库进行系统评估。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac355.
3
Application of text mining to develop AOP-based mucus hypersecretion genesets and confirmation with in vitro and clinical samples.
文本挖掘在基于 AOP 的黏液高分泌基因集开发中的应用及体外和临床样本的验证。
Sci Rep. 2021 Mar 17;11(1):6091. doi: 10.1038/s41598-021-85345-9.
4
The transcriptional response to oxidative stress is part of, but not sufficient for, insulin resistance in adipocytes.氧化应激的转录反应是脂肪细胞胰岛素抵抗的一部分,但不足以导致其发生。
Sci Rep. 2018 Jan 29;8(1):1774. doi: 10.1038/s41598-018-20104-x.
5
Spatio-Temporal Gene Expression Profiling during In Vivo Early Ovarian Folliculogenesis: Integrated Transcriptomic Study and Molecular Signature of Early Follicular Growth.体内早期卵泡发生过程中的时空基因表达谱分析:早期卵泡生长的综合转录组学研究与分子特征
PLoS One. 2015 Nov 5;10(11):e0141482. doi: 10.1371/journal.pone.0141482. eCollection 2015.
6
Interlog protein network: an evolutionary benchmark of protein interaction networks for the evaluation of clustering algorithms.互作蛋白网络:用于评估聚类算法的蛋白质相互作用网络的进化基准
BMC Bioinformatics. 2015 Oct 5;16:319. doi: 10.1186/s12859-015-0755-1.
7
Editorial: Multi-omic data integration.社论:多组学数据整合
Front Cell Dev Biol. 2015 Jul 7;3:46. doi: 10.3389/fcell.2015.00046. eCollection 2015.
8
Comparison of human cell signaling pathway databases--evolution, drawbacks and challenges.人类细胞信号通路数据库的比较——演变、缺点与挑战
Database (Oxford). 2015 Jan 28;2015. doi: 10.1093/database/bau126. Print 2015.
9
Multi-omic landscape of rheumatoid arthritis: re-evaluation of drug adverse effects.类风湿关节炎的多组学全景:药物不良反应的再评估。
Front Cell Dev Biol. 2014 Nov 4;2:59. doi: 10.3389/fcell.2014.00059. eCollection 2014.