Sting_RDB：一个用于蛋白质分析的结构参数关系数据库，支持数据仓库和数据挖掘。

Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.

作者信息

Oliveira S R M, Almeida G V, Souza K R R, Rodrigues D N, Kuser-Falcão P R, Yamagishi M E B, Santos E H, Vieira F D, Jardine J G, Neshich G

机构信息

Embrapa Informática Agropecuária, Campinas, SP, Brasil.

出版信息

Genet Mol Res. 2007 Oct 5;6(4):911-22.

PMID:18058712

Abstract

An effective strategy for managing protein databases is to provide mechanisms to transform raw data into consistent, accurate and reliable information. Such mechanisms will greatly reduce operational inefficiencies and improve one's ability to better handle scientific objectives and interpret the research results. To achieve this challenging goal for the STING project, we introduce Sting_RDB, a relational database of structural parameters for protein analysis with support for data warehousing and data mining. In this article, we highlight the main features of Sting_RDB and show how a user can explore it for efficient and biologically relevant queries. Considering its importance for molecular biologists, effort has been made to advance Sting_RDB toward data quality assessment. To the best of our knowledge, Sting_RDB is one of the most comprehensive data repositories for protein analysis, now also capable of providing its users with a data quality indicator. This paper differs from our previous study in many aspects. First, we introduce Sting_RDB, a relational database with mechanisms for efficient and relevant queries using SQL. Sting_rdb evolved from the earlier, text (flat file)-based database, in which data consistency and integrity was not guaranteed. Second, we provide support for data warehousing and mining. Third, the data quality indicator was introduced. Finally and probably most importantly, complex queries that could not be posed on a text-based database, are now easily implemented. Further details are accessible at the Sting_RDB demo web page: http://www.cbi.cnptia.embrapa.br/StingRDB.

摘要

管理蛋白质数据库的一个有效策略是提供将原始数据转化为一致、准确和可靠信息的机制。这样的机制将极大地减少操作效率低下的情况，并提高人们更好地处理科学目标和解释研究结果的能力。为了实现STING项目这一具有挑战性的目标，我们引入了Sting_RDB，这是一个用于蛋白质分析的结构参数关系数据库，支持数据仓库和数据挖掘。在本文中，我们突出了Sting_RDB的主要特性，并展示了用户如何对其进行探索以进行高效且与生物学相关的查询。考虑到它对分子生物学家的重要性，我们已努力推动Sting_RDB进行数据质量评估。据我们所知，Sting_RDB是蛋白质分析方面最全面的数据存储库之一，现在还能够为用户提供数据质量指标。本文在许多方面与我们之前的研究不同。首先，我们引入了Sting_RDB，这是一个具有使用SQL进行高效且相关查询机制的关系数据库。Sting_rdb是从早期基于文本（平面文件）的数据库发展而来的，在那个数据库中数据一致性和完整性无法得到保证。其次，我们提供了对数据仓库和挖掘的支持。第三，引入了数据质量指标。最后且可能最重要的是，现在可以轻松实现那些在基于文本的数据库上无法提出的复杂查询。更多详细信息可在Sting_RDB演示网页获取：http://www.cbi.cnptia.embrapa.br/StingRDB 。

相似文献

Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.

Genet Mol Res. 2007 Oct 5;6(4):911-22.

Electrostatic potential calculation for biomolecules--creating a database of pre-calculated values reported on a per residue basis for all PDB protein structures.

Genet Mol Res. 2007 Oct 5;6(4):923-36.

The Diamond STING server.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W29-35. doi: 10.1093/nar/gki397.

The Star STING server: a multiplatform environment for protein structure analysis.

Genet Mol Res. 2006 Dec 1;5(4):717-22.

BlotBase: a northern blot database.

Gene. 2008 Dec 31;427(1-2):47-50. doi: 10.1016/j.gene.2008.08.026. Epub 2008 Sep 18.

PDB-Metrics: a web tool for exploring the PDB contents.

Genet Mol Res. 2006 Jun 30;5(2):333-41.

Omics data management and annotation.

Methods Mol Biol. 2011;719:71-96. doi: 10.1007/978-1-61779-027-0_3.

Protein structure databases with new web services for structural biology and biomedical research.

Brief Bioinform. 2008 Jul;9(4):276-85. doi: 10.1093/bib/bbn015. Epub 2008 Apr 22.

Benchmarking NMR experiments: a relational database of protein pulse sequences.

J Magn Reson. 2010 Mar;203(1):129-37. doi: 10.1016/j.jmr.2009.12.008. Epub 2009 Dec 23.

Text mining and protein annotations: the construction and use of protein description sentences.

Genome Inform. 2006;17(2):121-30.

引用本文的文献

STINGAllo: a web server for high-throughput prediction of allosteric site-forming residues using internal protein nanoenvironment descriptors.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf424.

Protein allosteric site identification using machine learning and per amino acid residue reported internal protein nanoenvironment descriptors.

Comput Struct Biotechnol J. 2024 Oct 23;23:3907-3919. doi: 10.1016/j.csbj.2024.10.036. eCollection 2024 Dec.

A comparison between internal protein nanoenvironments of α-helices and β-sheets.

PLoS One. 2020 Dec 30;15(12):e0244315. doi: 10.1371/journal.pone.0244315. eCollection 2020.

Study of specific nanoenvironments containing α-helices in all-α and (α+β)+(α/β) proteins.

PLoS One. 2018 Jul 10;13(7):e0200018. doi: 10.1371/journal.pone.0200018. eCollection 2018.

Novel mutations associated with pyruvate kinase deficiency in Brazil.

Hematol Transfus Cell Ther. 2018 Jan-Mar;40(1):5-11. doi: 10.1016/j.bjhh.2017.08.007. Epub 2017 Nov 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Sting_RDB：一个用于蛋白质分析的结构参数关系数据库，支持数据仓库和数据挖掘。

Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.

作者信息

Oliveira S R M, Almeida G V, Souza K R R, Rodrigues D N, Kuser-Falcão P R, Yamagishi M E B, Santos E H, Vieira F D, Jardine J G, Neshich G

机构信息

Embrapa Informática Agropecuária, Campinas, SP, Brasil.

出版信息

Genet Mol Res. 2007 Oct 5;6(4):911-22.

PMID:18058712

Abstract

摘要

Sting_RDB：一个用于蛋白质分析的结构参数关系数据库，支持数据仓库和数据挖掘。

Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Sting_RDB：一个用于蛋白质分析的结构参数关系数据库，支持数据仓库和数据挖掘。

Sting_RDB: a relational database of structural parameters for protein analysis with support for data warehousing and data mining.

作者信息

机构信息

出版信息

相似文献

引用本文的文献