放松使用 CouchDB——进入生物信息学的非关系型数据库管理系统时代。

Relax with CouchDB--into the non-relational DBMS era of bioinformatics.

机构信息

Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.

出版信息

Genomics. 2012 Jul;100(1):1-7. doi: 10.1016/j.ygeno.2012.05.006. Epub 2012 May 17.

DOI:10.1016/j.ygeno.2012.05.006

PMID:22609849

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3383915/

Abstract

With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.

摘要

随着高通量技术的普及，基因组水平数据分析在分子生物学中已变得非常普遍。生物信息学家正在开发广泛的资源，以从高通量数据中注释和挖掘生物特征。大多数生物信息学软件的基础数据库管理系统都基于关系模型。现代非关系数据库提供了一种替代方案，具有灵活性、可扩展性和非刚性设计模式。此外，随着发展步伐的加快，像 CouchDB 这样的非关系数据库可以成为构建生物信息学实用程序的理想工具。我们通过介绍三个新的生物信息学资源来描述 CouchDB：(a) geneSmash，它整理来自生物信息学资源的数据，并提供自动化的以基因为中心的注释，(b) drugBase，一个药物-靶标相互作用数据库，带有由 geneSmash 提供支持的 Web 界面，以及 (c) HapMap-CN，它提供了一个从三个 SNP 芯片 HapMap 数据集查询拷贝数变异的 Web 界面。除了网站之外，所有三个系统都可以通过 Web 服务以编程方式访问。

相似文献

Relax with CouchDB--into the non-relational DBMS era of bioinformatics.

Genomics. 2012 Jul;100(1):1-7. doi: 10.1016/j.ygeno.2012.05.006. Epub 2012 May 17.

qPortal: A platform for data-driven biomedical research.

PLoS One. 2018 Jan 19;13(1):e0191603. doi: 10.1371/journal.pone.0191603. eCollection 2018.

Facilitating the use of large-scale biological data and tools in the era of translational bioinformatics.

Brief Bioinform. 2014 Nov;15(6):942-52. doi: 10.1093/bib/bbt055. Epub 2013 Aug 1.

GeneKeyDB: a lightweight, gene-centric, relational database to support data mining environments.

BMC Bioinformatics. 2005 Mar 24;6:72. doi: 10.1186/1471-2105-6-72.

MetaBasis: a web-based database containing metadata on software tools and databases in the field of bioinformatics.

Appl Bioinformatics. 2006;5(3):187-92. doi: 10.2165/00822942-200605030-00007.

Ergatis: a web interface and scalable software system for bioinformatics workflows.

Bioinformatics. 2010 Jun 15;26(12):1488-92. doi: 10.1093/bioinformatics/btq167. Epub 2010 Apr 22.

BioWarehouse: a bioinformatics database warehouse toolkit.

BMC Bioinformatics. 2006 Mar 23;7:170. doi: 10.1186/1471-2105-7-170.

An XML transfer schema for exchange of genomic and genetic mapping data: implementation as a web service in a Taverna workflow.

BMC Bioinformatics. 2009 Aug 14;10:252. doi: 10.1186/1471-2105-10-252.

OpenFlyData: an exemplar data web integrating gene expression data on the fruit fly Drosophila melanogaster.

J Biomed Inform. 2010 Oct;43(5):752-61. doi: 10.1016/j.jbi.2010.04.004.

Developing a modern web interface for database-driven bioinformatics tools.

IEEE Eng Med Biol Mag. 2007 Mar-Apr;26(2):96-8. doi: 10.1109/memb.2007.335598.

引用本文的文献

RegulonDB v12.0: a comprehensive resource of transcriptional regulation in E. coli K-12.

Nucleic Acids Res. 2024 Jan 5;52(D1):D255-D264. doi: 10.1093/nar/gkad1072.

Benchmarking database systems for Genomic Selection implementation.

Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz096.

Gigwa-Genotype investigator for genome-wide analyses.

Gigascience. 2016 Jun 6;5:25. doi: 10.1186/s13742-016-0131-8.

BigQ: a NoSQL based framework to handle genomic variants in i2b2.

BMC Bioinformatics. 2015 Dec 29;16:415. doi: 10.1186/s12859-015-0861-0.

Big Data: the challenge for small research groups in the era of cancer genomics.

Br J Cancer. 2015 Nov 17;113(10):1405-12. doi: 10.1038/bjc.2015.341. Epub 2015 Oct 22.

Open source libraries and frameworks for biological data visualisation: a guide for developers.

Proteomics. 2015 Apr;15(8):1356-74. doi: 10.1002/pmic.201400377. Epub 2015 Feb 5.

Big data and biomedical informatics: a challenging opportunity.

Yearb Med Inform. 2014 May 22;9(1):8-13. doi: 10.15265/IY-2014-0024.

RefPrimeCouch--a reference gene primer CouchApp.

Database (Oxford). 2013 Dec 24;2013:bat081. doi: 10.1093/database/bat081. Print 2013.

Exposome informatics: considerations for the design of future biomedical research information systems.

J Am Med Inform Assoc. 2014 May-Jun;21(3):386-90. doi: 10.1136/amiajnl-2013-001772. Epub 2013 Nov 1.

targetHub: a programmable interface for miRNA-gene interactions.

Bioinformatics. 2013 Oct 15;29(20):2657-8. doi: 10.1093/bioinformatics/btt439. Epub 2013 Sep 6.

本文引用的文献

Knowledge engineering tools for reasoning with scientific observations and interpretations: a neural connectivity use case.

BMC Bioinformatics. 2011 Aug 22;12:351. doi: 10.1186/1471-2105-12-351.

SeqWare Query Engine: storing and searching sequence data in the cloud.

BMC Bioinformatics. 2010 Dec 21;11 Suppl 12(Suppl 12):S2. doi: 10.1186/1471-2105-11-S12-S2.

Entrez Gene: gene-centered information at NCBI.

Nucleic Acids Res. 2011 Jan;39(Database issue):D52-7. doi: 10.1093/nar/gkq1237. Epub 2010 Nov 28.

DrugBank 3.0: a comprehensive resource for 'omics' research on drugs.

Nucleic Acids Res. 2011 Jan;39(Database issue):D1035-41. doi: 10.1093/nar/gkq1126. Epub 2010 Nov 8.

miRBase: integrating microRNA annotation and deep-sequencing data.

Nucleic Acids Res. 2011 Jan;39(Database issue):D152-7. doi: 10.1093/nar/gkq1027. Epub 2010 Oct 30.

The RCSB Protein Data Bank: redesigned web site and web services.

Nucleic Acids Res. 2011 Jan;39(Database issue):D392-401. doi: 10.1093/nar/gkq1021. Epub 2010 Oct 29.

The UCSC Genome Browser database: update 2011.

Nucleic Acids Res. 2011 Jan;39(Database issue):D876-82. doi: 10.1093/nar/gkq963. Epub 2010 Oct 18.

GeneCards Version 3: the human gene integrator.

Database (Oxford). 2010 Aug 5;2010:baq020. doi: 10.1093/database/baq020.

GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation.

OMICS. 2009 Dec;13(6):477-87. doi: 10.1089/omi.2009.0069.

Update of TTD: Therapeutic Target Database.

Nucleic Acids Res. 2010 Jan;38(Database issue):D787-91. doi: 10.1093/nar/gkp1014. Epub 2009 Nov 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

放松使用 CouchDB——进入生物信息学的非关系型数据库管理系统时代。

Relax with CouchDB--into the non-relational DBMS era of bioinformatics.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献