Suppr超能文献

放松使用 CouchDB——进入生物信息学的非关系型数据库管理系统时代。

Relax with CouchDB--into the non-relational DBMS era of bioinformatics.

机构信息

Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.

出版信息

Genomics. 2012 Jul;100(1):1-7. doi: 10.1016/j.ygeno.2012.05.006. Epub 2012 May 17.

Abstract

With the proliferation of high-throughput technologies, genome-level data analysis has become common in molecular biology. Bioinformaticians are developing extensive resources to annotate and mine biological features from high-throughput data. The underlying database management systems for most bioinformatics software are based on a relational model. Modern non-relational databases offer an alternative that has flexibility, scalability, and a non-rigid design schema. Moreover, with an accelerated development pace, non-relational databases like CouchDB can be ideal tools to construct bioinformatics utilities. We describe CouchDB by presenting three new bioinformatics resources: (a) geneSmash, which collates data from bioinformatics resources and provides automated gene-centric annotations, (b) drugBase, a database of drug-target interactions with a web interface powered by geneSmash, and (c) HapMap-CN, which provides a web interface to query copy number variations from three SNP-chip HapMap datasets. In addition to the web sites, all three systems can be accessed programmatically via web services.

摘要

随着高通量技术的普及,基因组水平数据分析在分子生物学中已变得非常普遍。生物信息学家正在开发广泛的资源,以从高通量数据中注释和挖掘生物特征。大多数生物信息学软件的基础数据库管理系统都基于关系模型。现代非关系数据库提供了一种替代方案,具有灵活性、可扩展性和非刚性设计模式。此外,随着发展步伐的加快,像 CouchDB 这样的非关系数据库可以成为构建生物信息学实用程序的理想工具。我们通过介绍三个新的生物信息学资源来描述 CouchDB:(a) geneSmash,它整理来自生物信息学资源的数据,并提供自动化的以基因为中心的注释,(b) drugBase,一个药物-靶标相互作用数据库,带有由 geneSmash 提供支持的 Web 界面,以及 (c) HapMap-CN,它提供了一个从三个 SNP 芯片 HapMap 数据集查询拷贝数变异的 Web 界面。除了网站之外,所有三个系统都可以通过 Web 服务以编程方式访问。

相似文献

1
Relax with CouchDB--into the non-relational DBMS era of bioinformatics.
Genomics. 2012 Jul;100(1):1-7. doi: 10.1016/j.ygeno.2012.05.006. Epub 2012 May 17.
2
qPortal: A platform for data-driven biomedical research.
PLoS One. 2018 Jan 19;13(1):e0191603. doi: 10.1371/journal.pone.0191603. eCollection 2018.
3
Facilitating the use of large-scale biological data and tools in the era of translational bioinformatics.
Brief Bioinform. 2014 Nov;15(6):942-52. doi: 10.1093/bib/bbt055. Epub 2013 Aug 1.
4
GeneKeyDB: a lightweight, gene-centric, relational database to support data mining environments.
BMC Bioinformatics. 2005 Mar 24;6:72. doi: 10.1186/1471-2105-6-72.
5
MetaBasis: a web-based database containing metadata on software tools and databases in the field of bioinformatics.
Appl Bioinformatics. 2006;5(3):187-92. doi: 10.2165/00822942-200605030-00007.
6
Ergatis: a web interface and scalable software system for bioinformatics workflows.
Bioinformatics. 2010 Jun 15;26(12):1488-92. doi: 10.1093/bioinformatics/btq167. Epub 2010 Apr 22.
7
BioWarehouse: a bioinformatics database warehouse toolkit.
BMC Bioinformatics. 2006 Mar 23;7:170. doi: 10.1186/1471-2105-7-170.
9
10
Developing a modern web interface for database-driven bioinformatics tools.
IEEE Eng Med Biol Mag. 2007 Mar-Apr;26(2):96-8. doi: 10.1109/memb.2007.335598.

引用本文的文献

1
RegulonDB v12.0: a comprehensive resource of transcriptional regulation in E. coli K-12.
Nucleic Acids Res. 2024 Jan 5;52(D1):D255-D264. doi: 10.1093/nar/gkad1072.
2
Benchmarking database systems for Genomic Selection implementation.
Database (Oxford). 2019 Jan 1;2019. doi: 10.1093/database/baz096.
3
Gigwa-Genotype investigator for genome-wide analyses.
Gigascience. 2016 Jun 6;5:25. doi: 10.1186/s13742-016-0131-8.
4
BigQ: a NoSQL based framework to handle genomic variants in i2b2.
BMC Bioinformatics. 2015 Dec 29;16:415. doi: 10.1186/s12859-015-0861-0.
5
Big Data: the challenge for small research groups in the era of cancer genomics.
Br J Cancer. 2015 Nov 17;113(10):1405-12. doi: 10.1038/bjc.2015.341. Epub 2015 Oct 22.
6
Open source libraries and frameworks for biological data visualisation: a guide for developers.
Proteomics. 2015 Apr;15(8):1356-74. doi: 10.1002/pmic.201400377. Epub 2015 Feb 5.
7
Big data and biomedical informatics: a challenging opportunity.
Yearb Med Inform. 2014 May 22;9(1):8-13. doi: 10.15265/IY-2014-0024.
8
RefPrimeCouch--a reference gene primer CouchApp.
Database (Oxford). 2013 Dec 24;2013:bat081. doi: 10.1093/database/bat081. Print 2013.
9
Exposome informatics: considerations for the design of future biomedical research information systems.
J Am Med Inform Assoc. 2014 May-Jun;21(3):386-90. doi: 10.1136/amiajnl-2013-001772. Epub 2013 Nov 1.
10
targetHub: a programmable interface for miRNA-gene interactions.
Bioinformatics. 2013 Oct 15;29(20):2657-8. doi: 10.1093/bioinformatics/btt439. Epub 2013 Sep 6.

本文引用的文献

2
SeqWare Query Engine: storing and searching sequence data in the cloud.
BMC Bioinformatics. 2010 Dec 21;11 Suppl 12(Suppl 12):S2. doi: 10.1186/1471-2105-11-S12-S2.
3
Entrez Gene: gene-centered information at NCBI.
Nucleic Acids Res. 2011 Jan;39(Database issue):D52-7. doi: 10.1093/nar/gkq1237. Epub 2010 Nov 28.
4
DrugBank 3.0: a comprehensive resource for 'omics' research on drugs.
Nucleic Acids Res. 2011 Jan;39(Database issue):D1035-41. doi: 10.1093/nar/gkq1126. Epub 2010 Nov 8.
5
miRBase: integrating microRNA annotation and deep-sequencing data.
Nucleic Acids Res. 2011 Jan;39(Database issue):D152-7. doi: 10.1093/nar/gkq1027. Epub 2010 Oct 30.
6
The RCSB Protein Data Bank: redesigned web site and web services.
Nucleic Acids Res. 2011 Jan;39(Database issue):D392-401. doi: 10.1093/nar/gkq1021. Epub 2010 Oct 29.
7
The UCSC Genome Browser database: update 2011.
Nucleic Acids Res. 2011 Jan;39(Database issue):D876-82. doi: 10.1093/nar/gkq963. Epub 2010 Oct 18.
8
GeneCards Version 3: the human gene integrator.
Database (Oxford). 2010 Aug 5;2010:baq020. doi: 10.1093/database/baq020.
9
GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation.
OMICS. 2009 Dec;13(6):477-87. doi: 10.1089/omi.2009.0069.
10
Update of TTD: Therapeutic Target Database.
Nucleic Acids Res. 2010 Jan;38(Database issue):D787-91. doi: 10.1093/nar/gkp1014. Epub 2009 Nov 20.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验