

The Ensembl core software libraries.

Author information

Stabenau Arne, McVicker Graham, Melsopp Craig, Proctor Glenn, Clamp Michele, Birney Ewan

Affiliation

EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK.

Publication information

Genome Res. 2004 May;14(5):929-33. doi: 10.1101/gr.1857204.

Abstract

Systems for managing genomic data must store a vast quantity of information. Ensembl stores these data in several MySQL databases. The core software libraries provide a practical and effective means for programmers to access these data. By encapsulating the underlying database structure, the libraries present end users with a simple, abstract interface to a complex data model. Programs that use the libraries rather than SQL to access the data are unaffected by most schema changes. The architecture of the core software libraries, the schema, and the factors influencing their design are described. All code and data are freely available.
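The encapsulation idea in the abstract can be illustrated with a minimal sketch. It is not the Ensembl API: the class names, table names, and column names below are hypothetical, and SQLite from the Python standard library stands in for MySQL so that the example is self-contained. It shows the same client function working against two different table layouts because only the adaptor classes contain SQL.

```python
import sqlite3
from dataclasses import dataclass


@dataclass
class Gene:
    """Plain data object handed to client code (hypothetical model)."""
    stable_id: str
    start: int
    end: int


class GeneAdaptorV1:
    """Adaptor for a hypothetical schema where the stable ID lives on the gene table."""

    def __init__(self, conn):
        self.conn = conn

    def fetch_by_stable_id(self, stable_id):
        row = self.conn.execute(
            "SELECT stable_id, gene_start, gene_end FROM gene WHERE stable_id = ?",
            (stable_id,),
        ).fetchone()
        return Gene(*row) if row else None


class GeneAdaptorV2:
    """Adaptor for a hypothetical later schema where stable IDs moved to their own table."""

    def __init__(self, conn):
        self.conn = conn

    def fetch_by_stable_id(self, stable_id):
        row = self.conn.execute(
            "SELECT s.stable_id, g.gene_start, g.gene_end "
            "FROM gene g JOIN gene_stable_id s ON s.gene_id = g.gene_id "
            "WHERE s.stable_id = ?",
            (stable_id,),
        ).fetchone()
        return Gene(*row) if row else None


def make_v1_db():
    """Build an in-memory database using the first (assumed) layout."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE gene (stable_id TEXT, gene_start INTEGER, gene_end INTEGER)"
    )
    conn.execute("INSERT INTO gene VALUES ('GENE0001', 100, 900)")
    return conn


def make_v2_db():
    """Build an in-memory database using the second (assumed) layout."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE gene (gene_id INTEGER PRIMARY KEY, gene_start INTEGER, gene_end INTEGER)"
    )
    conn.execute("CREATE TABLE gene_stable_id (gene_id INTEGER, stable_id TEXT)")
    conn.execute("INSERT INTO gene VALUES (1, 100, 900)")
    conn.execute("INSERT INTO gene_stable_id VALUES (1, 'GENE0001')")
    return conn


def gene_length(adaptor, stable_id):
    """Client code: contains no SQL, so it does not change when the schema does."""
    gene = adaptor.fetch_by_stable_id(stable_id)
    return gene.end - gene.start + 1


if __name__ == "__main__":
    print(gene_length(GeneAdaptorV1(make_v1_db()), "GENE0001"))
    print(gene_length(GeneAdaptorV2(make_v2_db()), "GENE0001"))
```

In this sketch a schema change is absorbed by swapping the adaptor class, while `gene_length` stays untouched. A program that embedded the first `SELECT` statement directly would break against the second layout.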




