Stabenau Arne, McVicker Graham, Melsopp Craig, Proctor Glenn, Clamp Michele, Birney Ewan
EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SD, UK.
Genome Res. 2004 May;14(5):929-33. doi: 10.1101/gr.1857204.
Systems for managing genomic data must store a vast quantity of information. Ensembl stores these data in several MySQL databases. The core software libraries provide a practical and effective means for programmers to access these data. By encapsulating the underlying database structure, the libraries present end users with a simple, abstract interface to a complex data model. Programs that use the libraries rather than SQL to access the data are unaffected by most schema changes. The architecture of the core software libraries, the schema, and the factors influencing their design are described. All code and data are freely available.
用于管理基因组数据的系统必须存储大量信息。Ensembl将这些数据存储在多个MySQL数据库中。核心软件库为程序员提供了一种实用且有效的方式来访问这些数据。通过封装底层数据库结构,这些库为最终用户提供了一个简单的、抽象的接口,以访问复杂的数据模型。使用这些库而不是SQL来访问数据的程序不受大多数模式更改的影响。本文描述了核心软件库的架构、模式以及影响其设计的因素。所有代码和数据均可免费获取。