Wang L, Rodriguez-Tomé P, Redaschi N, McNeil P, Robinson A, Lijnzaad P
EMBL Outstation - Hinxton, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
Genome Biol. 2000;1(5):RESEARCH0010. doi: 10.1186/gb-2000-1-5-research0010. Epub 2000 Nov 6.
The EMBL Nucleotide Sequence Database is a comprehensive database of DNA and RNA sequences and related information, traditionally made available in flat-file format. Queries through tools such as SRS (Sequence Retrieval System) also return data in flat-file format. Flat files have a number of shortcomings, however, and these resources therefore lack a flexible environment that can meet individual researchers' needs. The Object Management Group's Common Object Request Broker Architecture (CORBA) is an industry standard that provides platform-independent programming interfaces and models for portable, distributed, object-oriented computing applications. Its independence from programming languages, computing platforms and network protocols makes it attractive for developing new applications for querying and distributing biological data.
A CORBA infrastructure developed by EMBL-EBI provides an efficient means of accessing and distributing EMBL data. The EMBL object model is defined so that it provides a basis for specifying interfaces in interface definition language (IDL) and thus for developing the CORBA servers. The mapping from the object model to the relational schema in the underlying Oracle database uses the facilities provided by Persistence™, an object/relational tool. Loaders combined with 'live object caching' of persistent objects yield a smart live object cache in which objects are created on demand and managed by an evictor pattern mechanism.
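The on-demand creation and eviction described above can be illustrated with a minimal sketch. It is not the EMBL-EBI implementation: the class name `LiveObjectCache`, the `fake_loader` function, the fixed capacity and the least-recently-used eviction policy are all assumptions made for illustration, since the abstract does not specify how the evictor selects objects.

```python
from collections import OrderedDict
from typing import Callable, List


class LiveObjectCache:
    """Illustrative on-demand object cache with a least-recently-used evictor."""

    def __init__(self, loader: Callable[[str], dict], capacity: int) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self._loader = loader        # hypothetical stand-in for a database loader
        self._capacity = capacity    # assumed upper bound on live objects
        self._live: "OrderedDict[str, dict]" = OrderedDict()
        self.evicted: List[str] = []  # keys evicted so far, kept only for inspection

    def get(self, accession: str) -> dict:
        if accession in self._live:
            # Cache hit: mark the object as most recently used.
            self._live.move_to_end(accession)
            return self._live[accession]
        # Cache miss: create the object on demand through the loader.
        obj = self._loader(accession)
        self._live[accession] = obj
        if len(self._live) > self._capacity:
            # Evictor step: drop the least recently used object (assumed policy).
            old_key, _ = self._live.popitem(last=False)
            self.evicted.append(old_key)
        return obj

    def live_keys(self) -> List[str]:
        """Return the keys of live objects, least recently used first."""
        return list(self._live)


def fake_loader(accession: str) -> dict:
    """Hypothetical loader; a real one would query the relational database."""
    return {"accession": accession}
```

With a capacity of two, requesting three distinct entries forces the evictor to discard the entry that was used least recently, and a later request for that entry would invoke the loader again.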
The CORBA interfaces to the EMBL database address some of the problems of traditional flat-file formats and provide an efficient means of accessing and distributing EMBL data. CORBA also gives users a flexible environment in which to develop their own applications: they can build clients for our CORBA servers and integrate those clients into existing systems.