CIRAD, UMR AGAP, Montpellier F-34398, France.
BMC Bioinformatics. 2013 Apr 15;14:126. doi: 10.1186/1471-2105-14-126.
In recent years, a large amount of "-omics" data have been produced. However, these data are stored in many different species-specific databases that are managed by different institutes and laboratories. Biologists often need to find and assemble data from disparate sources to perform certain analyses. Searching for these data and assembling them is a time-consuming task. The Semantic Web helps to facilitate interoperability across databases. A common approach involves the development of wrapper systems that map a relational database schema onto existing domain ontologies. However, few attempts have been made to automate the creation of such wrappers.
We developed a framework, named BioSemantic, for the creation of Semantic Web Services that are applicable to relational biological databases. This framework makes use of both Semantic Web and Web Services technologies and can be divided into two main parts: (i) the generation and semi-automatic annotation of an RDF view; and (ii) the automatic generation of SPARQL queries and their integration into Semantic Web Services backbones. We have used our framework to integrate genomic data from different plant databases.
BioSemantic is a framework that was designed to speed integration of relational databases. We present how it can be used to speed the development of Semantic Web Services for existing relational biological databases. Currently, it creates and annotates RDF views that enable the automatic generation of SPARQL queries. Web Services are also created and deployed automatically, and the semantic annotations of our Web Services are added automatically using SAWSDL attributes. BioSemantic is downloadable at http://southgreen.cirad.fr/?q=content/Biosemantic.
近年来,产生了大量的“组学”数据。然而,这些数据存储在许多不同的物种特定数据库中,这些数据库由不同的研究所和实验室管理。生物学家经常需要从不同的来源查找和组装数据来执行某些分析。搜索这些数据并将其组装在一起是一项耗时的任务。语义网有助于促进数据库之间的互操作性。一种常见的方法是开发包装系统,将关系数据库模式映射到现有领域本体上。然而,很少有人试图自动创建这些包装器。
我们开发了一个名为 BioSemantic 的框架,用于创建适用于关系生物数据库的语义网服务。该框架利用语义网和 Web 服务技术,可以分为两个主要部分:(i)生成和半自动注释 RDF 视图;(ii)自动生成 SPARQL 查询并将其集成到语义网服务骨干中。我们已经使用我们的框架整合了来自不同植物数据库的基因组数据。
BioSemantic 是一个旨在加速关系数据库集成的框架。我们展示了如何使用它来加速现有关系生物数据库的语义网服务的开发。目前,它创建和注释 RDF 视图,从而能够自动生成 SPARQL 查询。Web 服务也会自动创建和部署,并且使用 SAWSDL 属性自动添加我们的 Web 服务的语义注释。BioSemantic 可在 http://southgreen.cirad.fr/?q=content/Biosemantic 下载。