Schoof H, Ernst R, Mayer K F X
Technische Universität München, Chair of Genome-oriented Bioinformatics, Center of Life and Food Science, Freising-Weihenstephan D-85350, Germany.
Comp Funct Genomics. 2004;5(2):184-9. doi: 10.1002/cfg.374.
The completion of the Arabidopsis genome and the large collections of other plant sequences generated in recent years have sparked extensive functional genomics efforts. However, these data are used inefficiently, because data sources are distributed and heterogeneous and efforts at data integration are lagging behind. PlaNet aims to overcome the limitations of individual efforts as well as those of heterogeneous, independent data collections. PlaNet is a distributed effort among European bioinformatics groups and plant molecular biologists to establish a comprehensive integrated database in a collaborative network. Its objectives are to implement the infrastructure and data sources needed to capture plant genomic information in a comprehensive, integrated platform. This will facilitate the systematic exploration of Arabidopsis and other plants. New methods for data exchange, database integration and access are being developed to create a highly integrated, federated data resource for research. The individual resources are connected through BioMOBY, which provides an architecture for the discovery and distribution of biological data through web services. While knowledge about the available resources is centralized, the data are maintained at their primary source, with no need for warehousing. To standardize nomenclature and data representation, ontologies and generic data models are defined in interaction with the relevant communities. Minimal data models should make broad integration simple, while inheritance allows detail and depth to be added to more complex data objects without losing integration. To allow expert annotation and keep databases curated, local and remote annotation interfaces are provided. Easy and direct access to all data is key to the project.
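The combination of minimal data models, inheritance and a central registry over distributed sources can be illustrated with a short sketch. The code below is not the BioMOBY API and does not reproduce PlaNet's actual data models; the class names (`GeneRecord`, `AnnotatedGeneRecord`, `Registry`), the two provider functions and the field names are hypothetical and chosen only for illustration. It assumes that every provider returns at least the minimal record, so a client can integrate results from all sources while richer records keep their extra detail.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class GeneRecord:
    """Hypothetical minimal model: fields every source is assumed to supply."""
    identifier: str  # e.g. an Arabidopsis locus code
    source: str      # name of the database that maintains the record


@dataclass
class AnnotatedGeneRecord(GeneRecord):
    """Hypothetical richer model: inherits the minimal fields, adds detail."""
    description: str = ""
    ontology_terms: List[str] = field(default_factory=list)


class Registry:
    """Central index of available services; it stores no data itself."""

    def __init__(self) -> None:
        self._services: Dict[str, Callable[[str], GeneRecord]] = {}

    def register(self, name: str, service: Callable[[str], GeneRecord]) -> None:
        self._services[name] = service

    def discover(self) -> List[str]:
        # Service names in alphabetical order.
        return sorted(self._services)

    def query_all(self, identifier: str) -> List[GeneRecord]:
        # Each call goes to the provider, which answers from its own data.
        return [self._services[name](identifier) for name in self.discover()]


def sequence_db(identifier: str) -> GeneRecord:
    # Stand-in for a remote source that only offers the minimal model.
    return GeneRecord(identifier, "sequence_db")


def annotation_db(identifier: str) -> AnnotatedGeneRecord:
    # Stand-in for a remote source with curated annotation (placeholder values).
    return AnnotatedGeneRecord(
        identifier,
        "annotation_db",
        description="placeholder description",
        ontology_terms=["GO:0008150"],
    )


if __name__ == "__main__":
    registry = Registry()
    registry.register("sequence_db", sequence_db)
    registry.register("annotation_db", annotation_db)
    for record in registry.query_all("AT1G01010"):
        # Code written against the minimal model works for every record type.
        print(record.source, record.identifier)
```

In this sketch the registry plays the role of the centralized knowledge, while each provider function stands in for a data source that remains at its primary location. Because `AnnotatedGeneRecord` inherits from `GeneRecord`, a client that only understands the minimal model can still process annotated records.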