Wolstencroft Katy, Owen Stuart, du Preez Franco, Krebs Olga, Mueller Wolfgang, Goble Carole, Snoep Jacky L
School of Computer Science, University of Manchester, Manchester, United Kingdom.
Methods Enzymol. 2011;500:629-55. doi: 10.1016/B978-0-12-385118-5.00029-3.
Systems biology research is typically performed by multidisciplinary groups of scientists, often in large consortia and in distributed locations. The data generated in these projects tend to be heterogeneous and often involves high-throughput "omics" analyses. Models are developed iteratively from data generated in the projects and from the literature. Consequently, there is a growing requirement for exchanging experimental data, mathematical models, and scientific protocols between consortium members and a necessity to record and share the outcomes of experiments and the links between data and models. The overall output of a research consortium is also a valuable commodity in its own right. The research and associated data and models should eventually be available to the whole community for reuse and future analysis. The SEEK is an open-source, Web-based platform designed for the management and exchange of systems biology data and models. The SEEK was originally developed for the SysMO (systems biology of microorganisms) consortia, but the principles and objectives are applicable to any systems biology project. The SEEK provides an index of consortium resources and acts as gateway to other tools and services commonly used in the community. For example, the model simulation tool, JWS Online, has been integrated into the SEEK, and a plug-in to PubMed allows publications to be linked to supporting data and author profiles in the SEEK. The SEEK is a pragmatic solution to data management which encourages, but does not force, researchers to share and disseminate their data to community standard formats. It provides tools to assist with management and annotation as well as incentives and added value for following these recommendations. Data exchange and reuse rely on sufficient annotation, consistent metadata descriptions, and the use of standard exchange formats for models, data, and the experiments they are derived from. In this chapter, we present the SEEK platform, its functionalities, and the methods employed for lowering the barriers to adoption of standard formats. As the production of biological data continues to grow, in systems biology and in the life sciences in general, the need to record, manage, and exploit this wealth of information in the future is increasing. We promote the SEEK as a data and model management tool that can be adapted to the specific needs of a particular systems biology project.
系统生物学研究通常由多学科的科学家团队进行,这些团队往往规模庞大,分布在不同地点。这些项目产生的数据往往具有异质性,并且常常涉及高通量的“组学”分析。模型是根据项目中产生的数据以及文献中的数据迭代开发的。因此,联盟成员之间交换实验数据、数学模型和科学协议的需求日益增长,同时有必要记录和共享实验结果以及数据与模型之间的联系。研究联盟的整体产出本身也是一种有价值的商品。研究以及相关数据和模型最终应可供整个科学界重复使用和进行未来分析。SEEK是一个基于网络的开源平台,旨在管理和交换系统生物学数据及模型。SEEK最初是为SysMO(微生物系统生物学)联盟开发的,但其中的原则和目标适用于任何系统生物学项目。SEEK提供联盟资源索引,并作为通往该领域常用的其他工具和服务的网关。例如,模型模拟工具JWS Online已集成到SEEK中,并且一个PubMed插件可将出版物与SEEK中的支持数据和作者简介相链接。SEEK是一种务实的数据管理解决方案,它鼓励但不强制研究人员将其数据共享并传播为社区标准格式。它提供工具来辅助管理和注释,以及遵循这些建议的激励措施和附加值。数据交换和重用依赖于充分的注释、一致的元数据描述,以及对模型、数据及其所源自的实验使用标准交换格式。在本章中,我们介绍SEEK平台、其功能以及为降低采用标准格式的障碍而采用的方法。随着生物数据的产量在系统生物学以及整个生命科学领域持续增长,未来记录、管理和利用这些丰富信息的需求也在增加。我们推广SEEK作为一种可适应特定系统生物学项目特定需求的数据和模型管理工具。