Lo Siaw Ling, You Tao, Lin Qingsong, Joshi Shashikant B, Chung Maxey C M, Hew Choy Leong
Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore.
Proteomics. 2006 Mar;6(6):1758-69. doi: 10.1002/pmic.200500378.
In proteomics, the growing difficulty of unifying data formats, caused by differing platforms, instrumentation, and laboratory documentation systems, greatly hinders the verification, exchange, and comparison of experimental data. Therefore, it is essential to establish standard formats for every necessary aspect of proteomics data. One recently published data model is the proteomics experiment data repository (PEDRo) [Taylor, C. F., Paton, N. W., Garwood, K. L., Kirby, P. D. et al., Nat. Biotechnol. 2003, 21, 247-254]. Compliant with this format, we developed the systematic proteomics laboratory analysis and storage hub (SPLASH) database system as an informatics infrastructure to support proteomics studies. It consists of three modules and gives proteomics researchers a common platform to store, manage, search, analyze, and exchange their data. (i) The data maintenance module covers experimental data entry and update, batch-mode uploading of experimental results, and data exchange in the original PEDRo format. (ii) The data search module provides several ways to search the database and to view either the protein information or the differential expression display by clicking on a gel image. (iii) The data mining module contains tools that perform biochemical pathway, statistics-associated gene ontology, and other comparative analyses across all sample sets to interpret their biological meaning. These features make SPLASH a practical and powerful tool for the proteomics community.