Hartler Jürgen, Thallinger Gerhard G, Stocker Gernot, Sturn Alexander, Burkard Thomas R, Körner Erik, Rader Robert, Schmidt Andreas, Mechtler Karl, Trajanoski Zlatko
Institute for Genomics and Bioinformatics and Christian-Doppler Laboratory for Genomics and Bioinformatics, Graz University of Technology, Petersgasse 14, Graz, Austria.
BMC Bioinformatics. 2007 Jun 13;8:197. doi: 10.1186/1471-2105-8-197.
The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches.
We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation, 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at http://genome.tugraz.at/maspectras
Given the unique features and the flexibility due to the use of standard software technology, our platform represents significant advance and could be of great interest to the proteomics community.
蛋白质组学技术的进步使得数据集的数量、规模和生成速度迅速增加。从这些数据集中管理和提取有价值的信息需要使用数据管理平台和计算方法。
我们开发了质谱分析系统(MASPECTRAS),这是一个用于管理和分析蛋白质组学液相色谱-串联质谱(LC-MS/MS)数据的平台。MASPECTRAS基于蛋白质组实验数据存储库(PEDRo)关系数据库模式,并遵循蛋白质组学标准倡议(PSI)的指导方针。分析模块包括:1)导入和解析来自搜索引擎SEQUEST、Mascot、Spectrum Mill、X! Tandem和OMSSA的结果;2)肽段验证;3)基于马尔可夫聚类和多序列比对的蛋白质聚类;4)使用蛋白质丰度比自动统计分析算法(ASAPRatio)进行定量分析。该系统提供可定制的数据检索和可视化工具,以及导出到蛋白质组学鉴定公共存储库(PRIDE)。MASPECTRAS可从http://genome.tugraz.at/maspectras免费获取。
鉴于其独特的功能以及由于使用标准软件技术而具有的灵活性,我们的平台代表了重大进展,可能会引起蛋白质组学界的极大兴趣。