Suppr超能文献

TOPP——开放式质谱蛋白质组学流程

TOPP--the OpenMS proteomics pipeline.

作者信息

Kohlbacher Oliver, Reinert Knut, Gröpl Clemens, Lange Eva, Pfeifer Nico, Schulz-Trieglaff Ole, Sturm Marc

机构信息

Simulation of Biological Systems, Eberhard Karls University Tübingen Sand 14, 72076 Tübingen, Germany.

出版信息

Bioinformatics. 2007 Jan 15;23(2):e191-7. doi: 10.1093/bioinformatics/btl299.

Abstract

MOTIVATION

Experimental techniques in proteomics have seen rapid development over the last few years. Volume and complexity of the data have both been growing at a similar rate. Accordingly, data management and analysis are one of the major challenges in proteomics. Flexible algorithms are required to handle changing experimental setups and to assist in developing and validating new methods. In order to facilitate these studies, it would be desirable to have a flexible 'toolbox' of versatile and user-friendly applications allowing for rapid construction of computational workflows in proteomics.

RESULTS

We describe a set of tools for proteomics data analysis-TOPP, The OpenMS Proteomics Pipeline. TOPP provides a set of computational tools which can be easily combined into analysis pipelines even by non-experts and can be used in proteomics workflows. These applications range from useful utilities (file format conversion, peak picking) over wrapper applications for known applications (e.g. Mascot) to completely new algorithmic techniques for data reduction and data analysis. We anticipate that TOPP will greatly facilitate rapid prototyping of proteomics data evaluation pipelines. As such, we describe the basic concepts and the current abilities of TOPP and illustrate these concepts in the context of two example applications: the identification of peptides from a raw dataset through database search and the complex analysis of a standard addition experiment for the absolute quantitation of biomarkers. The latter example demonstrates TOPP's ability to construct flexible analysis pipelines in support of complex experimental setups.

AVAILABILITY

The TOPP components are available as open-source software under the lesser GNU public license (LGPL). Source code is available from the project website at www.OpenMS.de

摘要

动机

蛋白质组学的实验技术在过去几年中取得了快速发展。数据的量和复杂度都以相似的速度增长。因此,数据管理和分析是蛋白质组学的主要挑战之一。需要灵活的算法来处理不断变化的实验设置,并协助开发和验证新方法。为了促进这些研究,拥有一个灵活的“工具箱”,其中包含通用且用户友好的应用程序,以便在蛋白质组学中快速构建计算工作流程,将是很理想的。

结果

我们描述了一套用于蛋白质组学数据分析的工具——TOPP,即开放质谱蛋白质组学管道(The OpenMS Proteomics Pipeline)。TOPP提供了一组计算工具,即使非专业人员也能轻松将其组合成分析管道,并可用于蛋白质组学工作流程。这些应用范围从实用工具(文件格式转换、峰检测)到已知应用程序的包装应用(如Mascot),再到用于数据简化和数据分析的全新算法技术。我们预计TOPP将极大地促进蛋白质组学数据评估管道的快速原型设计。因此,我们描述了TOPP的基本概念和当前功能,并在两个示例应用的背景下说明这些概念:通过数据库搜索从原始数据集中鉴定肽段,以及对用于生物标志物绝对定量的标准加入实验进行复杂分析。后一个示例展示了TOPP构建灵活分析管道以支持复杂实验设置的能力。

可用性

TOPP组件以较小的GNU公共许可证(LGPL)作为开源软件提供。源代码可从项目网站www.OpenMS.de获取。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验