Suppr超能文献

通过质谱数据聚类改进大规模蛋白质组学

Improving large-scale proteomics by clustering of mass spectrometry data.

作者信息

Beer Ilan, Barnea Eilon, Ziv Tamar, Admon Arie

机构信息

IBM Haifa Research Lab, Haifa, Israel.

出版信息

Proteomics. 2004 Apr;4(4):950-60. doi: 10.1002/pmic.200300652.

Abstract

Tandem mass spectrometry (MS/MS), coupled with liquid chromatography (LC), is a powerful tool for the analysis and comparison of complex protein and peptide mixtures. However, the extremely large amounts of data that result from the process are very complex and difficult to analyze. We show how the clustering of similar spectra from multiple LC-MS/MS runs can help in data management and improve the analysis of complex peptide mixtures. The major effect of spectrum clustering is the reduction of the huge amounts of data to a manageable size. As a result, analysis time is shorter and more data can be stored for further analysis. Furthermore, spectrum quality improvement allows the identification of more peptides with greater confidence, the comparison of complex peptide mixtures is facilitated, and the entire proteomics project is presented in concise form. Pep-Miner is an advanced software tool that implements these clustering-based applications. It proved useful in several comparative proteomics projects involving lung cancer cells and various other cell types. In one of these projects, Pep-Miner reduced 517 000 spectra to 20 900 clusters and identified 2518 peptides derived from 830 proteins. Clustering and identification lasted less than two hours on an IBM Thinkpad T23 computer (laptop). Pep-Miner's unique properties make it a very useful tool for large-scale shotgun proteomics projects.

摘要

串联质谱法(MS/MS)与液相色谱法(LC)联用,是分析和比较复杂蛋白质及肽混合物的强大工具。然而,该过程产生的海量数据非常复杂且难以分析。我们展示了如何通过对多次LC-MS/MS运行产生的相似光谱进行聚类,来帮助进行数据管理并改进对复杂肽混合物的分析。光谱聚类的主要作用是将海量数据减少到可管理的规模。结果,分析时间更短,且能存储更多数据以供进一步分析。此外,光谱质量的提高使得能够更有信心地鉴定更多肽段,便于对复杂肽混合物进行比较,并以简洁的形式呈现整个蛋白质组学项目。Pep-Miner是一款实现这些基于聚类应用的先进软件工具。它在涉及肺癌细胞及其他多种细胞类型的多个比较蛋白质组学项目中证明很有用。在其中一个项目中,Pep-Miner将517000个光谱减少到20900个聚类,并鉴定出源自830种蛋白质的2518个肽段。在一台IBM Thinkpad T23笔记本电脑上,聚类和鉴定耗时不到两小时。Pep-Miner的独特特性使其成为大规模鸟枪法蛋白质组学项目的非常有用的工具。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验