鸟枪法蛋白质组学数据的解读:蛋白质推断问题。

Interpretation of shotgun proteomic data: the protein inference problem.

作者信息

Nesvizhskii Alexey I, Aebersold Ruedi

机构信息

Institute for Systems Biology, Seattle, Washington 98103, USA.

出版信息

Mol Cell Proteomics. 2005 Oct;4(10):1419-40. doi: 10.1074/mcp.R500012-MCP200. Epub 2005 Jul 11.

Abstract

The shotgun proteomic strategy based on digesting proteins into peptides and sequencing them using tandem mass spectrometry and automated database searching has become the method of choice for identifying proteins in most large scale studies. However, the peptide-centric nature of shotgun proteomics complicates the analysis and biological interpretation of the data especially in the case of higher eukaryote organisms. The same peptide sequence can be present in multiple different proteins or protein isoforms. Such shared peptides therefore can lead to ambiguities in determining the identities of sample proteins. In this article we illustrate the difficulties of interpreting shotgun proteomic data and discuss the need for common nomenclature and transparent informatic approaches. We also discuss related issues such as the state of protein sequence databases and their role in shotgun proteomic analysis, interpretation of relative peptide quantification data in the presence of multiple protein isoforms, the integration of proteomic and transcriptional data, and the development of a computational infrastructure for the integration of multiple diverse datasets.

摘要

基于将蛋白质消化成肽段并使用串联质谱和自动数据库搜索对其进行测序的鸟枪法蛋白质组学策略,已成为大多数大规模研究中鉴定蛋白质的首选方法。然而,鸟枪法蛋白质组学以肽段为中心的性质使数据的分析和生物学解释变得复杂,尤其是在高等真核生物的情况下。相同的肽段序列可能存在于多种不同的蛋白质或蛋白质异构体中。因此,这种共享肽段可能会导致在确定样品蛋白质的身份时产生歧义。在本文中,我们阐述了解释鸟枪法蛋白质组学数据的困难,并讨论了通用命名法和透明信息学方法的必要性。我们还讨论了相关问题,如蛋白质序列数据库的状态及其在鸟枪法蛋白质组学分析中的作用、在存在多种蛋白质异构体的情况下相对肽段定量数据的解释、蛋白质组学和转录数据的整合,以及用于整合多个不同数据集的计算基础设施的开发。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索