Suppr超能文献

有效利用靶向搜索空间以改善基于串联质谱的蛋白质组学中的肽段鉴定

Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.

作者信息

Shanmugam Avinash K, Nesvizhskii Alexey I

机构信息

Department of Computational Medicine and Bioinformatics and ‡Department of Pathology, University of Michigan , Ann Arbor, Michigan 48109, United States.

出版信息

J Proteome Res. 2015 Dec 4;14(12):5169-78. doi: 10.1021/acs.jproteome.5b00504. Epub 2015 Nov 24.

Abstract

In shotgun proteomics, peptides are typically identified using database searching, which involves scoring acquired tandem mass spectra against peptides derived from standard protein sequence databases such as Uniprot, Refseq, or Ensembl. In this strategy, the sensitivity of peptide identification is known to be affected by the size of the search space. Therefore, creating a targeted sequence database containing only peptides likely to be present in the analyzed sample can be a useful technique for improving the sensitivity of peptide identification. In this study, we describe how targeted peptide databases can be created based on the frequency of identification in the global proteome machine database (GPMDB), the largest publicly available repository of peptide and protein identification data. We demonstrate that targeted peptide databases can be easily integrated into existing proteome analysis workflows and describe a computational strategy for minimizing any loss of peptide identifications arising from potential search space incompleteness in the targeted search spaces. We demonstrate the performance of our workflow using several data sets of varying size and sample complexity.

摘要

在鸟枪法蛋白质组学中,肽段通常通过数据库搜索来鉴定,这涉及将获取的串联质谱与从诸如Uniprot、Refseq或Ensembl等标准蛋白质序列数据库衍生的肽段进行评分。在这种策略中,已知肽段鉴定的灵敏度会受到搜索空间大小的影响。因此,创建一个仅包含可能存在于分析样品中的肽段的靶向序列数据库,可能是提高肽段鉴定灵敏度的一种有用技术。在本研究中,我们描述了如何基于全球蛋白质组机器数据库(GPMDB)(最大的公开可用肽段和蛋白质鉴定数据存储库)中的鉴定频率来创建靶向肽段数据库。我们证明靶向肽段数据库可以轻松整合到现有的蛋白质组分析工作流程中,并描述了一种计算策略,以最小化因靶向搜索空间中潜在的搜索空间不完整性而导致的肽段鉴定损失。我们使用几个不同大小和样品复杂度的数据集展示了我们工作流程的性能。

相似文献

引用本文的文献

本文引用的文献

7
Ensembl 2014.Ensembl 2014.
Nucleic Acids Res. 2014 Jan;42(Database issue):D749-55. doi: 10.1093/nar/gkt1196. Epub 2013 Dec 6.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验