Suppr超能文献

有效利用靶向搜索空间以改善基于串联质谱的蛋白质组学中的肽段鉴定

Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.

作者信息

Shanmugam Avinash K, Nesvizhskii Alexey I

机构信息

Department of Computational Medicine and Bioinformatics and ‡Department of Pathology, University of Michigan , Ann Arbor, Michigan 48109, United States.

出版信息

J Proteome Res. 2015 Dec 4;14(12):5169-78. doi: 10.1021/acs.jproteome.5b00504. Epub 2015 Nov 24.

Abstract

In shotgun proteomics, peptides are typically identified using database searching, which involves scoring acquired tandem mass spectra against peptides derived from standard protein sequence databases such as Uniprot, Refseq, or Ensembl. In this strategy, the sensitivity of peptide identification is known to be affected by the size of the search space. Therefore, creating a targeted sequence database containing only peptides likely to be present in the analyzed sample can be a useful technique for improving the sensitivity of peptide identification. In this study, we describe how targeted peptide databases can be created based on the frequency of identification in the global proteome machine database (GPMDB), the largest publicly available repository of peptide and protein identification data. We demonstrate that targeted peptide databases can be easily integrated into existing proteome analysis workflows and describe a computational strategy for minimizing any loss of peptide identifications arising from potential search space incompleteness in the targeted search spaces. We demonstrate the performance of our workflow using several data sets of varying size and sample complexity.

摘要

在鸟枪法蛋白质组学中,肽段通常通过数据库搜索来鉴定,这涉及将获取的串联质谱与从诸如Uniprot、Refseq或Ensembl等标准蛋白质序列数据库衍生的肽段进行评分。在这种策略中,已知肽段鉴定的灵敏度会受到搜索空间大小的影响。因此,创建一个仅包含可能存在于分析样品中的肽段的靶向序列数据库,可能是提高肽段鉴定灵敏度的一种有用技术。在本研究中,我们描述了如何基于全球蛋白质组机器数据库(GPMDB)(最大的公开可用肽段和蛋白质鉴定数据存储库)中的鉴定频率来创建靶向肽段数据库。我们证明靶向肽段数据库可以轻松整合到现有的蛋白质组分析工作流程中,并描述了一种计算策略,以最小化因靶向搜索空间中潜在的搜索空间不完整性而导致的肽段鉴定损失。我们使用几个不同大小和样品复杂度的数据集展示了我们工作流程的性能。

相似文献

引用本文的文献

本文引用的文献

7
Ensembl 2014.Ensembl 2014.
Nucleic Acids Res. 2014 Jan;42(Database issue):D749-55. doi: 10.1093/nar/gkt1196. Epub 2013 Dec 6.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验