Assigning statistical significance to proteotypic peptides via database searches.

Affiliation

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.

Publication information

J Proteomics. 2011 Feb 1;74(2):199-211. doi: 10.1016/j.jprot.2010.10.005. Epub 2010 Nov 3.

Abstract

Querying MS/MS spectra against a database containing only proteotypic peptides reduces data analysis time due to reduction of database size. Despite the speed advantage, this search strategy is challenged by issues of statistical significance and coverage. The former requires separating systematically significant identifications from less confident identifications, while the latter arises when the underlying peptide is not present, due to single amino acid polymorphisms (SAPs) or post-translational modifications (PTMs), in the proteotypic peptide libraries searched. To address both issues simultaneously, we have extended RAId's knowledge database to include proteotypic information, utilized RAId's statistical strategy to assign statistical significance to proteotypic peptides, and modified RAId's programs to allow for consideration of proteotypic information during database searches. The extended database alleviates the coverage problem since all annotated modifications, even those that occurred within proteotypic peptides, may be considered. Taking into account the likelihoods of observation, the statistical strategy of RAId provides accurate E-value assignments regardless of whether a candidate peptide is proteotypic or not. The advantage of including proteotypic information is evidenced by its superior retrieval performance when compared to regular database searches.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/384b/3186061/7156f54f7754/nihms-258139-f0001.jpg
