Suppr超能文献

串联质谱中肽鉴定的次模广义匹配。

Submodular Generalized Matching for Peptide Identification in Tandem Mass Spectrometry.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1168-1181. doi: 10.1109/TCBB.2018.2822280. Epub 2018 Apr 2.

Abstract

Identification of spectra produced by a shotgun proteomics mass spectrometry experiment is commonly performed by searching the observed spectra against a peptide database. The heart of this search procedure is a score function that evaluates the quality of a hypothesized match between an observed spectrum and a theoretical spectrum corresponding to a particular peptide sequence. Accordingly, the success of a spectrum analysis pipeline depends critically upon this peptide-spectrum score function. We develop peptide-spectrum score functions that compute the maximum value of a submodular function under $m$ m matroid constraints. We call this procedure a submodular generalized matching (SGM) since it generalizes bipartite matching. We use a greedy algorithm to compute maximization, which can achieve a solution whose objective is guaranteed to be at least $\frac{1}{1+m}$ 1 1 + m of the true optimum. The advantage of the SGM framework is that known long-range properties of experimental spectra can be modeled by designing suitable submodular functions and matroid constraints. Experiments on four data sets from various organisms and mass spectrometry platforms show that the SGM approach leads to significantly improved performance compared to several state-of-the-art methods. Supplementary information, C++ source code, and data sets can be found at https://melodi-lab.github.io/SGM.

摘要

鉴定 shotgun 蛋白质组学质谱实验产生的谱通常是通过将观察到的谱与肽数据库进行搜索来完成的。该搜索过程的核心是一个评分函数,用于评估观测谱与对应特定肽序列的理论谱之间假设匹配的质量。因此,谱分析管道的成功与否取决于该肽谱评分函数。我们开发了肽谱评分函数,该函数计算了在 m 个拟阵约束下子模函数的最大值。我们称这个过程为子模广义匹配(SGM),因为它推广了二部匹配。我们使用贪婪算法来计算最大化问题,其解决方案的目标至少为真实最优值的 1 1 + m 1 1 + m 。SGM 框架的优势在于,可以通过设计合适的子模函数和拟阵约束来模拟实验谱的已知远程特性。来自不同生物体和质谱平台的四个数据集的实验表明,与几种最先进的方法相比,SGM 方法显著提高了性能。补充信息、C++源代码和数据集可在 https://melodi-lab.github.io/SGM 找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdf8/8641787/905480fd1824/nihms-1537345-f0001.jpg

相似文献

1
Submodular Generalized Matching for Peptide Identification in Tandem Mass Spectrometry.串联质谱中肽鉴定的次模广义匹配。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1168-1181. doi: 10.1109/TCBB.2018.2822280. Epub 2018 Apr 2.
7

本文引用的文献

6
Variation and genetic control of protein abundance in humans.人类蛋白质丰度的变化和遗传控制。
Nature. 2013 Jul 4;499(7456):79-82. doi: 10.1038/nature12223. Epub 2013 May 15.
10

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验