• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProbPS:一种基于量化衍生峰存在与母离子强度关系的峰选择新模型。

ProbPS: a new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity.

机构信息

Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.

出版信息

BMC Bioinformatics. 2011 Aug 17;12:346. doi: 10.1186/1471-2105-12-346.

DOI:10.1186/1471-2105-12-346
PMID:21849060
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3179969/
Abstract

BACKGROUND

The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error.

RESULTS

In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving de novo identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra).

CONCLUSIONS

We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results.

摘要

背景

质谱分析表明,衍生峰的存在强烈依赖于主峰的强度。串联质谱中的峰选择用于滤除噪声和杂质峰。人们普遍认为,有效的主峰往往具有高强度,并伴有衍生峰,包括同位素峰、中性丢失峰和互补峰。现有的峰选择模型忽略了衍生峰的存在与主峰强度之间的依赖性。简单的峰选择模型假设这两个属性是独立的;然而,这种假设与实际数据相悖,容易出错。

结果

在本文中,我们提出了一个统计模型来定量测量衍生峰的存在对主峰强度的依赖性。在这里,我们提出了一个名为 ProbPS 的统计模型,以定量捕捉依赖性,并描述一个用于峰选择的统计模型。我们的结果表明,定量理解可以成功地指导峰选择过程。通过将 ProbPS 与 AuDeNS 进行比较,我们证明了我们的方法在滤除噪声峰和提高从头鉴定方面的优势。此外,我们还提出了一种基于我们的峰选择方法的标签鉴定方法。我们使用测试数据集的结果表明,我们的标签鉴定方法(在 1000 个光谱中有 876 个正确标签)优于 PepNovoTag(在 1000 个光谱中有 790 个正确标签)。

结论

我们已经表明,ProbPS 提高了峰选择的准确性,从而进一步提高了从头测序和标签鉴定的性能。因此,我们的模型节省了宝贵的计算时间,并提高了结果的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/b99b7bdd04e2/1471-2105-12-346-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/99a582c61ff9/1471-2105-12-346-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/0e2baee9af01/1471-2105-12-346-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/0582c4d933ac/1471-2105-12-346-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/8aa5304569bc/1471-2105-12-346-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/b343c2b8da89/1471-2105-12-346-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/d865b6a5a66d/1471-2105-12-346-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/326c88579233/1471-2105-12-346-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/44454f8c9487/1471-2105-12-346-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/447b8e84b5ae/1471-2105-12-346-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/9437dfebd8b2/1471-2105-12-346-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/fd12085abf6b/1471-2105-12-346-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/2fad9dcb480b/1471-2105-12-346-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/3a26d3dc29c4/1471-2105-12-346-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/aa6a571cee2c/1471-2105-12-346-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/b99b7bdd04e2/1471-2105-12-346-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/99a582c61ff9/1471-2105-12-346-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/0e2baee9af01/1471-2105-12-346-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/0582c4d933ac/1471-2105-12-346-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/8aa5304569bc/1471-2105-12-346-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/b343c2b8da89/1471-2105-12-346-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/d865b6a5a66d/1471-2105-12-346-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/326c88579233/1471-2105-12-346-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/44454f8c9487/1471-2105-12-346-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/447b8e84b5ae/1471-2105-12-346-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/9437dfebd8b2/1471-2105-12-346-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/fd12085abf6b/1471-2105-12-346-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/2fad9dcb480b/1471-2105-12-346-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/3a26d3dc29c4/1471-2105-12-346-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/aa6a571cee2c/1471-2105-12-346-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae19/3179969/b99b7bdd04e2/1471-2105-12-346-15.jpg

相似文献

1
ProbPS: a new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity.ProbPS:一种基于量化衍生峰存在与母离子强度关系的峰选择新模型。
BMC Bioinformatics. 2011 Aug 17;12:346. doi: 10.1186/1471-2105-12-346.
2
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序:是否已经成为数据库驱动肽鉴定的准确替代方法?
Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.
3
Filtering of MS/MS data for peptide identification.用于肽段鉴定的MS/MS数据过滤
BMC Genomics. 2013;14 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-14-S7-S2. Epub 2013 Nov 5.
4
Performance evaluation of existing de novo sequencing algorithms.现有从头测序算法的性能评估。
J Proteome Res. 2006 Nov;5(11):3018-28. doi: 10.1021/pr060222h.
5
Precursor mass dependent filtering of mass spectra for proteomics analysis.用于蛋白质组学分析的质谱前体质量依赖性过滤
Protein Pept Lett. 2014;21(8):858-63. doi: 10.2174/09298665113209990065.
6
De novo peptide sequencing using ion peak intensity and amino acid cleavage intensity ratio.利用离子峰强度和氨基酸裂解强度比进行从头肽测序。
Bioinformatics. 2007 May 1;23(9):1068-72. doi: 10.1093/bioinformatics/btm062. Epub 2007 Mar 6.
7
Peptide mass fingerprinting peak intensity prediction: extracting knowledge from spectra.肽质量指纹图谱峰强度预测:从光谱中提取知识。
Proteomics. 2002 Oct;2(10):1374-91. doi: 10.1002/1615-9861(200210)2:10<1374::AID-PROT1374>3.0.CO;2-D.
8
De novo analysis of peptide tandem mass spectra by spectral graph partitioning.通过谱图划分对肽串联质谱进行从头分析。
J Comput Biol. 2006 Mar;13(2):364-78. doi: 10.1089/cmb.2006.13.364.
9
Determination of monoisotopic masses of chimera spectra from high-resolution mass spectrometric data by use of isotopic peak intensity ratio modeling.利用同位素峰强度比建模确定高分辨质谱数据中嵌合体谱的单同位素质量。
Rapid Commun Mass Spectrom. 2012 Aug 30;26(16):1875-86. doi: 10.1002/rcm.6293.
10
Isotopic peak intensity ratio based algorithm for determination of isotopic clusters and monoisotopic masses of polypeptides from high-resolution mass spectrometric data.基于同位素峰强度比的算法,用于从高分辨率质谱数据中确定多肽的同位素簇和单同位素质量。
Anal Chem. 2008 Oct 1;80(19):7294-303. doi: 10.1021/ac800913b. Epub 2008 Aug 28.

本文引用的文献

1
Fast algorithm for peptide sequencing by mass spectroscopy.用于肽段质谱测序的快速算法
Biomed Environ Mass Spectrom. 1990;19(6):363-8. doi: 10.1002/bms.1200190607.
2
PI: an open-source software package for validation of the SEQUEST result and visualization of mass spectrum.PI:一个用于 SEQUEST 结果验证和质谱可视化的开源软件包。
BMC Bioinformatics. 2011 Jun 15;12:234. doi: 10.1186/1471-2105-12-234.
3
pNovo: de novo peptide sequencing and identification using HCD spectra.pNovo:利用 HCD 谱进行从头多肽测序和鉴定。
J Proteome Res. 2010 May 7;9(5):2713-24. doi: 10.1021/pr100182k.
4
De novo sequencing of unique sequence tags for discovery of post-translational modifications of proteins.用于发现蛋白质翻译后修饰的独特序列标签的从头测序。
Anal Chem. 2008 Oct 15;80(20):7742-54. doi: 10.1021/ac801123p. Epub 2008 Sep 11.
5
DirecTag: accurate sequence tags from peptide MS/MS through statistical scoring.DirecTag:通过统计评分从肽段串联质谱中获得准确的序列标签。
J Proteome Res. 2008 Sep;7(9):3838-46. doi: 10.1021/pr800154p. Epub 2008 Jul 17.
6
Deriving the probabilities of water loss and ammonia loss for amino acids from tandem mass spectra.从串联质谱推导氨基酸失水和失氨的概率。
J Proteome Res. 2008 Jan;7(1):202-8. doi: 10.1021/pr070479v. Epub 2007 Dec 20.
7
SwedCAD, a database of annotated high-mass accuracy MS/MS spectra of tryptic peptides.SwedCAD,一个经注释的胰蛋白酶肽段高精度质谱/质谱图谱数据库。
J Proteome Res. 2007 Oct;6(10):4063-7. doi: 10.1021/pr070345h. Epub 2007 Aug 21.
8
MSNovo: a dynamic programming algorithm for de novo peptide sequencing via tandem mass spectrometry.MSNovo:一种通过串联质谱进行肽段从头测序的动态规划算法。
Anal Chem. 2007 Jul 1;79(13):4870-8. doi: 10.1021/ac070039n. Epub 2007 Jun 6.
9
De novo peptide identification via tandem mass spectrometry and integer linear optimization.通过串联质谱和整数线性优化进行从头肽段鉴定
Anal Chem. 2007 Feb 15;79(4):1433-46. doi: 10.1021/ac0618425.
10
De novo peptide sequencing and identification with precision mass spectrometry.利用精确质谱进行从头肽测序和鉴定。
J Proteome Res. 2007 Jan;6(1):114-23. doi: 10.1021/pr060271u.