• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

肽质量指纹图谱峰强度预测:从光谱中提取知识。

Peptide mass fingerprinting peak intensity prediction: extracting knowledge from spectra.

作者信息

Gay Steven, Binz Pierre-Alain, Hochstrasser Denis F, Appel Ron D

机构信息

Swiss Institute of Bioinformatics, Geneva, Switzerland.

出版信息

Proteomics. 2002 Oct;2(10):1374-91. doi: 10.1002/1615-9861(200210)2:10<1374::AID-PROT1374>3.0.CO;2-D.

DOI:10.1002/1615-9861(200210)2:10<1374::AID-PROT1374>3.0.CO;2-D
PMID:12422355
Abstract

Matrix-assisted laser desorption/ionization-time of flight mass spectrometry has become a valuable tool in proteomics. With the increasing acquisition rate of mass spectrometers, one of the major issues is the development of accurate, efficient and automatic peptide mass fingerprinting (PMF) identification tools. Current tools are mostly based on counting the number of experimental peptide masses matching with theoretical masses. Almost all of them use additional criteria such as isoelectric point, molecular weight, PTMs, taxonomy or enzymatic cleavage rules to enhance prediction performance. However, these identification tools seldom use peak intensities as parameter as there is currently no model predicting the intensities based on the physicochemical properties of peptides. In this work, we used standard datamining methods such as classification and regression methods to find correlations between peak intensities and the properties of the peptides composing a PMF spectrum. These methods were applied on a dataset comprising a series of PMF experiments involving 157 proteins. We found that the C4.5 method gave the more informative results for the classification task (prediction of the presence or absence of a peptide in a spectra) and M5' for the regression methods (prediction of the normalized intensity of a peptide peak). The C4.5 result correctly classified 88% of the theoretical peaks; whereas the M5' peak intensities had a correlation coefficient of 0.6743 with the experimental peak intensities. These methods enabled us to obtain decision and model trees that can be directly used for prediction and identification of PMF results. The work performed permitted to lay the foundations of a method to analyze factors influencing the peak intensity of PMF spectra. A simple extension of this analysis could lead to improve the accuracy of the results by using a larger dataset. Additional peptide characteristics or even PMF experimental parameters can also be taken into account in the datamining process to analyze their influence on the peak intensity. Furthermore, this datamining approach can certainly be extended to the tandem mass spectrometry domain or other mass spectrometry derived methods.

摘要

基质辅助激光解吸/电离飞行时间质谱已成为蛋白质组学中的一种重要工具。随着质谱仪采集速率的提高,主要问题之一是开发准确、高效且自动的肽质量指纹图谱(PMF)鉴定工具。当前的工具大多基于计算与理论质量匹配的实验肽质量的数量。几乎所有工具都使用诸如等电点、分子量、翻译后修饰、分类学或酶切规则等附加标准来提高预测性能。然而,这些鉴定工具很少将峰强度用作参数,因为目前尚无基于肽的物理化学性质预测强度的模型。在这项工作中,我们使用了分类和回归方法等标准数据挖掘方法来寻找峰强度与构成PMF谱的肽的性质之间的相关性。这些方法应用于一个包含一系列涉及157种蛋白质的PMF实验的数据集。我们发现,C4.5方法在分类任务(预测谱中肽的存在与否)中给出了更具信息性的结果,而M5'在回归方法(预测肽峰的归一化强度)中表现更佳。C4.5的结果正确分类了88%的理论峰;而M5'的峰强度与实验峰强度的相关系数为0.6743。这些方法使我们能够获得可直接用于预测和鉴定PMF结果的决策树和模型树。所开展的工作为分析影响PMF谱峰强度的因素奠定了方法基础。通过使用更大的数据集,对该分析进行简单扩展可能会提高结果的准确性。在数据挖掘过程中还可以考虑其他肽特征甚至PMF实验参数,以分析它们对峰强度的影响。此外,这种数据挖掘方法肯定可以扩展到串联质谱领域或其他质谱衍生方法。

相似文献

1
Peptide mass fingerprinting peak intensity prediction: extracting knowledge from spectra.肽质量指纹图谱峰强度预测:从光谱中提取知识。
Proteomics. 2002 Oct;2(10):1374-91. doi: 10.1002/1615-9861(200210)2:10<1374::AID-PROT1374>3.0.CO;2-D.
2
Molecular scanner experiment with human plasma: improving protein identification by using intensity distributions of matching peptide masses.人体血浆分子扫描实验:利用匹配肽质量的强度分布改进蛋白质鉴定
Proteomics. 2002 Oct;2(10):1413-25. doi: 10.1002/1615-9861(200210)2:10<1413::AID-PROT1413>3.0.CO;2-P.
3
Enhanced peptide mass fingerprinting through high mass accuracy: Exclusion of non-peptide signals based on residual mass.通过高质量精度增强肽质量指纹图谱:基于剩余质量排除非肽信号。
J Proteome Res. 2006 May;5(5):1195-203. doi: 10.1021/pr050486o.
4
Development and assessment of scoring functions for protein identification using PMF data.利用肽质量指纹图谱数据进行蛋白质鉴定的评分函数的开发与评估。
Electrophoresis. 2007 Mar;28(5):864-70. doi: 10.1002/elps.200600305.
5
Precalibration of matrix-assisted laser desorption/ionization-time of flight spectra for peptide mass fingerprinting.用于肽质量指纹图谱分析的基质辅助激光解吸/电离飞行时间光谱的预校准
Proteomics. 2002 Oct;2(10):1365-73. doi: 10.1002/1615-9861(200210)2:10<1365::AID-PROT1365>3.0.CO;2-9.
6
Peak intensity prediction in MALDI-TOF mass spectrometry: a machine learning study to support quantitative proteomics.基质辅助激光解吸电离飞行时间质谱中的峰强度预测:一项支持定量蛋白质组学的机器学习研究。
BMC Bioinformatics. 2008 Oct 20;9:443. doi: 10.1186/1471-2105-9-443.
7
Feature-matching pattern-based support vector machines for robust peptide mass fingerprinting.基于特征匹配模式的支持向量机在肽质量指纹图谱分析中的稳健性
Mol Cell Proteomics. 2011 Dec;10(12):M110.005785. doi: 10.1074/mcp.M110.005785. Epub 2011 Jul 20.
8
Protein identification and tracking in two-dimensional electrophoretic gels by minimal protein identifiers.利用最小蛋白质标识符在二维电泳凝胶中进行蛋白质鉴定与追踪
Proteomics. 2004 Oct;4(10):2927-41. doi: 10.1002/pmic.200400908.
9
Improvement of an in-gel tryptic digestion method for matrix-assisted laser desorption/ionization-time of flight mass spectrometry peptide mapping by use of volatile solubilizing agents.利用挥发性增溶剂改进用于基质辅助激光解吸/电离飞行时间质谱肽图谱分析的凝胶内胰蛋白酶消化方法。
Proteomics. 2001 Nov;1(11):1364-7. doi: 10.1002/1615-9861(200111)1:11<1364::AID-PROT1364>3.0.CO;2-H.
10
Peptide mass fingerprinting: protein identification using MALDI-TOF mass spectrometry.肽质量指纹图谱:使用基质辅助激光解吸电离飞行时间质谱法进行蛋白质鉴定。
Methods Mol Biol. 2005;310:227-40. doi: 10.1007/978-1-59259-948-6_16.

引用本文的文献

1
Cancer pharmacoinformatics: Databases and analytical tools.癌症药物信息学:数据库和分析工具。
Funct Integr Genomics. 2024 Sep 19;24(5):166. doi: 10.1007/s10142-024-01445-5.
2
Machine Learning on Signal-to-Noise Ratios Improves Peptide Array Design in SAMDI Mass Spectrometry.基于信噪比的机器学习可提高 SAMDI 质谱法中肽阵列设计的质量。
Anal Chem. 2017 Sep 5;89(17):9039-9047. doi: 10.1021/acs.analchem.7b01728. Epub 2017 Aug 7.
3
Power Normalization for Mass Spectrometry Data Analysis and Analytical Method Assessment.用于质谱数据分析和分析方法评估的功率归一化
Anal Chem. 2016 Mar 15;88(6):3156-63. doi: 10.1021/acs.analchem.5b04418. Epub 2016 Feb 24.
4
High molecular mass proteomics analyses of left ventricle from rats subjected to differential swimming training.对接受不同游泳训练的大鼠左心室进行的高分子量蛋白质组学分析。
BMC Physiol. 2012 Sep 5;12:11. doi: 10.1186/1472-6793-12-11.
5
Label-free protein quantitation using weighted spectral counting.使用加权光谱计数法进行无标记蛋白质定量分析。
Methods Mol Biol. 2012;893:321-41. doi: 10.1007/978-1-61779-885-6_20.
6
CONSeQuence: prediction of reference peptides for absolute quantitative proteomics using consensus machine learning approaches.CONSeQuence:使用共识机器学习方法预测绝对定量蛋白质组学的参考肽。
Mol Cell Proteomics. 2011 Nov;10(11):M110.003384. doi: 10.1074/mcp.M110.003384. Epub 2011 Aug 3.
7
Advances in structure elucidation of small molecules using mass spectrometry.利用质谱法进行小分子结构解析的进展
Bioanal Rev. 2010 Dec;2(1-4):23-60. doi: 10.1007/s12566-010-0015-9. Epub 2010 Aug 21.
8
Bioinformatics methods for learning radiation-induced lung inflammation from heterogeneous retrospective and prospective data.用于从异质性回顾性和前瞻性数据中学习辐射诱导的肺部炎症的生物信息学方法。
J Biomed Biotechnol. 2009;2009:892863. doi: 10.1155/2009/892863. Epub 2009 May 28.
9
Blind search for post-translational modifications and amino acid substitutions using peptide mass fingerprints from two proteases.利用两种蛋白酶的肽质量指纹图谱对翻译后修饰和氨基酸取代进行盲目搜索。
BMC Res Notes. 2008 Dec 19;1:130. doi: 10.1186/1756-0500-1-130.
10
Peak intensity prediction in MALDI-TOF mass spectrometry: a machine learning study to support quantitative proteomics.基质辅助激光解吸电离飞行时间质谱中的峰强度预测:一项支持定量蛋白质组学的机器学习研究。
BMC Bioinformatics. 2008 Oct 20;9:443. doi: 10.1186/1471-2105-9-443.