• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在实验装置水平上应用质谱法对可观测肽段的预测。

Prediction of peptides observable by mass spectrometry applied at the experimental set level.

作者信息

Sanders William S, Bridges Susan M, McCarthy Fiona M, Nanduri Bindu, Burgess Shane C

机构信息

Department of Biochemistry & Molecular Biology, Mississippi State University, MS, USA.

出版信息

BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S23. doi: 10.1186/1471-2105-8-S7-S23.

DOI:10.1186/1471-2105-8-S7-S23
PMID:18047723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2099492/
Abstract

BACKGROUND

When proteins are subjected to proteolytic digestion and analyzed by mass spectrometry using a method such as 2D LC MS/MS, only a portion of the proteotypic peptides associated with each protein will be observed. The ability to predict which peptides can and cannot potentially be observed for a particular experimental dataset has several important applications in proteomics research including calculation of peptide coverage in terms of potentially detectable peptides, systems biology analysis of data sets, and protein quantification.

RESULTS

We have developed a methodology for constructing artificial neural networks that can be used to predict which peptides are potentially observable for a given set of experimental, instrumental, and analytical conditions for 2D LC MS/MS (a.k.a Multidimensional Protein Identification Technology [MudPIT]) datasets. Neural network classifiers constructed using this procedure for two MudPIT datasets exhibit 10-fold cross validation accuracy of about 80%. We show that a classifier constructed for one dataset has poor predictive performance with the other dataset, thus demonstrating the need for dataset specific classifiers. Classification results with each dataset are used to compute informative percent amino acid coverage statistics for each protein in terms of the predicted detectable peptides in addition to the percent coverage of the complete sequence. We also demonstrate the utility of predicted peptide observability for systems analysis to help determine if proteins that were expected but not observed generate sufficient peptides for detection.

CONCLUSION

Classifiers that accurately predict the likelihood of detecting proteotypic peptides by mass spectrometry provide proteomics researchers with powerful new approaches for data analysis. We demonstrate that the procedure we have developed for building a classifier based on an individual experimental data set results in classifiers with accuracy comparable to those reported in the literature based on large training sets collected from multiple experiments. Our approach allows the researcher to construct a classifier that is specific for the experimental, instrument, and analytical conditions of a single experiment and amenable to local, condition-specific, implementation. The resulting classifiers have application in a number of areas such as determination of peptide coverage for protein identification, pathway analysis, and protein quantification.

摘要

背景

当蛋白质进行蛋白酶解消化并使用二维液相色谱串联质谱法(2D LC MS/MS)等方法进行质谱分析时,与每种蛋白质相关的蛋白型肽段中只有一部分会被观察到。预测特定实验数据集可能观察到和无法观察到哪些肽段的能力在蛋白质组学研究中有几个重要应用,包括根据潜在可检测肽段计算肽段覆盖率、数据集的系统生物学分析以及蛋白质定量。

结果

我们开发了一种构建人工神经网络的方法,可用于预测在二维液相色谱串联质谱法(又称多维蛋白质鉴定技术 [MudPIT])数据集的给定实验、仪器和分析条件下哪些肽段可能被观察到。使用该程序为两个 MudPIT 数据集构建的神经网络分类器在 10 倍交叉验证中的准确率约为 80%。我们表明,为一个数据集构建的分类器对另一个数据集的预测性能较差,从而证明了需要特定于数据集的分类器。除了完整序列的覆盖率百分比外,每个数据集的分类结果还用于根据预测的可检测肽段计算每种蛋白质的信息性氨基酸覆盖率统计数据。我们还证明了预测肽段可观察性在系统分析中的实用性,以帮助确定预期但未观察到的蛋白质是否产生足够的肽段用于检测。

结论

准确预测通过质谱检测蛋白型肽段可能性的分类器为蛋白质组学研究人员提供了强大的新数据分析方法。我们证明,我们基于单个实验数据集构建分类器的程序所得到的分类器,其准确率与基于从多个实验收集的大型训练集的文献报道相当。我们的方法允许研究人员构建一个特定于单个实验的实验、仪器和分析条件的分类器,并且适合于本地、特定条件的实施。所得分类器在许多领域都有应用,如用于蛋白质鉴定的肽段覆盖率测定、通路分析和蛋白质定量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f32/2099492/cfe0a1ef7638/1471-2105-8-S7-S23-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f32/2099492/cfe0a1ef7638/1471-2105-8-S7-S23-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f32/2099492/cfe0a1ef7638/1471-2105-8-S7-S23-1.jpg

相似文献

1
Prediction of peptides observable by mass spectrometry applied at the experimental set level.在实验装置水平上应用质谱法对可观测肽段的预测。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S23. doi: 10.1186/1471-2105-8-S7-S23.
2
ProtQuant: a tool for the label-free quantification of MudPIT proteomics data.ProtQuant:一种用于MudPIT蛋白质组学数据无标记定量的工具。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S24. doi: 10.1186/1471-2105-8-S7-S24.
3
Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data.使用基于排序的算法和特定生物体数据改进靶向蛋白质组学中肽段可检测性的预测。
J Proteomics. 2014 Aug 28;108:269-83. doi: 10.1016/j.jprot.2014.05.011. Epub 2014 May 27.
4
Chromatographic alignment of LC-MS and LC-MS/MS datasets by genetic algorithm feature extraction.通过遗传算法特征提取实现液相色谱-质谱联用(LC-MS)和液相色谱-串联质谱联用(LC-MS/MS)数据集的色谱对齐。
J Am Soc Mass Spectrom. 2007 Oct;18(10):1835-43. doi: 10.1016/j.jasms.2007.07.018. Epub 2007 Jul 26.
5
Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification.使用动态贝叶斯网络对肽段进行建模以用于肽段鉴定
Bioinformatics. 2008 Jul 1;24(13):i348-56. doi: 10.1093/bioinformatics/btn189.
6
Computational prediction of proteotypic peptides for quantitative proteomics.用于定量蛋白质组学的蛋白型肽段的计算预测
Nat Biotechnol. 2007 Jan;25(1):125-31. doi: 10.1038/nbt1275. Epub 2006 Dec 31.
7
Feature selection in validating mass spectrometry database search results.验证质谱数据库搜索结果中的特征选择。
J Bioinform Comput Biol. 2008 Feb;6(1):223-40. doi: 10.1142/s0219720008003345.
8
Proteomic mass spectra classification using decision tree based ensemble methods.使用基于决策树的集成方法进行蛋白质组质谱分类。
Bioinformatics. 2005 Jul 15;21(14):3138-45. doi: 10.1093/bioinformatics/bti494. Epub 2005 May 12.
9
Phosphoproteomics by mass spectrometry and classical protein chemistry approaches.基于质谱和经典蛋白质化学方法的磷酸化蛋白质组学
Mass Spectrom Rev. 2005 Nov-Dec;24(6):828-46. doi: 10.1002/mas.20042.
10
Robust accurate identification of peptides (RAId): deciphering MS2 data using a structured library search with de novo based statistics.肽段的稳健准确鉴定(RAId):使用基于从头统计的结构化库搜索来解析MS2数据。
Bioinformatics. 2005 Oct 1;21(19):3726-32. doi: 10.1093/bioinformatics/bti620. Epub 2005 Aug 16.

引用本文的文献

1
DbyDeep: Exploration of MS-Detectable Peptides via Deep Learning.DbyDeep:基于深度学习的 MS 可检测肽的探索。
Anal Chem. 2023 Aug 1;95(30):11193-11200. doi: 10.1021/acs.analchem.3c00460. Epub 2023 Jul 17.
2
Near atmospheric carbon dioxide activates plant ubiquitin cross-linking.接近大气水平的二氧化碳会激活植物泛素交联。
BBA Adv. 2023 Jun 17;4:100096. doi: 10.1016/j.bbadva.2023.100096. eCollection 2023.
3
Toward an Integrated Machine Learning Model of a Proteomics Experiment.迈向蛋白质组学实验的集成机器学习模型。

本文引用的文献

1
Modeling the proteome of a Marek's disease transformed cell line: a natural animal model for CD30 overexpressing lymphomas.构建马立克氏病转化细胞系的蛋白质组模型:一种过表达CD30的淋巴瘤天然动物模型。
Proteomics. 2007 Apr;7(8):1316-26. doi: 10.1002/pmic.200600946.
2
Computational prediction of proteotypic peptides for quantitative proteomics.用于定量蛋白质组学的蛋白型肽段的计算预测
Nat Biotechnol. 2007 Jan;25(1):125-31. doi: 10.1038/nbt1275. Epub 2006 Dec 31.
3
Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation.
J Proteome Res. 2023 Mar 3;22(3):681-696. doi: 10.1021/acs.jproteome.2c00711. Epub 2023 Feb 6.
4
Reducing Peptide Sequence Bias in Quantitative Mass Spectrometry Data with Machine Learning.用机器学习减少定量质谱数据中的肽序列偏差。
J Proteome Res. 2022 Jul 1;21(7):1771-1782. doi: 10.1021/acs.jproteome.2c00211. Epub 2022 Jun 13.
5
Purple: A Computational Workflow for Strategic Selection of Peptides for Viral Diagnostics Using MS-Based Targeted Proteomics.紫色:一种基于 MS 的靶向蛋白质组学的病毒诊断中肽段的战略选择的计算工作流程。
Viruses. 2019 Jun 8;11(6):536. doi: 10.3390/v11060536.
6
DeepPep: Deep proteome inference from peptide profiles.DeepPep:基于肽谱的深度蛋白质组推断。
PLoS Comput Biol. 2017 Sep 5;13(9):e1005661. doi: 10.1371/journal.pcbi.1005661. eCollection 2017 Sep.
7
Food allergen detection by mass spectrometry: the role of systems biology.基于质谱法的食物过敏原检测:系统生物学的作用。
NPJ Syst Biol Appl. 2016 Sep 29;2:16022. doi: 10.1038/npjsba.2016.22. eCollection 2016.
8
Machine Learning on Signal-to-Noise Ratios Improves Peptide Array Design in SAMDI Mass Spectrometry.基于信噪比的机器学习可提高 SAMDI 质谱法中肽阵列设计的质量。
Anal Chem. 2017 Sep 5;89(17):9039-9047. doi: 10.1021/acs.analchem.7b01728. Epub 2017 Aug 7.
9
Prediction of Hopeless Peptides Unlikely to be Selected for Targeted Proteome Analysis.预测不太可能被选用于靶向蛋白质组分析的无希望肽段。
Mass Spectrom (Tokyo). 2017;6(1):A0056. doi: 10.5702/massspectrometry.A0056. Epub 2017 Jun 2.
10
Recommendations for the Generation, Quantification, Storage, and Handling of Peptides Used for Mass Spectrometry-Based Assays.基于质谱分析的肽段生成、定量、储存及处理建议
Clin Chem. 2016 Jan;62(1):48-69. doi: 10.1373/clinchem.2015.250563.
绝对蛋白质表达谱分析可估计转录调控和翻译调控的相对贡献。
Nat Biotechnol. 2007 Jan;25(1):117-24. doi: 10.1038/nbt1270. Epub 2006 Dec 24.
4
Detecting differential and correlated protein expression in label-free shotgun proteomics.在无标记鸟枪法蛋白质组学中检测差异和相关蛋白质表达
J Proteome Res. 2006 Nov;5(11):2909-18. doi: 10.1021/pr0600273.
5
Modeling a whole organ using proteomics: the avian bursa of Fabricius.利用蛋白质组学对整个器官进行建模:法氏囊。
Proteomics. 2006 May;6(9):2759-71. doi: 10.1002/pmic.200500648.
6
Effects of subminimum inhibitory concentrations of antibiotics on the Pasteurella multocida proteome.低于最低抑菌浓度的抗生素对多杀巴斯德菌蛋白质组的影响。
J Proteome Res. 2006 Mar;5(3):572-80. doi: 10.1021/pr050360r.
7
Comprehensive label-free method for the relative quantification of proteins from biological samples.用于生物样品蛋白质相对定量的无标记综合方法。
J Proteome Res. 2005 Jul-Aug;4(4):1442-50. doi: 10.1021/pr050109b.
8
Scoring proteomes with proteotypic peptide probes.使用蛋白质型肽探针进行蛋白质组评分。
Nat Rev Mol Cell Biol. 2005 Jul;6(7):577-83. doi: 10.1038/nrm1683.
9
Marek's disease is a natural model for lymphomas overexpressing Hodgkin's disease antigen (CD30).马立克氏病是一种淋巴瘤的天然模型,这些淋巴瘤过度表达霍奇金病抗原(CD30)。
Proc Natl Acad Sci U S A. 2004 Sep 21;101(38):13879-84. doi: 10.1073/pnas.0305789101. Epub 2004 Sep 8.
10
What to do with "one-hit wonders"?
Electrophoresis. 2004 May;25(9):1278-9. doi: 10.1002/elps.200490007.