Insight on physicochemical properties governing peptide MS1 response in HPLC-ESI-MS/MS: A deep learning approach.

Authors

Abdul-Khalek Naim, Wimmer Reinhard, Overgaard Michael Toft, Gregersen Echers Simon

Affiliation

Department of Chemistry and Bioscience, Aalborg University, Aalborg 9220, Denmark.

Publication information

Comput Struct Biotechnol J. 2023 Jul 22;21:3715-3727. doi: 10.1016/j.csbj.2023.07.027. eCollection 2023.

DOI:10.1016/j.csbj.2023.07.027

PMID:37560124

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10407266/

Abstract

Accurate, absolute quantification of peptides in complex mixtures by quantitative mass spectrometry (MS) requires prior knowledge and isotopically labeled standards. This increases analytical cost, time, and labor, and limits the number of peptides that can be accurately quantified. The limitation originates from differences in ionization efficiency between peptides, so understanding the physicochemical properties that influence ionization and response in MS analysis is essential for developing less restrictive label-free quantitative methods. Here, we used equimolar peptide pool repository data to develop a deep learning model capable of identifying amino acids that influence the MS1 response. By using an encoder-decoder with an attention mechanism and correlating attention weights with amino acid physicochemical properties, we obtained insight into the properties governing the peptide-level MS1 response within the datasets. Although the problem cannot be described by a single set of amino acids and properties, distinct patterns were obtained reproducibly. The properties fall into three main categories, related to peptide hydrophobicity, charge, and structural propensities. Moreover, our model can predict MS1 intensity output under defined conditions based solely on peptide sequence input. Using a refined training dataset, the model predicted log-transformed peptide MS1 intensities with an average error of 9.7 ± 0.5% based on 5-fold cross-validation, and outperformed random forest and ridge regression models on both log-transformed and real-scale data. This work demonstrates how deep learning can facilitate identification of physicochemical properties influencing peptide MS1 responses, and also illustrates how sequence-based response prediction and label-free peptide-level quantification may impact future workflows in quantitative proteomics.
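The idea of correlating attention weights with amino acid properties can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which correlation measure or property scales were used, so Pearson correlation and the Kyte-Doolittle hydropathy scale are assumptions here, the function names are hypothetical, and the peptides and attention weights are toy placeholders rather than model output.

```python
import math

# Kyte-Doolittle hydropathy index (published scale); chosen here only as an
# example property, the abstract does not name the scales the authors used.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_attention_per_residue(peptides, attention):
    """Average the attention weight each amino acid receives over all peptides.

    peptides:  list of sequences in one-letter code
    attention: list of per-position weight lists, one weight per residue
    """
    totals, counts = {}, {}
    for seq, weights in zip(peptides, attention):
        if len(seq) != len(weights):
            raise ValueError("one attention weight per residue is required")
        for aa, w in zip(seq, weights):
            totals[aa] = totals.get(aa, 0.0) + w
            counts[aa] = counts.get(aa, 0) + 1
    return {aa: totals[aa] / counts[aa] for aa in totals}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def correlate_with_property(mean_attention, scale):
    """Correlate per-residue mean attention with a per-residue property scale."""
    residues = sorted(aa for aa in mean_attention if aa in scale)
    return pearson(
        [mean_attention[aa] for aa in residues],
        [scale[aa] for aa in residues],
    )


# Toy placeholder input, NOT data from the study.
peptides = ["LIVK", "DEGR", "AFWK"]
attention = [
    [0.4, 0.3, 0.2, 0.1],
    [0.1, 0.1, 0.3, 0.5],
    [0.3, 0.3, 0.2, 0.2],
]

per_residue = mean_attention_per_residue(peptides, attention)
r = correlate_with_property(per_residue, KYTE_DOOLITTLE)
print(f"Pearson r between mean attention and hydropathy: {r:.3f}")
```

In practice the same correlation would be repeated over many property scales, and the scales with the strongest correlations would then be grouped, which is the kind of analysis that could lead to categories such as hydrophobicity, charge, and structural propensity.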

Graphical abstract: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e7a/10407266/7472178c58d3/ga1.jpg

Similar articles

Decoding the impact of neighboring amino acids on ESI-MS intensity output through deep learning.

J Proteomics. 2024 Oct 30;309:105322. doi: 10.1016/j.jprot.2024.105322. Epub 2024 Sep 26.

Platform-independent and label-free quantitation of proteomic data using MS1 extracted ion chromatograms in skyline: application to protein acetylation and phosphorylation.

Mol Cell Proteomics. 2012 May;11(5):202-14. doi: 10.1074/mcp.M112.017707. Epub 2012 Mar 26.

Tutorial: Correction of shifts in single-stage LC-MS(/MS) data.

Anal Chim Acta. 2018 Jan 25;999:37-53. doi: 10.1016/j.aca.2017.09.039. Epub 2017 Nov 2.

MS1 Peptide Ion Intensity Chromatograms in MS2 (SWATH) Data Independent Acquisitions. Improving Post Acquisition Analysis of Proteomic Experiments.

Mol Cell Proteomics. 2015 Sep;14(9):2405-19. doi: 10.1074/mcp.O115.048181. Epub 2015 May 17.

Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning.

Mol Cell Proteomics. 2019 Oct;18(10):2099-2107. doi: 10.1074/mcp.TIR119.001412. Epub 2019 Jun 27.

Analysis of Intrinsic Peptide Detectability via Integrated Label-Free and SRM-Based Absolute Quantitative Proteomics.

J Proteome Res. 2016 Sep 2;15(9):2945-59. doi: 10.1021/acs.jproteome.6b00048. Epub 2016 Aug 8.

Predicting Peptide Ionization Efficiencies for Electrospray Ionization Mass Spectrometry Using Machine Learning.

J Am Soc Mass Spectrom. 2024 Oct 2;35(10):2297-2307. doi: 10.1021/jasms.4c00137. Epub 2024 Sep 9.

Definition and characterization of a "trypsinosome" from specific peptide characteristics by nano-HPLC-MS/MS and in silico analysis of complex protein mixtures.

J Proteome Res. 2004 Nov-Dec;3(6):1138-48. doi: 10.1021/pr049909x.

A rapid and sensitive single-cell proteomic method based on fast liquid-chromatography separation, retention time prediction and MS1-only acquisition.

Anal Chim Acta. 2023 Apr 22;1251:341038. doi: 10.1016/j.aca.2023.341038. Epub 2023 Mar 2.

Cited by

Proteomics and bioinformatics guided discovery of microalgal multifunctional peptides for novel nutraceutical applications.

Bioprocess Biosyst Eng. 2025 Jun 25. doi: 10.1007/s00449-025-03192-8.

To Fly, or Not to Fly, That Is the Question: A Deep Learning Model for Peptide Detectability Prediction in Mass Spectrometry.

J Proteome Res. 2025 Jun 6;24(6):2709-2726. doi: 10.1021/acs.jproteome.4c00973. Epub 2025 May 9.

Rescoring Peptide Spectrum Matches: Boosting Proteomics Performance by Integrating Peptide Property Predictors Into Peptide Identification.

Mol Cell Proteomics. 2024 Jul;23(7):100798. doi: 10.1016/j.mcpro.2024.100798. Epub 2024 Jun 11.

Variability analysis of LC-MS experimental factors and their impact on machine learning.

Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad096. Epub 2023 Nov 20.
