Suppr超能文献

用机器学习减少定量质谱数据中的肽序列偏差。

Reducing Peptide Sequence Bias in Quantitative Mass Spectrometry Data with Machine Learning.

机构信息

Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, United States.

Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States.

出版信息

J Proteome Res. 2022 Jul 1;21(7):1771-1782. doi: 10.1021/acs.jproteome.2c00211. Epub 2022 Jun 13.

Abstract

Quantitative mass spectrometry measurements of peptides necessarily incorporate sequence-specific biases that reflect the behavior of the peptide during enzymatic digestion and liquid chromatography and in a mass spectrometer. These sequence-specific effects impair quantification accuracy, yielding peptide quantities that are systematically under- or overestimated. We provide empirical evidence for the existence of such biases, and we use a deep neural network, called Pepper, to automatically identify and reduce these biases. The model generalizes to new proteins and new runs within a related set of tandem mass spectrometry experiments, and the learned coefficients themselves reflect expected physicochemical properties of the corresponding peptide sequences. The resulting adjusted abundance measurements are more correlated with mRNA-based gene expression measurements than the unadjusted measurements. Pepper is suitable for data generated on a variety of mass spectrometry instruments and can be used with labeled or label-free approaches and with data-independent or data-dependent acquisition.

摘要

肽的定量质谱测量必然包含反映肽在酶解、液相色谱和质谱中行为的序列特异性偏差。这些序列特异性效应会损害定量准确性,导致肽的数量被系统地低估或高估。我们提供了存在这种偏差的经验证据,并使用称为 Pepper 的深度神经网络来自动识别和减少这些偏差。该模型可推广到新的蛋白质和同一组串联质谱实验中的新运行,并且学习到的系数本身反映了相应肽序列的预期物理化学性质。由此产生的调整后的丰度测量值与基于 mRNA 的基因表达测量值的相关性比未经调整的测量值更高。Pepper 适用于各种质谱仪器生成的数据,可以与标记或无标记方法以及数据独立或数据依赖的采集方法一起使用。

相似文献

7
DbyDeep: Exploration of MS-Detectable Peptides via Deep Learning.DbyDeep:基于深度学习的 MS 可检测肽的探索。
Anal Chem. 2023 Aug 1;95(30):11193-11200. doi: 10.1021/acs.analchem.3c00460. Epub 2023 Jul 17.
8
Deep learning neural network tools for proteomics.深度学习神经网络工具在蛋白质组学中的应用。
Cell Rep Methods. 2021 May 17;1(2):100003. doi: 10.1016/j.crmeth.2021.100003. eCollection 2021 Jun 21.

引用本文的文献

5
Toward an Integrated Machine Learning Model of a Proteomics Experiment.迈向蛋白质组学实验的集成机器学习模型。
J Proteome Res. 2023 Mar 3;22(3):681-696. doi: 10.1021/acs.jproteome.2c00711. Epub 2023 Feb 6.

本文引用的文献

1
Quantitative Proteome Landscape of the NCI-60 Cancer Cell Lines.NCI-60癌细胞系的定量蛋白质组图谱
iScience. 2019 Nov 22;21:664-680. doi: 10.1016/j.isci.2019.10.059. Epub 2019 Oct 31.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验