MsImpute: Estimation of Missing Peptide Intensity Data in Label-Free Quantitative Mass Spectrometry.

Affiliations

Bioinformatics Division, WEHI, Melbourne, Australia; Department of Medical Biology, University of Melbourne, Melbourne, Australia; Colonial Foundation Healthy Ageing Centre, WEHI, Melbourne, Australia.

Department of Medical Biology, University of Melbourne, Melbourne, Australia; Colonial Foundation Healthy Ageing Centre, WEHI, Melbourne, Australia; Advanced Technology and Biology Division, WEHI, Melbourne, Australia.

Publication information

Mol Cell Proteomics. 2023 Aug;22(8):100558. doi: 10.1016/j.mcpro.2023.100558. Epub 2023 Apr 25.

DOI:10.1016/j.mcpro.2023.100558

PMID:37105364

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10368900/

Abstract

Mass spectrometry (MS) enables high-throughput identification and quantification of proteins in complex biological samples and can provide insights into the global function of biological systems. Label-free quantification is cost-effective and suitable for the analysis of human samples. Despite rapid developments in label-free data acquisition workflows, the number of proteins quantified across samples can be limited by technical and biological variability. This variation can result in missing values, which in turn complicate downstream data analysis. General-purpose or gene expression-specific imputation algorithms are widely used to improve data completeness. Here, we propose an imputation algorithm designed for label-free MS data that is aware of the type of missingness affecting the data. We evaluate it on published datasets acquired by data-dependent and data-independent acquisition workflows with variable degrees of biological complexity. On these datasets, the proposed missing value estimation procedure by barycenter computation competes closely with state-of-the-art imputation algorithms in differential abundance tasks, outperforms them in the accuracy of variance estimates of the peptide abundance measurements, and better controls the false discovery rate in label-free MS experiments. The barycenter estimation procedure is implemented in the msImpute software package, which is available from the Bioconductor repository.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9152/10368900/0a1934f84425/fx1.jpg

Similar articles

A comprehensive evaluation of popular proteomics software workflows for label-free proteome quantification and imputation.

Brief Bioinform. 2018 Nov 27;19(6):1344-1355. doi: 10.1093/bib/bbx054.

Benchmarking quantitative label-free LC-MS data processing workflows using a complex spiked proteomic standard dataset.

J Proteomics. 2016 Jan 30;132:51-62. doi: 10.1016/j.jprot.2015.11.011. Epub 2015 Nov 14.

PEPerMINT: peptide abundance imputation in mass spectrometry-based proteomics using graph neural networks.

Bioinformatics. 2024 Sep 1;40(Suppl 2):ii70-ii78. doi: 10.1093/bioinformatics/btae389.

ProtQuant: a tool for the label-free quantification of MudPIT proteomics data.

BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S24. doi: 10.1186/1471-2105-8-S7-S24.

Assessment of label-free quantification and missing value imputation for proteomics in non-human primates.

BMC Genomics. 2022 Jul 8;23(1):496. doi: 10.1186/s12864-022-08723-1.

A simple peak detection and label-free quantitation algorithm for chromatography-mass spectrometry.

BMC Bioinformatics. 2014 Nov 25;15(1):376. doi: 10.1186/s12859-014-0376-0.

Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.

J Proteome Res. 2015 May 1;14(5):1993-2001. doi: 10.1021/pr501138h. Epub 2015 Apr 22.

Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies.

J Proteome Res. 2016 Apr 1;15(4):1116-25. doi: 10.1021/acs.jproteome.5b00981. Epub 2016 Mar 1.

Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics.

PLoS Comput Biol. 2022 Aug 29;18(8):e1010420. doi: 10.1371/journal.pcbi.1010420. eCollection 2022 Aug.

Cited by

MORC2 is a phosphorylation-dependent DNA compaction machine.

Nat Commun. 2025 Jul 1;16(1):5606. doi: 10.1038/s41467-025-60751-z.

Functional characterisation of components in two Plasmodium falciparum Cullin-RING-Ligase complexes.

Sci Rep. 2025 Jul 1;15(1):21359. doi: 10.1038/s41598-025-05342-0.

Optimizing imputation strategies for mass spectrometry-based proteomics considering intensity and missing value rates.

Comput Struct Biotechnol J. 2025 May 3;27:1818-1826. doi: 10.1016/j.csbj.2025.04.041. eCollection 2025.

Self-Produced Brain-Like ECM From 3D-Cultured Dermal Fibroblasts Enhances Neuronal Growth and Survival.

Biotechnol J. 2025 Mar;20(3):e202400594. doi: 10.1002/biot.202400594.

SWAPS: A Modular Deep-Learning Empowered Peptide Identity Propagation Framework Beyond Match-Between-Run.

J Proteome Res. 2025 Apr 4;24(4):1926-1940. doi: 10.1021/acs.jproteome.4c00972. Epub 2025 Mar 7.

Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning.

Nat Commun. 2024 Jun 26;15(1):5405. doi: 10.1038/s41467-024-48711-5.

Evaluating Proteomics Imputation Methods with Improved Criteria.

J Proteome Res. 2023 Nov 3;22(11):3427-3438. doi: 10.1021/acs.jproteome.3c00205. Epub 2023 Oct 20.
