• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pmartR:基于质谱的生物学数据的质量控制和统计。

pmartR: Quality Control and Statistics for Mass Spectrometry-Based Biological Data.

机构信息

National Security Directorate , Pacific Northwest National Laboratory , 902 Battelle Boulevard , Richland , Washington 99354 , United States.

Earth & Biological Sciences Directorate , Pacific Northwest National Laboratory , 902 Battelle Boulavard , Richland , Washington 99354 , United States.

出版信息

J Proteome Res. 2019 Mar 1;18(3):1418-1425. doi: 10.1021/acs.jproteome.8b00760. Epub 2019 Jan 28.

DOI:10.1021/acs.jproteome.8b00760
PMID:30638385
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6750869/
Abstract

Prior to statistical analysis of mass spectrometry (MS) data, quality control (QC) of the identified biomolecule peak intensities is imperative for reducing process-based sources of variation and extreme biological outliers. Without this step, statistical results can be biased. Additionally, liquid chromatography-MS proteomics data present inherent challenges due to large amounts of missing data that require special consideration during statistical analysis. While a number of R packages exist to address these challenges individually, there is no single R package that addresses all of them. We present pmartR, an open-source R package, for QC (filtering and normalization), exploratory data analysis (EDA), visualization, and statistical analysis robust to missing data. Example analysis using proteomics data from a mouse study comparing smoke exposure to control demonstrates the core functionality of the package and highlights the capabilities for handling missing data. In particular, using a combined quantitative and qualitative statistical test, 19 proteins whose statistical significance would have been missed by a quantitative test alone were identified. The pmartR package provides a single software tool for QC, EDA, and statistical comparisons of MS data that is robust to missing data and includes numerous visualization capabilities.

摘要

在进行质谱(MS)数据分析之前,必须对鉴定出的生物分子峰强度进行质量控制(QC),以减少基于过程的变异源和极端生物学离群值。如果不进行这一步骤,统计结果可能会出现偏差。此外,由于存在大量缺失数据,液相色谱-MS 蛋白质组学数据在统计分析过程中需要特别考虑,因此存在固有挑战。虽然有许多 R 包可以单独解决这些挑战,但没有一个 R 包可以解决所有这些挑战。我们提出了 pmartR,这是一个开源的 R 包,用于 QC(过滤和归一化)、探索性数据分析(EDA)、可视化和对缺失数据具有鲁棒性的统计分析。使用来自一项比较吸烟暴露与对照的小鼠研究的蛋白质组学数据进行的示例分析展示了该软件包的核心功能,并强调了处理缺失数据的能力。特别是,使用一种综合的定量和定性统计测试,确定了 19 种仅通过定量测试可能会错过的蛋白质,这些蛋白质具有统计学意义。pmartR 软件包为 MS 数据的 QC、EDA 和统计比较提供了一个单一的软件工具,该工具对缺失数据具有鲁棒性,并包含许多可视化功能。

相似文献

1
pmartR: Quality Control and Statistics for Mass Spectrometry-Based Biological Data.pmartR:基于质谱的生物学数据的质量控制和统计。
J Proteome Res. 2019 Mar 1;18(3):1418-1425. doi: 10.1021/acs.jproteome.8b00760. Epub 2019 Jan 28.
2
: A Quality Control, Visualization, and Statistics Pipeline for Multiple Omics Datatypes.多组学生物学数据的质量控制、可视化和统计分析流水线
J Proteome Res. 2023 Feb 3;22(2):570-576. doi: 10.1021/acs.jproteome.2c00610. Epub 2023 Jan 9.
3
Open-source platform for the analysis of liquid chromatography-mass spectrometry (LC-MS) data.用于液相色谱-质谱联用(LC-MS)数据分析的开源平台。
Methods Mol Biol. 2008;428:369-82. doi: 10.1007/978-1-59745-117-8_19.
4
PMart Web Application: Marketplace for Interactive Analysis of Panomics Data.PMart 网络应用程序:用于 Panomics 数据交互分析的市场。
J Proteome Res. 2024 Aug 2;23(8):3310-3317. doi: 10.1021/acs.jproteome.3c00512. Epub 2023 Dec 12.
5
Normalization and missing value imputation for label-free LC-MS analysis.无标记 LC-MS 分析的归一化和缺失值插补。
BMC Bioinformatics. 2012;13 Suppl 16(Suppl 16):S5. doi: 10.1186/1471-2105-13-S16-S5. Epub 2012 Nov 5.
6
Decon2LS: An open-source software package for automated processing and visualization of high resolution mass spectrometry data.Decon2LS:一个用于高分辨率质谱数据自动处理和可视化的开源软件包。
BMC Bioinformatics. 2009 Mar 17;10:87. doi: 10.1186/1471-2105-10-87.
7
Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.基于质谱的无标记全局蛋白质组学中缺失值插补挑战的综述、评估与讨论。
J Proteome Res. 2015 May 1;14(5):1993-2001. doi: 10.1021/pr501138h. Epub 2015 Apr 22.
8
A statistical selection strategy for normalization procedures in LC-MS proteomics experiments through dataset-dependent ranking of normalization scaling factors.通过基于数据集的归一化缩放因子排序,对 LC-MS 蛋白质组学实验中的归一化程序进行统计选择策略。
Proteomics. 2011 Dec;11(24):4736-41. doi: 10.1002/pmic.201100078. Epub 2011 Nov 17.
9
Experimental design and data-analysis in label-free quantitative LC/MS proteomics: A tutorial with MSqRob.无标记定量 LC/MS 蛋白质组学中的实验设计和数据分析:MSqRob 教程。
J Proteomics. 2018 Jan 16;171:23-36. doi: 10.1016/j.jprot.2017.04.004. Epub 2017 Apr 5.
10
ROCS: a reproducibility index and confidence score for interaction proteomics studies.ROCS:交互蛋白质组学研究的可重复性指数和置信得分。
BMC Bioinformatics. 2012 Jun 8;13:128. doi: 10.1186/1471-2105-13-128.

引用本文的文献

1
Conditional knockout of C/EBP[Formula: see text] in epidermis results in dysregulated lipid biosynthesis and a defect in skin barrier function.表皮中C/EBP[公式:见正文]的条件性敲除导致脂质生物合成失调和皮肤屏障功能缺陷。
PLoS One. 2025 Jun 25;20(6):e0326670. doi: 10.1371/journal.pone.0326670. eCollection 2025.
2
Multi-Omic Profiling Identifies Conserved Metabolic Pathways Critical for SARS-CoV-2 Variants Infection.多组学分析确定了对新冠病毒变异株感染至关重要的保守代谢途径。
bioRxiv. 2025 Jun 3:2025.06.02.657371. doi: 10.1101/2025.06.02.657371.
3
Characterization of Caenorhabditis elegans sphingomyelin synthases through heterologous expression.

本文引用的文献

1
mixOmics: An R package for 'omics feature selection and multiple data integration.mixOmics:一个用于“组学”特征选择和多数据整合的R包。
PLoS Comput Biol. 2017 Nov 3;13(11):e1005752. doi: 10.1371/journal.pcbi.1005752. eCollection 2017 Nov.
2
In-depth method assessments of differentially expressed protein detection for shotgun proteomics data with missing values.深度方法评估用于具有缺失值的鸟枪法蛋白质组学数据的差异表达蛋白检测。
Sci Rep. 2017 Jun 13;7(1):3367. doi: 10.1038/s41598-017-03650-8.
3
metaX: a flexible and comprehensive software for processing metabolomics data.
通过异源表达对线虫鞘磷脂合成酶进行表征。
J Biol Chem. 2025 Jul;301(7):110300. doi: 10.1016/j.jbc.2025.110300. Epub 2025 Jun 3.
4
Quality Control in the Mass Spectrometry Proteomics Core: A Practical Primer.质谱蛋白质组学核心实验室的质量控制:实用入门指南。
J Biomol Tech. 2024 Sep 12;35(3). doi: 10.7171/3fc1f5fe.42308a9a. eCollection 2024 Sep 30.
5
Lipidomics reveals cell specific changes during pluripotent differentiation to neural and mesodermal lineages.脂质组学揭示了多能干细胞向神经和中胚层谱系分化过程中的细胞特异性变化。
Mol Omics. 2025 Mar 13. doi: 10.1039/d4mo00261j.
6
Data from a multi-year targeted proteomics study of a longitudinal birth cohort of type 1 diabetes.一项针对1型糖尿病纵向出生队列的多年靶向蛋白质组学研究的数据。
Sci Data. 2025 Jan 20;12(1):112. doi: 10.1038/s41597-024-04249-1.
7
Lipidomics Reveals Cell Specific Changes During Pluripotent Differentiation to Neural and Mesodermal Lineages.脂质组学揭示多能干细胞向神经和中胚层谱系分化过程中的细胞特异性变化。
bioRxiv. 2025 Jan 1:2024.12.31.630916. doi: 10.1101/2024.12.31.630916.
8
Calculating and Reporting Coefficients of Variation for DIA-Based Proteomics.计算和报告基于数据独立采集(DIA)的蛋白质组学的变异系数
J Proteome Res. 2024 Dec 6;23(12):5274-5278. doi: 10.1021/acs.jproteome.4c00461. Epub 2024 Nov 22.
9
The human plasma lipidome response to exertional heat tolerance testing.人体血浆脂质组对运动性热耐力测试的反应。
Lipids Health Dis. 2024 Nov 15;23(1):380. doi: 10.1186/s12944-024-02322-7.
10
Exploring the Synergistic Secretome: Insights from Co-Cultivation of and RUT-C30.探索协同分泌蛋白组:来自与RUT-C30共培养的见解。 (你原文中“and”后面缺失了具体内容)
J Fungi (Basel). 2024 Sep 28;10(10):677. doi: 10.3390/jof10100677.
MetaX:一款用于处理代谢组学数据的灵活且全面的软件。
BMC Bioinformatics. 2017 Mar 21;18(1):183. doi: 10.1186/s12859-017-1579-y.
4
DAPAR & ProStaR: software to perform statistical analyses in quantitative discovery proteomics.DAPAR和ProStaR:用于定量发现蛋白质组学统计分析的软件。
Bioinformatics. 2017 Jan 1;33(1):135-136. doi: 10.1093/bioinformatics/btw580. Epub 2016 Sep 6.
5
Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies.考虑无标记定量蛋白质组学数据集中缺失值的多重性质以比较插补策略。
J Proteome Res. 2016 Apr 1;15(4):1116-25. doi: 10.1021/acs.jproteome.5b00981. Epub 2016 Mar 1.
6
Testing and Validation of Computational Methods for Mass Spectrometry.质谱计算方法的测试与验证
J Proteome Res. 2016 Mar 4;15(3):809-14. doi: 10.1021/acs.jproteome.5b00852. Epub 2015 Nov 17.
7
Review, evaluation, and discussion of the challenges of missing value imputation for mass spectrometry-based label-free global proteomics.基于质谱的无标记全局蛋白质组学中缺失值插补挑战的综述、评估与讨论。
J Proteome Res. 2015 May 1;14(5):1993-2001. doi: 10.1021/pr501138h. Epub 2015 Apr 22.
8
Visualization of proteomics data using R and bioconductor.使用R和生物导体可视化蛋白质组学数据。
Proteomics. 2015 Apr;15(8):1375-89. doi: 10.1002/pmic.201400392.
9
Orchestrating high-throughput genomic analysis with Bioconductor.使用Bioconductor编排高通量基因组分析。
Nat Methods. 2015 Feb;12(2):115-21. doi: 10.1038/nmeth.3252.
10
Bayesian proteoform modeling improves protein quantification of global proteomic measurements.贝叶斯蛋白质异构体建模可改善全局蛋白质组学测量中的蛋白质定量。
Mol Cell Proteomics. 2014 Dec;13(12):3639-46. doi: 10.1074/mcp.M113.030932.