• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝达:基于伽玛回归均值-方差趋势的无标签蛋白质组学的贝叶斯分层建模。

Baldur: Bayesian Hierarchical Modeling for Label-Free Proteomics with Gamma Regressing Mean-Variance Trends.

机构信息

Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Mississippi State, Mississippi, USA; Department of Biochemistry, Molecular Biology, Entomology and Plant Pathology, Mississippi State University, Mississippi State, Mississippi, USA.

出版信息

Mol Cell Proteomics. 2023 Dec;22(12):100658. doi: 10.1016/j.mcpro.2023.100658. Epub 2023 Oct 7.

DOI:10.1016/j.mcpro.2023.100658
PMID:37806340
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10687340/
Abstract

Label-free proteomics is a fast-growing methodology to infer abundances in mass spectrometry proteomics. Extensive research has focused on spectral quantification and peptide identification. However, research toward modeling and understanding quantitative proteomics data is scarce. Here we propose a Bayesian hierarchical decision model (Baldur) to test for differences in means between conditions for proteins, peptides, and post-translational modifications. We developed a Bayesian regression model to characterize local mean-variance trends in data, to estimate measurement uncertainty and hyperparameters for the decision model. A key contribution is the development of a new gamma regression model that describes the mean-variance dependency as a mixture of a common and a latent trend-allowing for localized trend estimates. We then evaluate the performance of Baldur, limma-trend, and t test on six benchmark datasets: five total proteomics and one post-translational modification dataset. We find that Baldur drastically improves the decision in noisier post-translational modification data over limma-trend and t test. In addition, we see significant improvements using Baldur over the other methods in the total proteomics datasets. Finally, we analyzed Baldur's performance when increasing the number of replicates and found that the method always increases precision with sample size, while showing robust control of the false positive rate. We conclude that our model vastly improves over popular data analysis methods (limma-trend and t test) in several spike-in datasets by achieving a high true positive detection rate, while greatly reducing the false-positive rate.

摘要

无标记蛋白质组学是一种快速发展的方法,可用于推断质谱蛋白质组学中的丰度。大量研究集中在光谱定量和肽鉴定上。然而,用于建模和理解定量蛋白质组学数据的研究很少。在这里,我们提出了一种贝叶斯分层决策模型(Baldur),用于测试蛋白质、肽和翻译后修饰之间条件均值的差异。我们开发了一种贝叶斯回归模型来描述数据中局部均值-方差趋势,以估计决策模型的测量不确定性和超参数。一个关键贡献是开发了一种新的伽马回归模型,该模型将均值-方差依赖性描述为一个常见趋势和一个潜在趋势的混合体,从而允许进行局部趋势估计。然后,我们在六个基准数据集上评估了 Baldur、limma-trend 和 t 检验的性能:五个总蛋白质组学数据集和一个翻译后修饰数据集。我们发现,Baldur 极大地改善了在噪声更大的翻译后修饰数据中的决策,优于 limma-trend 和 t 检验。此外,我们在总蛋白质组学数据集中看到使用 Baldur 会显著提高性能。最后,我们分析了随着重复数量的增加,Baldur 的性能,发现该方法始终随着样本量的增加而提高精度,同时保持假阳性率的稳健控制。我们得出的结论是,我们的模型在几个 Spike-in 数据集上大大优于流行的数据分析方法(limma-trend 和 t 检验),通过实现高真阳性检测率,同时大大降低假阳性率。

相似文献

1
Baldur: Bayesian Hierarchical Modeling for Label-Free Proteomics with Gamma Regressing Mean-Variance Trends.贝达:基于伽玛回归均值-方差趋势的无标签蛋白质组学的贝叶斯分层建模。
Mol Cell Proteomics. 2023 Dec;22(12):100658. doi: 10.1016/j.mcpro.2023.100658. Epub 2023 Oct 7.
2
Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics.针对基于质谱的无标记定量蛋白质组学中差异分析的多重插补诱导变异性进行核算。
PLoS Comput Biol. 2022 Aug 29;18(8):e1010420. doi: 10.1371/journal.pcbi.1010420. eCollection 2022 Aug.
3
Triqler for Protein Summarization of Data from Data-Independent Acquisition Mass Spectrometry.Triqler 用于对数据非依赖性采集质谱数据进行蛋白质总结。
J Proteome Res. 2023 Apr 7;22(4):1359-1366. doi: 10.1021/acs.jproteome.2c00607. Epub 2023 Mar 29.
4
Identification of differentially expressed peptides in high-throughput proteomics data.高通量蛋白质组学数据中差异表达肽的鉴定。
Brief Bioinform. 2018 Sep 28;19(5):971-981. doi: 10.1093/bib/bbx031.
5
Uncertainty-Aware Protein-Level Quantification and Differential Expression Analysis of Proteomics Data with seaMass.基于 seaMass 对蛋白质组学数据进行不确定性感知的蛋白水平定量和差异表达分析。
Methods Mol Biol. 2023;2426:141-162. doi: 10.1007/978-1-0716-1967-4_8.
6
Evaluation of linear models and missing value imputation for the analysis of peptide-centric proteomics.肽质组学分析中线性模型的评估和缺失值插补
BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):102. doi: 10.1186/s12859-019-2619-6.
7
Peptide-level Robust Ridge Regression Improves Estimation, Sensitivity, and Specificity in Data-dependent Quantitative Label-free Shotgun Proteomics.肽水平稳健岭回归改善了数据依赖型无标记鸟枪法蛋白质组学中的估计、灵敏度和特异性。
Mol Cell Proteomics. 2016 Feb;15(2):657-68. doi: 10.1074/mcp.M115.055897. Epub 2015 Nov 13.
8
Empirical Bayesian random censoring threshold model improves detection of differentially abundant proteins.经验贝叶斯随机删失阈值模型可提高对差异丰富蛋白质的检测能力。
J Proteome Res. 2014 Sep 5;13(9):3871-80. doi: 10.1021/pr500171u. Epub 2014 Aug 22.
9
A simple peak detection and label-free quantitation algorithm for chromatography-mass spectrometry.一种用于色谱-质谱联用的简单峰检测和无标记定量算法。
BMC Bioinformatics. 2014 Nov 25;15(1):376. doi: 10.1186/s12859-014-0376-0.
10
On the beta-binomial model for analysis of spectral count data in label-free tandem mass spectrometry-based proteomics.基于无标记串联质谱蛋白质组学的谱计数数据的双二项式模型分析。
Bioinformatics. 2010 Feb 1;26(3):363-9. doi: 10.1093/bioinformatics/btp677. Epub 2009 Dec 9.