• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于同时对 2D PAGE 凝胶中所有差异表达蛋白质进行分类的贝叶斯模型。

A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels.

机构信息

Bioinformatics Institute, University of Auckland, Private Bag, 92019, Auckland, New Zealand.

出版信息

BMC Bioinformatics. 2012 Jun 19;13:137. doi: 10.1186/1471-2105-13-137.

DOI:10.1186/1471-2105-13-137
PMID:22712439
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3505467/
Abstract

BACKGROUND

Two-dimensional polyacrylamide gel electrophoresis (2D PAGE) is commonly used to identify differentially expressed proteins under two or more experimental or observational conditions. Wu et al (2009) developed a univariate probabilistic model which was used to identify differential expression between Case and Control groups, by applying a Likelihood Ratio Test (LRT) to each protein on a 2D PAGE. In contrast to commonly used statistical approaches, this model takes into account the two possible causes of missing values in 2D PAGE: either (1) the non-expression of a protein; or (2) a level of expression that falls below the limit of detection.

RESULTS

We develop a global Bayesian model which extends the previously described model. Unlike the univariate approach, the model reported here is able treat all differentially expressed proteins simultaneously. Whereas each protein is modelled by the univariate likelihood function previously described, several global distributions are used to model the underlying relationship between the parameters associated with individual proteins. These global distributions are able to combine information from each protein to give more accurate estimates of the true parameters. In our implementation of the procedure, all parameters are recovered by Markov chain Monte Carlo (MCMC) integration. The 95% highest posterior density (HPD) intervals for the marginal posterior distributions are used to determine whether differences in protein expression are due to differences in mean expression intensities, and/or differences in the probabilities of expression.

CONCLUSIONS

Simulation analyses showed that the global model is able to accurately recover the underlying global distributions, and identify more differentially expressed proteins than the simple application of a LRT. Additionally, simulations also indicate that the probability of incorrectly identifying a protein as differentially expressed (i.e., the False Discovery Rate) is very low. The source code is available at https://github.com/stevenhwu/BIDE-2D.

摘要

背景

二维聚丙烯酰胺凝胶电泳(2D PAGE)常用于鉴定两个或更多实验或观察条件下差异表达的蛋白质。Wu 等人(2009 年)开发了一种单变量概率模型,通过对 2D PAGE 上的每个蛋白质应用似然比检验(LRT),用于识别病例和对照组之间的差异表达。与常用的统计方法不同,该模型考虑了 2D PAGE 中缺失值的两种可能原因:(1)蛋白质未表达;或(2)表达水平低于检测限。

结果

我们开发了一种全局贝叶斯模型,扩展了之前描述的模型。与单变量方法不同,这里报告的模型能够同时处理所有差异表达的蛋白质。虽然每个蛋白质都由之前描述的单变量似然函数建模,但使用了几个全局分布来建模与单个蛋白质相关联的参数之间的潜在关系。这些全局分布能够结合每个蛋白质的信息,给出更准确的真实参数估计。在我们的程序实现中,所有参数都通过马尔可夫链蒙特卡罗(MCMC)积分恢复。边际后验分布的 95%最高后验密度(HPD)区间用于确定蛋白质表达的差异是否归因于平均表达强度的差异,以及/或表达概率的差异。

结论

模拟分析表明,全局模型能够准确地恢复潜在的全局分布,并比简单应用 LRT 识别出更多差异表达的蛋白质。此外,模拟还表明,错误地将蛋白质识别为差异表达的概率(即假发现率)非常低。源代码可在 https://github.com/stevenhwu/BIDE-2D 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/d3ccc1a1bce8/1471-2105-13-137-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/1d66ea189853/1471-2105-13-137-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/93a47b900263/1471-2105-13-137-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/6613b7d97cd4/1471-2105-13-137-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/ff362f54d016/1471-2105-13-137-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/a200f5b8d2ed/1471-2105-13-137-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/5ad32569603a/1471-2105-13-137-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/6d9cc53a8505/1471-2105-13-137-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/d3ccc1a1bce8/1471-2105-13-137-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/1d66ea189853/1471-2105-13-137-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/93a47b900263/1471-2105-13-137-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/6613b7d97cd4/1471-2105-13-137-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/ff362f54d016/1471-2105-13-137-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/a200f5b8d2ed/1471-2105-13-137-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/5ad32569603a/1471-2105-13-137-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/6d9cc53a8505/1471-2105-13-137-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/38b2/3505467/d3ccc1a1bce8/1471-2105-13-137-8.jpg

相似文献

1
A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels.一种用于同时对 2D PAGE 凝胶中所有差异表达蛋白质进行分类的贝叶斯模型。
BMC Bioinformatics. 2012 Jun 19;13:137. doi: 10.1186/1471-2105-13-137.
2
A statistical model to identify differentially expressed proteins in 2D PAGE gels.一种用于识别二维聚丙烯酰胺凝胶中差异表达蛋白质的统计模型。
PLoS Comput Biol. 2009 Sep;5(9):e1000509. doi: 10.1371/journal.pcbi.1000509. Epub 2009 Sep 18.
3
Thermodynamic integration via differential evolution: A method for estimating marginal likelihoods.通过差分进化进行热力学集成:一种估计边缘似然的方法。
Behav Res Methods. 2019 Apr;51(2):930-947. doi: 10.3758/s13428-018-1172-y.
4
Estimation of evolutionary parameters using short, random and partial sequences from mixed samples of anonymous individuals.利用来自匿名个体混合样本的短的、随机的和部分序列估计进化参数。
BMC Bioinformatics. 2015 Nov 4;16:357. doi: 10.1186/s12859-015-0810-y.
5
Bayesian bivariate meta-analysis of diagnostic test studies using integrated nested Laplace approximations.贝叶斯二元诊断试验研究的综合嵌套拉普拉斯逼近的荟萃分析。
Stat Med. 2010 May 30;29(12):1325-39. doi: 10.1002/sim.3858.
6
A gradient Markov chain Monte Carlo algorithm for computing multivariate maximum likelihood estimates and posterior distributions: mixture dose-response assessment.用于计算多元极大似然估计值和后验分布的梯度马尔可夫链蒙特卡罗算法:混合剂量反应评估。
Risk Anal. 2012 Feb;32(2):345-59. doi: 10.1111/j.1539-6924.2011.01672.x. Epub 2011 Sep 11.
7
A Bayesian approach to a general regression model for ROC curves.一种用于ROC曲线的一般回归模型的贝叶斯方法。
Med Decis Making. 1998 Oct-Dec;18(4):436-43. doi: 10.1177/0272989X9801800412.
8
Exploring fast computational strategies for probabilistic phylogenetic analysis.探索概率系统发育分析的快速计算策略。
Syst Biol. 2007 Oct;56(5):711-26. doi: 10.1080/10635150701611258.
9
Bringing metabolic networks to life: integration of kinetic, metabolic, and proteomic data.让代谢网络鲜活起来:动力学、代谢组学和蛋白质组学数据的整合
Theor Biol Med Model. 2006 Dec 15;3:42. doi: 10.1186/1742-4682-3-42.
10
Mixture models for single-cell assays with applications to vaccine studies.单细胞分析的混合模型及其在疫苗研究中的应用。
Biostatistics. 2014 Jan;15(1):87-101. doi: 10.1093/biostatistics/kxt024. Epub 2013 Jul 24.

本文引用的文献

1
AUTOMATED ANALYSIS OF QUANTITATIVE IMAGE DATA USING ISOMORPHIC FUNCTIONAL MIXED MODELS, WITH APPLICATION TO PROTEOMICS DATA.使用同构功能混合模型对定量图像数据进行自动分析,并应用于蛋白质组学数据。
Ann Appl Stat. 2011 Jan 1;5(2A):894-923. doi: 10.1214/10-aoas407.
2
Missing values in gel-based proteomics.凝胶基蛋白质组学中的缺失值。
Proteomics. 2010 Mar;10(6):1202-11. doi: 10.1002/pmic.200800576.
3
A statistical model to identify differentially expressed proteins in 2D PAGE gels.一种用于识别二维聚丙烯酰胺凝胶中差异表达蛋白质的统计模型。
PLoS Comput Biol. 2009 Sep;5(9):e1000509. doi: 10.1371/journal.pcbi.1000509. Epub 2009 Sep 18.
4
Incorporating pathway information into boosting estimation of high-dimensional risk prediction models.将通路信息纳入高维风险预测模型的增强估计中。
BMC Bioinformatics. 2009 Jan 13;10:18. doi: 10.1186/1471-2105-10-18.
5
The state of the art in the analysis of two-dimensional gel electrophoresis images.二维凝胶电泳图像分析的技术现状。
Appl Microbiol Biotechnol. 2007 Oct;76(6):1223-43. doi: 10.1007/s00253-007-1128-0. Epub 2007 Aug 23.
6
A probabilistic treatment of the missing spot problem in 2D gel electrophoresis experiments.二维凝胶电泳实验中缺失斑点问题的概率处理方法。
J Proteome Res. 2007 Aug;6(8):3335-43. doi: 10.1021/pr070137p. Epub 2007 Jul 11.
7
Multivariate analysis of 2-DE protein patterns--practical approaches.二维蛋白质图谱的多变量分析——实用方法
Electrophoresis. 2007 Apr;28(8):1289-99. doi: 10.1002/elps.200600414.
8
Challenges related to analysis of protein spot volumes from two-dimensional gel electrophoresis as revealed by replicate gels.重复凝胶显示的二维凝胶电泳蛋白质斑点体积分析相关挑战。
J Proteome Res. 2006 Dec;5(12):3399-410. doi: 10.1021/pr0603250.
9
The pitfalls of proteomics experiments without the correct use of bioinformatics tools.蛋白质组学实验中若不正确使用生物信息学工具所存在的缺陷。
Proteomics. 2006 Oct;6(20):5577-96. doi: 10.1002/pmic.200600223.
10
Software-induced variance in two-dimensional gel electrophoresis image analysis.二维凝胶电泳图像分析中的软件诱导变异
Electrophoresis. 2005 Dec;26(23):4508-20. doi: 10.1002/elps.200500253.