Proteome coverage prediction for integrated proteomics datasets.

Authors

Claassen Manfred, Aebersold Ruedi, Buhmann Joachim M

Affiliation

Department of Computer Science, ETH Zurich, Zurich, Switzerland.

Publication

J Comput Biol. 2011 Mar;18(3):283-93. doi: 10.1089/cmb.2010.0261.

DOI: 10.1089/cmb.2010.0261
PMID: 21385034
Abstract

Comprehensive characterization of a proteome defines a fundamental goal in proteomics. In order to maximize proteome coverage for a complex protein mixture, i.e., to identify as many proteins as possible, various different fractionation experiments are typically performed and the individual fractions are subjected to mass spectrometric analysis. The resulting data are integrated into large and heterogeneous datasets. Proteome coverage prediction refers to the task of extrapolating the number of protein discoveries by future measurements conditioned on a sequence of already performed measurements. Proteome coverage prediction at an early stage enables experimentalists to design and plan efficient proteomics studies. To date, there does not exist any method that reliably predicts proteome coverage from integrated datasets. We present a generalized hierarchical Pitman-Yor process model that explicitly captures the redundancy within integrated datasets. The accuracy of our approach for proteome coverage prediction is assessed by applying it to an integrated proteomics dataset for the bacterium L. interrogans. The proposed procedure outperforms ad hoc extrapolation methods and prediction methods designed for non-integrated datasets. Furthermore, the maximally achievable proteome coverage is estimated for the experimental setup underlying the L. interrogans dataset. We discuss the implications of our results for determining rational stop criteria and their influence on the design of efficient and reliable proteomics studies.
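The extrapolation task described in the abstract can be illustrated with the predictive rule of an ordinary, single-level Pitman-Yor process. This is a minimal sketch only, not the generalized hierarchical model the authors propose: it ignores the redundancy between fractionation experiments that their model is designed to capture. It assumes each observation is one identification event assigned to a protein, and that after n observations covering k distinct proteins the next observation reveals a new protein with probability (concentration + discount * k) / (concentration + n). The function names and all parameter values below are hypothetical; in practice the discount and concentration would have to be fitted to the measured data.

```python
def _check(n, k, discount, concentration):
    """Validate the standard Pitman-Yor parameter ranges and the counts."""
    if not 0.0 <= discount < 1.0:
        raise ValueError("discount must lie in [0, 1)")
    if concentration <= -discount:
        raise ValueError("concentration must exceed -discount")
    if n < 0 or k < 0 or k > n:
        raise ValueError("need 0 <= k <= n")


def prob_new(n, k, discount, concentration):
    """Probability that observation n + 1 reveals a protein not seen before,
    given n observations that cover k distinct proteins."""
    _check(n, k, discount, concentration)
    if n == 0:
        return 1.0  # the very first observation is always a new protein
    return (concentration + discount * k) / (concentration + n)


def expected_distinct(n, k, m, discount, concentration):
    """Expected number of distinct proteins after m further observations.

    The probability of a new discovery is linear in the current number of
    distinct proteins, so the expectation can be propagated step by step
    without simulation.
    """
    _check(n, k, discount, concentration)
    if m < 0:
        raise ValueError("m must be non-negative")
    expected_k = float(k)
    for step in range(m):
        total = n + step  # observations made before this step
        if total == 0:
            p_new = 1.0
        else:
            p_new = (concentration + discount * expected_k) / (concentration + total)
        expected_k += p_new
    return expected_k


if __name__ == "__main__":
    # Hypothetical numbers: 5000 identifications so far, 1200 distinct proteins.
    seen, distinct = 5000, 1200
    for extra in (1000, 5000, 20000):
        estimate = expected_distinct(seen, distinct, extra, discount=0.6, concentration=50.0)
        print(extra, round(estimate, 1))
```

Comparing the expected gain for different numbers of additional observations gives a rough feel for how a stop criterion could be framed: once the predicted number of new discoveries per additional measurement falls below a chosen threshold, further experiments of the same kind add little coverage.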


Similar articles

1
Proteome coverage prediction for integrated proteomics datasets.
J Comput Biol. 2011 Mar;18(3):283-93. doi: 10.1089/cmb.2010.0261.
2
Proteome coverage prediction with infinite Markov models.
Bioinformatics. 2009 Jun 15;25(12):i154-60. doi: 10.1093/bioinformatics/btp233.
3
High-coverage proteome analysis reveals the first insight of protein modification systems in the pathogenic spirochete Leptospira interrogans.
Cell Res. 2010 Feb;20(2):197-210. doi: 10.1038/cr.2009.127. Epub 2009 Nov 17.
4
Proteome-wide cellular protein concentrations of the human pathogen Leptospira interrogans.
Nature. 2009 Aug 6;460(7256):762-5. doi: 10.1038/nature08184. Epub 2009 Jul 15.
5
pep2pro: a new tool for comprehensive proteome data analysis to reveal information about organ-specific proteomes in Arabidopsis thaliana.
Integr Biol (Camb). 2011 Mar;3(3):225-37. doi: 10.1039/c0ib00078g. Epub 2011 Jan 24.
6
Global proteome analysis of Leptospira interrogans.
J Proteome Res. 2009 Oct;8(10):4564-78. doi: 10.1021/pr9004597.
7
Critical assessment of proteome-wide label-free absolute abundance estimation strategies.
Proteomics. 2013 Sep;13(17):2567-78. doi: 10.1002/pmic.201300135. Epub 2013 Jul 30.
8
Absolute quantification of microbial proteomes at different states by directed mass spectrometry.
Mol Syst Biol. 2011 Jul 19;7:510. doi: 10.1038/msb.2011.37.
9
Improved prediction of peptide detectability for targeted proteomics using a rank-based algorithm and organism-specific data.
J Proteomics. 2014 Aug 28;108:269-83. doi: 10.1016/j.jprot.2014.05.011. Epub 2014 May 27.
10
Comprehensive proteomics.
Curr Opin Biotechnol. 2011 Feb;22(1):3-8. doi: 10.1016/j.copbio.2010.09.002. Epub 2010 Oct 1.

Cited by

1
The Mtb proteome library: a resource of assays to quantify the complete proteome of Mycobacterium tuberculosis.
Cell Host Microbe. 2013 May 15;13(5):602-612. doi: 10.1016/j.chom.2013.04.008.
2
Inference and validation of protein identifications.
Mol Cell Proteomics. 2012 Nov;11(11):1097-104. doi: 10.1074/mcp.R111.014795. Epub 2012 Aug 3.
3
The quantitative proteome of a human cell line.
Mol Syst Biol. 2011 Nov 8;7:549. doi: 10.1038/msb.2011.82.
4
Generic comparison of protein inference engines.
Mol Cell Proteomics. 2012 Apr;11(4):O110.007088. doi: 10.1074/mcp.O110.007088. Epub 2011 Nov 4.
5
Absolute quantification of microbial proteomes at different states by directed mass spectrometry.
Mol Syst Biol. 2011 Jul 19;7:510. doi: 10.1038/msb.2011.37.