Suppr超能文献

pep2pro:一种用于全面蛋白质组数据分析的新工具,可揭示拟南芥器官特异性蛋白质组的信息。

pep2pro: a new tool for comprehensive proteome data analysis to reveal information about organ-specific proteomes in Arabidopsis thaliana.

机构信息

Department of Biology, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland.

出版信息

Integr Biol (Camb). 2011 Mar;3(3):225-37. doi: 10.1039/c0ib00078g. Epub 2011 Jan 24.

Abstract

pep2pro is a comprehensive proteome analysis database specifically suitable for flexible proteome data analysis. The pep2pro database schema offers solutions to the various challenges of developing a proteome data analysis database and because data integrated in pep2pro are in relational format, it enables flexible and detailed data analysis. The information provided here will facilitate building proteome data analysis databases for other organisms or applications. The capacity of the pep2pro database for the integration and analysis of large proteome datasets was demonstrated by creating the pep2pro dataset, which is an organ-specific characterisation of the Arabidopsis thaliana proteome containing 14 522 identified proteins based on 2.6 million peptide spectrum assignments. This dataset provides evidence of protein expression and reveals organ-specific processes. The high coverage and density of the dataset are essential for protein quantification by normalised spectral counting and allowed us to extract information that is usually not accessible in low-coverage datasets. With this quantitative protein information we analysed organ- and organelle-specific sub-proteomes. In addition we matched spectra to regions in the genome that were not predicted to have protein coding capacity and provide PCR validation for selected revised gene models. Furthermore, we analysed the peptide features that distinguish detected from non-detected peptides and found substantial disagreement between predicted and detected proteotypic peptides, suggesting that large-scale proteomics data are essential for efficient selection of proteotypic peptides in targeted proteomics surveys. The pep2pro dataset is available as a resource for plant systems biology at www.pep2pro.ethz.ch.

摘要

pep2pro 是一个全面的蛋白质组分析数据库,特别适合灵活的蛋白质组数据分析。pep2pro 数据库模式为开发蛋白质组数据分析数据库提供了解决方案,并且由于 pep2pro 中集成的数据采用关系格式,因此能够进行灵活和详细的数据分析。这里提供的信息将有助于为其他生物体或应用程序构建蛋白质组数据分析数据库。通过创建 pep2pro 数据集,展示了 pep2pro 数据库整合和分析大型蛋白质组数据集的能力,该数据集是拟南芥蛋白质组的器官特异性特征,基于 260 万个肽段谱分配,包含 14522 个鉴定的蛋白质。该数据集提供了蛋白质表达的证据,并揭示了器官特异性的过程。数据集的高覆盖率和密度对于通过归一化光谱计数进行蛋白质定量是必不可少的,并且允许我们提取通常在低覆盖率数据集中无法访问的信息。利用这些定量蛋白质信息,我们分析了器官和细胞器特异性亚蛋白质组。此外,我们将光谱与未预测具有蛋白质编码能力的基因组区域进行匹配,并为选定的修订基因模型提供 PCR 验证。此外,我们分析了区分检测到的肽和未检测到的肽的肽特征,发现预测的和检测到的肽特征之间存在很大的差异,这表明大规模蛋白质组数据对于靶向蛋白质组学调查中有效选择肽特征是必不可少的。pep2pro 数据集可在 www.pep2pro.ethz.ch 作为植物系统生物学的资源使用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验