Suppr超能文献

PaxDb 4.0版本:整合了模式生物、组织和细胞系的蛋白质丰度数据。

Version 4.0 of PaxDb: Protein abundance data, integrated across model organisms, tissues, and cell-lines.

作者信息

Wang Mingcong, Herrmann Christina J, Simonovic Milan, Szklarczyk Damian, von Mering Christian

机构信息

Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland.

出版信息

Proteomics. 2015 Sep;15(18):3163-8. doi: 10.1002/pmic.201400441. Epub 2015 Mar 12.

Abstract

Protein quantification at proteome-wide scale is an important aim, enabling insights into fundamental cellular biology and serving to constrain experiments and theoretical models. While proteome-wide quantification is not yet fully routine, many datasets approaching proteome-wide coverage are becoming available through biophysical and MS techniques. Data of this type can be accessed via a variety of sources, including publication supplements and online data repositories. However, access to the data is still fragmentary, and comparisons across experiments and organisms are not straightforward. Here, we describe recent updates to our database resource "PaxDb" (Protein Abundances Across Organisms). PaxDb focuses on protein abundance information at proteome-wide scope, irrespective of the underlying measurement technique. Quantification data is reprocessed, unified, and quality-scored, and then integrated to build a meta-resource. PaxDb also allows evolutionary comparisons through precomputed gene orthology relations. Recently, we have expanded the scope of the database to include cell-line samples, and more systematically scan the literature for suitable datasets. We report that a significant fraction of published experiments cannot readily be accessed and/or parsed for quantitative information, requiring additional steps and efforts. The current update brings PaxDb to 414 datasets in 53 organisms, with (semi-) quantitative abundance information covering more than 300,000 proteins.

摘要

在全蛋白质组范围内进行蛋白质定量是一个重要目标,它有助于深入了解细胞基础生物学,并为实验和理论模型提供约束。虽然全蛋白质组定量尚未完全成为常规操作,但通过生物物理和质谱技术,许多接近全蛋白质组覆盖范围的数据集正变得可用。这类数据可以通过多种来源获取,包括出版物补充材料和在线数据存储库。然而,数据的获取仍然是零散的,跨实验和生物体的比较也并非易事。在此,我们描述了我们的数据库资源“PaxDb”(跨生物体蛋白质丰度)的最新更新情况。PaxDb专注于全蛋白质组范围内的蛋白质丰度信息,而不考虑基础测量技术。定量数据经过重新处理、统一和质量评分,然后整合以构建一个元资源。PaxDb还允许通过预先计算的基因直系同源关系进行进化比较。最近,我们扩大了数据库的范围,纳入了细胞系样本,并更系统地在文献中搜索合适的数据集。我们报告称,很大一部分已发表的实验难以轻易获取和/或解析其定量信息,需要额外的步骤和努力。当前的更新使PaxDb包含了53种生物体中的414个数据集,其(半)定量丰度信息涵盖了超过30万种蛋白质。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8122/6680238/cf62aa9d39e5/PMIC-15-3163-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验