1] Chair of Proteomics and Bioanalytics, Technische Universität München, Emil-Erlenmeyer Forum 5, 85354 Freising, Germany [2] SAP AG, Dietmar-Hopp-Allee 16, 69190 Walldorf, Germany [3].
1] SAP AG, Dietmar-Hopp-Allee 16, 69190 Walldorf, Germany [2].
Nature. 2014 May 29;509(7502):582-7. doi: 10.1038/nature13319.
Proteomes are characterized by large protein-abundance differences, cell-type- and time-dependent expression patterns and post-translational modifications, all of which carry biological information that is not accessible by genomics or transcriptomics. Here we present a mass-spectrometry-based draft of the human proteome and a public, high-performance, in-memory database for real-time analysis of terabytes of big data, called ProteomicsDB. The information assembled from human tissues, cell lines and body fluids enabled estimation of the size of the protein-coding genome, and identified organ-specific proteins and a large number of translated lincRNAs (long intergenic non-coding RNAs). Analysis of messenger RNA and protein-expression profiles of human tissues revealed conserved control of protein abundance, and integration of drug-sensitivity data enabled the identification of proteins predicting resistance or sensitivity. The proteome profiles also hold considerable promise for analysing the composition and stoichiometry of protein complexes. ProteomicsDB thus enables navigation of proteomes, provides biological insight and fosters the development of proteomic technology.
蛋白质组的特点是蛋白质丰度差异大、细胞类型和时间依赖性表达模式以及翻译后修饰,所有这些都携带了基因组学或转录组学无法获取的生物学信息。在这里,我们展示了一个基于质谱的人类蛋白质组草图,以及一个名为 ProteomicsDB 的公共、高性能、内存中数据库,用于实时分析数太字节的大数据。从人体组织、细胞系和体液中收集的信息可用于估计蛋白质编码基因组的大小,并鉴定出器官特异性蛋白和大量翻译的长链非编码 RNA(lncRNA)。对人类组织中信使 RNA 和蛋白质表达谱的分析揭示了蛋白质丰度的保守调控,整合药物敏感性数据可用于鉴定预测耐药性或敏感性的蛋白质。蛋白质组图谱也为分析蛋白质复合物的组成和化学计量学提供了很大的潜力。因此,ProteomicsDB 能够对蛋白质组进行导航,提供生物学见解,并促进蛋白质组学技术的发展。