Science for Life Laboratory, KTH-Royal Institute of Technology, Stockholm, Sweden.
J Proteome Res. 2013 Jun 7;12(6):2439-48. doi: 10.1021/pr300924j. Epub 2012 Dec 31.
A gene-centric Human Proteome Project has been proposed to characterize the human protein-coding genes in a chromosome-centered manner to understand human biology and disease. Here, we report on the protein evidence for all genes predicted from the genome sequence based on manual annotation from literature (UniProt), antibody-based profiling in cells, tissues and organs and analysis of the transcript profiles using next generation sequencing in human cell lines of different origins. We estimate that there is good evidence for protein existence for 69% (n = 13985) of the human protein-coding genes, while 23% have only evidence on the RNA level and 7% still lack experimental evidence. Analysis of the expression patterns shows few tissue-specific proteins and approximately half of the genes expressed in all the analyzed cells. The status for each gene with regards to protein evidence is visualized in a chromosome-centric manner as part of a new version of the Human Protein Atlas ( www.proteinatlas.org ).
一个以基因为中心的人类蛋白质组计划已经被提议,以染色体为中心的方式来描述人类蛋白质编码基因,从而了解人类生物学和疾病。在这里,我们报告了基于文献(UniProt)手动注释、细胞、组织和器官中的抗体分析以及使用不同来源的人类细胞系进行下一代测序的转录谱分析,为所有从基因组序列预测的基因提供的蛋白质证据。我们估计,对于 69%(n=13985)的人类蛋白质编码基因,有很好的蛋白质存在证据,而 23%的基因只有在 RNA 水平上有证据,7%的基因仍然缺乏实验证据。表达模式的分析表明,很少有组织特异性蛋白质,大约一半的基因在所有分析的细胞中表达。每个基因的蛋白质证据状态以染色体为中心的方式可视化,作为新版本的人类蛋白质图谱(www.proteinatlas.org)的一部分。