Biomarkers and Systems Biology Center, Research Triangle Institute, Research Triangle Park, North Carolina 27709-2194, USA.
Mol Cell Proteomics. 2010 Jul;9(7):1383-99. doi: 10.1074/mcp.M900378-MCP200. Epub 2010 Feb 23.
Population-based variability in protein expression patterns, especially in humans, is often observed but poorly understood. Moreover, very little is known about how interindividual genetic variation contributes to protein expression patterns. To begin to address this, we describe elements of technical and biological variations contributing to expression of 544 proteins in a population of 24 individual human lymphoblastoid cell lines that have been extensively genotyped as part of the International HapMap Project. We determined that expression levels of 10% of the proteins were tightly correlated to cell doubling rates. Using the publicly available genotypes for these lymphoblastoid cell lines, we applied a genetic association approach to identify quantitative trait loci associated with protein expression variation. Results identified 24 protein forms corresponding to 15 proteins for which genetic elements were responsible for >50% of the expression variation. The genetic variation associated with protein expression levels were located in cis with the gene coding for the transcript of the protein for 19 of these protein forms. Four of the genetic elements identified were coding non-synonymous single nucleotide polymorphisms that resulted in migration pattern changes in the two-dimensional gel. This is the first description of large scale proteomics analysis demonstrating the direct relationship between genome and proteome variations in human cells.
在人群中,蛋白质表达模式的变化(尤其是在人类中)经常被观察到,但却知之甚少。此外,人们对个体遗传变异如何影响蛋白质表达模式知之甚少。为了开始解决这个问题,我们描述了导致 24 个人类淋巴母细胞系群体中 544 种蛋白质表达的技术和生物学变异因素,这些细胞系已作为国际人类基因组单体型图计划的一部分进行了广泛的基因分型。我们发现,10%的蛋白质的表达水平与细胞倍增率密切相关。利用这些淋巴母细胞系的公开可用基因型,我们应用遗传关联方法来鉴定与蛋白质表达变异相关的数量性状基因座。结果确定了 24 种蛋白质形式,对应 15 种蛋白质,这些蛋白质的遗传元件负责超过 50%的表达变异。与蛋白质表达水平相关的遗传变异与这些蛋白质形式的转录物的基因编码区位于顺式位置。其中 19 种蛋白质形式的遗传元件与基因编码区位于顺式位置。这是首次描述大规模蛋白质组学分析,证明了人类细胞中基因组和蛋白质组变异之间的直接关系。