Shevchenko A, Jensen O N, Podtelejnikov A V, Sagliocco F, Wilm M, Vorm O, Mortensen P, Shevchenko A, Boucherie H, Mann M
Peptide and Protein Group, European Molecular Biology Laboratory, Heidelberg, Germany.
Proc Natl Acad Sci U S A. 1996 Dec 10;93(25):14440-5. doi: 10.1073/pnas.93.25.14440.
The function of many of the uncharacterized open reading frames discovered by genomic sequencing can be determined at the level of expressed gene products, the proteome. However, identifying the cognate gene from minute amounts of protein has been one of the major problems in molecular biology. Using yeast as an example, we demonstrate here that mass spectrometric protein identification is a general solution to this problem given a completely sequenced genome. As a first screen, our strategy uses automated laser desorption ionization mass spectrometry of the peptide mixtures produced by in-gel tryptic digestion of a protein. Up to 90% of proteins are identified by searching sequence data bases by lists of peptide masses obtained with high accuracy. The remaining proteins are identified by partially sequencing several peptides of the unseparated mixture by nanoelectrospray tandem mass spectrometry followed by data base searching with multiple peptide sequence tags. In blind trials, the method led to unambiguous identification in all cases. In the largest individual protein identification project to date, a total of 150 gel spots-many of them at subpicomole amounts-were successfully analyzed, greatly enlarging a yeast two-dimensional gel data base. More than 32 proteins were novel and matched to previously uncharacterized open reading frames in the yeast genome. This study establishes that mass spectrometry provides the required throughput, the certainty of identification, and the general applicability to serve as the method of choice to connect genome and proteome.
通过基因组测序发现的许多未鉴定的开放阅读框的功能,可以在表达的基因产物即蛋白质组水平上加以确定。然而,从微量蛋白质中鉴定出相应的基因一直是分子生物学中的主要问题之一。以酵母为例,我们在此证明,在基因组已完全测序的情况下,质谱蛋白质鉴定是解决这一问题的通用方法。作为初步筛选,我们的策略是对蛋白质经胰蛋白酶胶内消化产生的肽混合物进行自动激光解吸电离质谱分析。通过用高精度获得的肽质量列表搜索序列数据库,可鉴定高达90%的蛋白质。其余蛋白质则通过纳米电喷雾串联质谱对未分离混合物的几个肽进行部分测序,随后用多个肽序列标签搜索数据库来鉴定。在盲测中,该方法在所有情况下都能得到明确的鉴定结果。在迄今为止最大的单个蛋白质鉴定项目中,总共成功分析了150个凝胶点——其中许多处于亚皮摩尔量——极大地扩充了酵母二维凝胶数据库。超过32种蛋白质是新发现的,与酵母基因组中先前未鉴定的开放阅读框相匹配。这项研究表明,质谱分析提供了所需的通量、鉴定的确定性以及普遍适用性,可作为连接基因组和蛋白质组的首选方法。