Baxter Susan M, Rosenblum Jonathan S, Knutson Stacy, Nelson Melanie R, Montimurro Jennifer S, Di Gennaro Jeannine A, Speir Jeffrey A, Burbaum Jonathan J, Fetrow Jacquelyn S
GeneFormatics, Inc., 5830 Oberlin Drive, Suite 200, San Diego, CA 92121, USA.
Mol Cell Proteomics. 2004 Mar;3(3):209-25. doi: 10.1074/mcp.M300082-MCP200. Epub 2003 Nov 25.
An analysis of the structurally and catalytically diverse serine hydrolase protein family in the Saccharomyces cerevisiae proteome was undertaken using two independent but complementary, large-scale approaches. The first approach is based on computational analysis of serine hydrolase active site structures; the second utilizes the chemical reactivity of the serine hydrolase active site in complex mixtures. These proteomics approaches share the ability to fractionate the complex proteome into functional subsets. Each method identified a significant number of sequences, but 15 proteins were identified by both methods. Eight of these were unannotated in the Saccharomyces Genome Database at the time of this study and are thus novel serine hydrolase identifications. Three of the previously uncharacterized proteins are members of a eukaryotic serine hydrolase family, designated as Fsh (family of serine hydrolase), identified here for the first time. OVCA2, a potential human tumor suppressor, and DYR-SCHPO, a dihydrofolate reductase from Schizosaccharomyces pombe, are members of this family. Comparing the combined results to results of other proteomic methods showed that only four of the 15 proteins were identified in a recent large-scale, "shotgun" proteomic analysis and eight were identified using a related, but similar, approach (neither identifies function). Only 10 of the 15 were annotated using alternate motif-based computational tools. The results demonstrate the precision derived from combining complementary, function-based approaches to extract biological information from complex proteomes. The chemical proteomics technology indicates that a functional protein is being expressed in the cell, while the computational proteomics technology adds details about the specific type of function and residue that is likely being labeled. The combination of synergistic methods facilitates analysis, enriches true positive results, and increases confidence in novel identifications. This work also highlights the risks inherent in annotation transfer and the use of scoring functions for determination of correct annotations.
利用两种独立但互补的大规模方法,对酿酒酵母蛋白质组中结构和催化功能多样的丝氨酸水解酶蛋白家族进行了分析。第一种方法基于对丝氨酸水解酶活性位点结构的计算分析;第二种方法利用丝氨酸水解酶活性位点在复杂混合物中的化学反应性。这些蛋白质组学方法都具备将复杂蛋白质组分离为功能亚组的能力。每种方法都鉴定出了大量序列,但有15种蛋白质是两种方法都鉴定出来的。其中8种在本研究时在酿酒酵母基因组数据库中未被注释,因此是新鉴定出的丝氨酸水解酶。3种先前未鉴定的蛋白质是一个真核丝氨酸水解酶家族的成员,该家族被命名为Fsh(丝氨酸水解酶家族),在此首次被鉴定。潜在的人类肿瘤抑制因子OVCA2和粟酒裂殖酵母的二氢叶酸还原酶DYR-SCHPO都是这个家族的成员。将综合结果与其他蛋白质组学方法的结果进行比较表明,在最近的大规模“鸟枪法”蛋白质组分析中,这15种蛋白质中只有4种被鉴定出来,另外8种是使用一种相关但类似的方法鉴定出来的(两种方法都未鉴定功能)。这15种蛋白质中只有10种是使用基于替代基序的计算工具进行注释的。结果表明,将基于功能的互补方法结合起来从复杂蛋白质组中提取生物学信息具有很高的精确性。化学蛋白质组学技术表明细胞中正在表达一种功能性蛋白质,而计算蛋白质组学技术则补充了有关可能被标记的特定功能类型和残基的详细信息。协同方法的结合有助于分析,丰富了真阳性结果,并增加了对新鉴定结果的信心。这项工作还突出了注释转移以及使用评分函数确定正确注释所固有的风险。