Deng Minghua, Mehta Shipra, Sun Fengzhu, Chen Ting
Program in Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089, USA.
Genome Res. 2002 Oct;12(10):1540-8. doi: 10.1101/gr.153002.
The interaction between proteins is one of the most important features of protein functions. Behind protein-protein interactions there are protein domains interacting physically with one another to perform the necessary functions. Therefore, understanding protein interactions at the domain level gives a global view of the protein interaction network, and possibly of protein functions. Two research groups used yeast two-hybrid assays to generate 5719 interactions between proteins of the yeast Saccharomyces cerevisiae. This allows us to study the large-scale conserved patterns of interactions between protein domains. Using evolutionarily conserved domains defined in a protein-domain database called PFAM (http://PFAM.wustl.edu), we apply a Maximum Likelihood Estimation method to infer interacting domains that are consistent with the observed protein-protein interactions. We estimate the probabilities of interactions between every pair of domains and measure the accuracies of our predictions at the protein level. Using the inferred domain-domain interactions, we predict interactions between proteins. Our predicted protein-protein interactions have a significant overlap with the protein-protein interactions (MIPS: http://mips.gfs.de) obtained by methods other than the two-hybrid assays. The mean correlation coefficient of the gene expression profiles for our predicted interaction pairs is significantly higher than that for random pairs. Our method has shown robustness in analyzing incomplete data sets and dealing with various experimental errors. We found several novel protein-protein interactions such as RPS0A interacting with APG17 and TAF40 interacting with SPT3, which are consistent with the functions of the proteins.
蛋白质之间的相互作用是蛋白质功能最重要的特征之一。在蛋白质 - 蛋白质相互作用的背后,是蛋白质结构域相互物理作用以执行必要的功能。因此,在结构域水平上理解蛋白质相互作用能让我们全面了解蛋白质相互作用网络,甚至可能了解蛋白质的功能。两个研究小组利用酵母双杂交试验生成了酿酒酵母蛋白质之间的5719种相互作用。这使我们能够研究蛋白质结构域之间相互作用的大规模保守模式。利用在一个名为PFAM(http://PFAM.wustl.edu)的蛋白质结构域数据库中定义的进化保守结构域,我们应用最大似然估计方法来推断与观察到的蛋白质 - 蛋白质相互作用一致的相互作用结构域。我们估计每对结构域之间相互作用的概率,并在蛋白质水平上衡量我们预测的准确性。利用推断出的结构域 - 结构域相互作用,我们预测蛋白质之间的相互作用。我们预测的蛋白质 - 蛋白质相互作用与通过双杂交试验以外的方法获得的蛋白质 - 蛋白质相互作用(MIPS:http://mips.gfs.de)有显著重叠。我们预测的相互作用对的基因表达谱的平均相关系数明显高于随机对的相关系数。我们的方法在分析不完整数据集和处理各种实验误差方面表现出稳健性。我们发现了几种新的蛋白质 - 蛋白质相互作用,如RPS0A与APG17相互作用以及TAF40与SPT3相互作用,这与蛋白质的功能一致。