The College of Mechanical & Electronic Engineering and Automatization, National University of Defense Technology, 410073 Changsha, China.
Biochem Biophys Res Commun. 2013 Oct 25;440(3):388-92. doi: 10.1016/j.bbrc.2013.09.093. Epub 2013 Oct 1.
The discovery of regulation relationship of protein interactions is crucial for the mechanism research in signaling network. Bioinformatics methods can be used to accelerate the discovery of regulation relationship between protein interactions, to distinguish the activation relations from inhibition relations. In this paper, we describe a novel method to predict the regulation relations of protein interactions in the signaling network. We detected 4,417 domain pairs that were significantly enriched in the activation or inhibition dataset. Three machine learning methods, logistic regression, support vector machines(SVMs), and naïve bayes, were explored in the classifier models. The prediction power of three different models was evaluated by 5-fold cross-validation and the independent test dataset. The area under the receiver operating characteristic curve for logistic regression, SVM, and naïve bayes models was 0.946, 0.905 and 0.809, respectively. Finally, the logistic regression classifier was applied to the human proteome-wide interaction dataset, and 2,591 interactions were predicted with their regulation relations, with 2,048 in activation and 543 in inhibition. This model based on domains can be used to identify the regulation relations between protein interactions and furthermore reconstruct signaling pathways.
蛋白质相互作用调控关系的发现对于信号网络的机制研究至关重要。生物信息学方法可用于加速蛋白质相互作用调控关系的发现,区分激活关系和抑制关系。本文描述了一种预测信号网络中蛋白质相互作用调控关系的新方法。我们检测到 4417 对在激活或抑制数据集显著富集的结构域对。在分类器模型中探索了三种机器学习方法,逻辑回归、支持向量机(SVM)和朴素贝叶斯。通过 5 折交叉验证和独立测试数据集评估了三种不同模型的预测能力。逻辑回归、SVM 和朴素贝叶斯模型的接收者操作特征曲线下面积分别为 0.946、0.905 和 0.809。最后,逻辑回归分类器被应用于人类蛋白质组范围的相互作用数据集,预测了 2591 个具有调控关系的相互作用,其中 2048 个为激活,543 个为抑制。这个基于结构域的模型可用于识别蛋白质相互作用之间的调控关系,并进一步重建信号通路。