学习从蛋白质序列预测蛋白质-蛋白质相互作用。

Learning to predict protein-protein interactions from protein sequences.

作者信息

Gomez Shawn M, Noble William Stafford, Rzhetsky Andrey

机构信息

Unité de Biochimie et Biologie Moléculaire des Insectes, Institut Pasteur, 75724 Paris Cedex 15, France.

出版信息

Bioinformatics. 2003 Oct 12;19(15):1875-81. doi: 10.1093/bioinformatics/btg352.

DOI:10.1093/bioinformatics/btg352

PMID:14555619

Abstract

In order to understand the molecular machinery of the cell, we need to know about the multitude of protein-protein interactions that allow the cell to function. High-throughput technologies provide some data about these interactions, but so far that data is fairly noisy. Therefore, computational techniques for predicting protein-protein interactions could be of significant value. One approach to predicting interactions in silico is to produce from first principles a detailed model of a candidate interaction. We take an alternative approach, employing a relatively simple model that learns dynamically from a large collection of data. In this work, we describe an attraction-repulsion model, in which the interaction between a pair of proteins is represented as the sum of attractive and repulsive forces associated with small, domain- or motif-sized features along the length of each protein. The model is discriminative, learning simultaneously from known interactions and from pairs of proteins that are known (or suspected) not to interact. The model is efficient to compute and scales well to very large collections of data. In a cross-validated comparison using known yeast interactions, the attraction-repulsion method performs better than several competing techniques.

摘要

为了理解细胞的分子机制，我们需要了解众多使细胞发挥功能的蛋白质-蛋白质相互作用。高通量技术提供了一些关于这些相互作用的数据，但到目前为止，这些数据相当嘈杂。因此，预测蛋白质-蛋白质相互作用的计算技术可能具有重要价值。一种在计算机上预测相互作用的方法是从第一原理出发构建候选相互作用的详细模型。我们采用了一种替代方法，使用一个相对简单的模型，该模型从大量数据中动态学习。在这项工作中，我们描述了一种吸引-排斥模型，其中一对蛋白质之间的相互作用表示为与沿着每个蛋白质长度的小的、结构域或基序大小的特征相关的吸引力和排斥力的总和。该模型具有判别性，能同时从已知的相互作用以及已知（或怀疑）不相互作用的蛋白质对中学习。该模型计算效率高，并且能很好地扩展到非常大的数据集合。在使用已知酵母相互作用进行的交叉验证比较中，吸引-排斥方法的表现优于几种竞争技术。

相似文献

Learning to predict protein-protein interactions from protein sequences.

Bioinformatics. 2003 Oct 12;19(15):1875-81. doi: 10.1093/bioinformatics/btg352.

Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.

Bioinformatics. 2007 Dec 1;23(23):3147-54. doi: 10.1093/bioinformatics/btm505. Epub 2007 Oct 17.

An integrated approach to the prediction of domain-domain interactions.

BMC Bioinformatics. 2006 May 25;7:269. doi: 10.1186/1471-2105-7-269.

Structure-templated predictions of novel protein interactions from sequence information.

PLoS Comput Biol. 2007 Sep;3(9):1783-9. doi: 10.1371/journal.pcbi.0030182.

Clustering-based approach for predicting motif pairs from protein interaction data.

J Bioinform Comput Biol. 2009 Aug;7(4):701-16. doi: 10.1142/s0219720009004266.

A novel structure-based encoding for machine-learning applied to the inference of SH3 domain specificity.

Bioinformatics. 2006 Oct 1;22(19):2333-9. doi: 10.1093/bioinformatics/btl403. Epub 2006 Jul 26.

Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships.

Bioinformatics. 2005 Aug 15;21(16):3360-8. doi: 10.1093/bioinformatics/bti522. Epub 2005 Jun 16.

An ensemble of K-local hyperplanes for predicting protein-protein interactions.

Bioinformatics. 2006 May 15;22(10):1207-10. doi: 10.1093/bioinformatics/btl055. Epub 2006 Feb 15.

Predicting protein-protein interactions using signature products.

Bioinformatics. 2005 Jan 15;21(2):218-26. doi: 10.1093/bioinformatics/bth483. Epub 2004 Aug 19.

Orthogonal kernel machine for the prediction of functional sites in proteins.

IEEE Trans Syst Man Cybern B Cybern. 2005 Feb;35(1):100-6. doi: 10.1109/tsmcb.2004.840723.

引用本文的文献

Topology-driven negative sampling enhances generalizability in protein-protein interaction prediction.

Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf148.

Improved cytokine-receptor interaction prediction by exploiting the negative sample space.

BMC Bioinformatics. 2020 Oct 31;21(1):493. doi: 10.1186/s12859-020-03835-5.

Classification in biological networks with hypergraphlet kernels.

Bioinformatics. 2021 May 17;37(7):1000-1007. doi: 10.1093/bioinformatics/btaa768.

Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.

Bioinformatics. 2019 Jul 15;35(14):2395-2402. doi: 10.1093/bioinformatics/bty995.

Evaluating the impact of topological protein features on the negative examples selection.

BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):417. doi: 10.1186/s12859-018-2385-x.

Predicting protein-protein interactions through sequence-based deep learning.

Bioinformatics. 2018 Sep 1;34(17):i802-i810. doi: 10.1093/bioinformatics/bty573.

Prediction of cassava protein interactome based on interolog method.

Sci Rep. 2017 Dec 8;7(1):17206. doi: 10.1038/s41598-017-17633-2.

Implementation and comparison of kernel-based learning methods to predict metabolic networks.

Netw Model Anal Health Inform Bioinform. 2016;5(1):26. doi: 10.1007/s13721-016-0134-5. Epub 2016 Jul 15.

3DIANA: 3D Domain Interaction Analysis: A Toolbox for Quaternary Structure Modeling.

Biophys J. 2016 Feb 23;110(4):766-75. doi: 10.1016/j.bpj.2015.11.3519. Epub 2016 Jan 7.

Fundamentals of protein interaction network mapping.

Mol Syst Biol. 2015 Dec 17;11(12):848. doi: 10.15252/msb.20156351.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

学习从蛋白质序列预测蛋白质-蛋白质相互作用。

Learning to predict protein-protein interactions from protein sequences.

作者信息

Gomez Shawn M, Noble William Stafford, Rzhetsky Andrey

机构信息

Unité de Biochimie et Biologie Moléculaire des Insectes, Institut Pasteur, 75724 Paris Cedex 15, France.

出版信息

Bioinformatics. 2003 Oct 12;19(15):1875-81. doi: 10.1093/bioinformatics/btg352.

DOI:10.1093/bioinformatics/btg352

PMID:14555619

Abstract

摘要

学习从蛋白质序列预测蛋白质-蛋白质相互作用。

Learning to predict protein-protein interactions from protein sequences.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

学习从蛋白质序列预测蛋白质-蛋白质相互作用。

Learning to predict protein-protein interactions from protein sequences.

作者信息

机构信息

出版信息

相似文献

引用本文的文献