Suppr超能文献

蛋白酶抑制剂与抗原-抗体复合物中界面残基的鉴定:一种支持向量机方法。

Identification of interface residues in protease-inhibitor and antigen-antibody complexes: a support vector machine approach.

作者信息

Yan Changhui, Honavar Vasant, Dobbs Drena

机构信息

Artificial Intelligence Research Laboratory, Iowa State University, Atanasoff Hall 226, Ames, IA 50011-1040, USA.

出版信息

Neural Comput Appl. 2004 Jun 1;13(2):123-129. doi: 10.1007/s00521-004-0414-3.

Abstract

In this paper, we describe a machine learning approach for sequence-based prediction of protein-protein interaction sites. A support vector machine (SVM) classifier was trained to predict whether or not a surface residue is an interface residue (i.e., is located in the protein-protein interaction surface), based on the identity of the target residue and its ten sequence neighbors. Separate classifiers were trained on proteins from two categories of complexes, antibody-antigen and protease-inhibitor. The effectiveness of each classifier was evaluated using leave-one-out (jack-knife) cross-validation. Interface and non-interface residues were classified with relatively high sensitivity (82.3% and 78.5%) and specificity (81.0% and 77.6%) for proteins in the antigen-antibody and protease-inhibitor complexes, respectively. The correlation between predicted and actual labels was 0.430 and 0.462, indicating that the method performs substantially better than chance (zero correlation). Combined with recently developed methods for identification of surface residues from sequence information, this offers a promising approach to predict residues involved in protein-protein interactions from sequence information alone.

摘要

在本文中,我们描述了一种基于序列预测蛋白质-蛋白质相互作用位点的机器学习方法。训练了一个支持向量机(SVM)分类器,以根据目标残基及其十个序列邻域的同一性来预测表面残基是否为界面残基(即位于蛋白质-蛋白质相互作用表面)。针对来自抗体-抗原和蛋白酶-抑制剂两类复合物的蛋白质分别训练了分类器。使用留一法(刀切法)交叉验证评估每个分类器的有效性。对于抗原-抗体和蛋白酶-抑制剂复合物中的蛋白质,界面残基和非界面残基的分类分别具有相对较高的灵敏度(82.3%和78.5%)和特异性(81.0%和77.6%)。预测标签与实际标签之间的相关性分别为0.430和0.462,表明该方法的性能明显优于随机猜测(零相关性)。结合最近开发的从序列信息中识别表面残基的方法,这为仅从序列信息预测参与蛋白质-蛋白质相互作用的残基提供了一种有前景的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81e8/2880521/e7a18c6d6eb3/nihms67534f1.jpg

相似文献

本文引用的文献

3
Analysing six types of protein-protein interfaces.分析六种蛋白质-蛋白质相互作用界面。
J Mol Biol. 2003 Jan 10;325(2):377-87. doi: 10.1016/s0022-2836(02)01223-8.
4
Computational methods for the prediction of protein interactions.预测蛋白质相互作用的计算方法。
Curr Opin Struct Biol. 2002 Jun;12(3):368-73. doi: 10.1016/s0959-440x(02)00333-0.
5
Dissecting protein-protein recognition sites.剖析蛋白质-蛋白质识别位点。
Proteins. 2002 May 15;47(3):334-43. doi: 10.1002/prot.10085.
10
Prediction of protein surface accessibility with information theory.基于信息论的蛋白质表面可及性预测
Proteins. 2001 Mar 1;42(4):452-9. doi: 10.1002/1097-0134(20010301)42:4<452::aid-prot40>3.0.co;2-q.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验