• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习算法选择的用于区分永久性和瞬时复合物中蛋白质-蛋白质相互作用的物理化学描述符。

Physicochemical descriptors to discriminate protein-protein interactions in permanent and transient complexes selected by means of machine learning algorithms.

作者信息

Block Peter, Paern Juri, Hüllermeier Eyke, Sanschagrin Paul, Sotriffer Christoph A, Klebe Gerhard

机构信息

Department of Pharmaceutical Chemistry, Philipps-University Marburg, Marburg, Germany.

出版信息

Proteins. 2006 Nov 15;65(3):607-22. doi: 10.1002/prot.21104.

DOI:10.1002/prot.21104
PMID:16955490
Abstract

Analyzing protein-protein interactions at the atomic level is critical for our understanding of the principles governing the interactions involved in protein-protein recognition. For this purpose, descriptors explaining the nature of different protein-protein complexes are desirable. In this work, the authors introduced Epic Protein Interface Classification as a framework handling the preparation, processing, and analysis of protein-protein complexes for classification with machine learning algorithms. We applied four different machine learning algorithms: Support Vector Machines, C4.5 Decision Trees, K Nearest Neighbors, and Naïve Bayes algorithm in combination with three feature selection methods, Filter (Relief F), Wrapper, and Genetic Algorithms, to extract discriminating features from the protein-protein complexes. To compare protein-protein complexes to each other, the authors represented the physicochemical characteristics of their interfaces in four different ways, using two different atomic contact vectors, DrugScore pair potential vectors and SFCscore descriptor vectors. We classified two different datasets: (A) 172 protein-protein complexes comprising 96 monomers, forming contacts enforced by the crystallographic packing environment (crystal contacts), and 76 biologically functional homodimer complexes; (B) 345 protein-protein complexes containing 147 permanent complexes and 198 transient complexes. We were able to classify up to 94.8% of the packing enforced/functional and up to 93.6% of the permanent/transient complexes correctly. Furthermore, we were able to extract relevant features from the different protein-protein complexes and introduce an approach for scoring the importance of the extracted features.

摘要

在原子水平上分析蛋白质 - 蛋白质相互作用对于我们理解蛋白质 - 蛋白质识别中相互作用的原理至关重要。为此,需要能够解释不同蛋白质 - 蛋白质复合物性质的描述符。在这项工作中,作者引入了Epic蛋白质界面分类作为一个框架,用于处理蛋白质 - 蛋白质复合物的制备、处理和分析,以便使用机器学习算法进行分类。我们应用了四种不同的机器学习算法:支持向量机、C4.5决策树、K近邻算法和朴素贝叶斯算法,并结合三种特征选择方法,即过滤法(Relief F)、包装法和遗传算法,从蛋白质 - 蛋白质复合物中提取区分特征。为了相互比较蛋白质 - 蛋白质复合物,作者用四种不同方式表示其界面的物理化学特征,使用两种不同的原子接触向量、DrugScore对势向量和SFCscore描述符向量。我们对两个不同的数据集进行了分类:(A)172个蛋白质 - 蛋白质复合物,包括96个单体,形成由晶体堆积环境强制的接触(晶体接触),以及76个具有生物学功能的同二聚体复合物;(B)345个蛋白质 - 蛋白质复合物,包含147个永久复合物和

相似文献

1
Physicochemical descriptors to discriminate protein-protein interactions in permanent and transient complexes selected by means of machine learning algorithms.通过机器学习算法选择的用于区分永久性和瞬时复合物中蛋白质-蛋白质相互作用的物理化学描述符。
Proteins. 2006 Nov 15;65(3):607-22. doi: 10.1002/prot.21104.
2
Specificity of molecular interactions in transient protein-protein interaction interfaces.瞬时蛋白质-蛋白质相互作用界面中分子相互作用的特异性
Proteins. 2006 Nov 15;65(3):593-606. doi: 10.1002/prot.21056.
3
Atomic contact vectors in protein-protein recognition.蛋白质-蛋白质识别中的原子接触向量
Proteins. 2003 Nov 15;53(3):629-39. doi: 10.1002/prot.10432.
4
Funnel hunting in a rough terrain: learning and discriminating native energy funnels.在复杂地形中进行漏斗搜寻:学习和辨别原生能量漏斗。
Structure. 2008 Feb;16(2):269-79. doi: 10.1016/j.str.2007.11.013.
5
A new protein-protein docking scoring function based on interface residue properties.一种基于界面残基性质的新型蛋白质-蛋白质对接评分函数。
Bioinformatics. 2007 Mar 1;23(5):555-62. doi: 10.1093/bioinformatics/btl654. Epub 2007 Jan 18.
6
New measures for estimating surface complementarity and packing at protein-protein interfaces.用于估计蛋白质-蛋白质界面表面互补性和堆积的新方法。
FEBS Lett. 2010 Mar 19;584(6):1163-8. doi: 10.1016/j.febslet.2010.02.021. Epub 2010 Feb 12.
7
Predicting protein-ligand binding affinities using novel geometrical descriptors and machine-learning methods.使用新型几何描述符和机器学习方法预测蛋白质-配体结合亲和力。
J Chem Inf Comput Sci. 2004 Mar-Apr;44(2):699-703. doi: 10.1021/ci034246+.
8
Feature selection and classification of protein-protein complexes based on their binding affinities using machine learning approaches.基于机器学习方法,利用蛋白质-蛋白质复合物的结合亲和力进行特征选择和分类。
Proteins. 2014 Sep;82(9):2088-96. doi: 10.1002/prot.24564. Epub 2014 Apr 16.
9
Classification of faces in man and machine.人类与机器中的面部分类。
Neural Comput. 2006 Jan;18(1):143-65. doi: 10.1162/089976606774841611.
10
Characterization and prediction of protein-protein interactions within and between complexes.复合物内部及之间蛋白质-蛋白质相互作用的表征与预测。
Proc Natl Acad Sci U S A. 2006 Oct 3;103(40):14718-23. doi: 10.1073/pnas.0603352103. Epub 2006 Sep 26.

引用本文的文献

1
Statistical analysis of sequential motifs at biologically relevant protein-protein interfaces.生物相关蛋白质-蛋白质界面处序列基序的统计分析。
Comput Struct Biotechnol J. 2024 Mar 7;23:1244-1259. doi: 10.1016/j.csbj.2024.03.004. eCollection 2024 Dec.
2
Prediction of transient and permanent protein interactions using AI methods.使用人工智能方法预测瞬时和永久蛋白质相互作用。
Bioinformation. 2023 Jun 30;19(6):749-753. doi: 10.6026/97320630019749. eCollection 2023.
3
Protein Complex Organization Imposes Constraints on Proteome Dysregulation in Cancer.
蛋白质复合体组织对癌症中蛋白质组失调施加限制。
Front Bioinform. 2021 Aug 30;1:723482. doi: 10.3389/fbinf.2021.723482. eCollection 2021.
4
Roles of Physicochemical and Structural Properties of RNA-Binding Proteins in Predicting the Activities of Trans-Acting Splicing Factors with Machine Learning.基于物理化学和结构特性的 RNA 结合蛋白在机器学习预测反式剪接因子活性中的作用。
Int J Mol Sci. 2022 Apr 17;23(8):4426. doi: 10.3390/ijms23084426.
5
Are transient protein-protein interactions more dispensable?瞬时蛋白-蛋白相互作用更可有可无吗?
PLoS Comput Biol. 2022 Apr 11;18(4):e1010013. doi: 10.1371/journal.pcbi.1010013. eCollection 2022 Apr.
6
Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology.通过同源建模来模拟同聚体和异聚体的蛋白质四级结构,超越二元相互作用。
Sci Rep. 2017 Sep 5;7(1):10480. doi: 10.1038/s41598-017-09654-8.
7
3D deep convolutional neural networks for amino acid environment similarity analysis.用于氨基酸环境相似性分析的3D深度卷积神经网络。
BMC Bioinformatics. 2017 Jun 14;18(1):302. doi: 10.1186/s12859-017-1702-0.
8
ProteinsPlus: a web portal for structure analysis of macromolecules.蛋白质+: 用于分析大分子结构的网络门户。
Nucleic Acids Res. 2017 Jul 3;45(W1):W337-W343. doi: 10.1093/nar/gkx333.
9
Implication of Terminal Residues at Protein-Protein and Protein-DNA Interfaces.蛋白质-蛋白质和蛋白质-DNA界面处末端残基的影响
PLoS One. 2016 Sep 9;11(9):e0162143. doi: 10.1371/journal.pone.0162143. eCollection 2016.
10
Use B-factor related features for accurate classification between protein binding interfaces and crystal packing contacts.利用与B因子相关的特征,对蛋白质结合界面和晶体堆积接触进行准确分类。
BMC Bioinformatics. 2014;15 Suppl 16(Suppl 16):S3. doi: 10.1186/1471-2105-15-S16-S3. Epub 2014 Dec 8.