• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多RELIEF:一种使用机器学习方法进行特征加权,从多序列比对中识别特异性决定残基的方法。

Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting.

作者信息

Ye Kai, Feenstra K Anton, Heringa Jaap, Ijzerman Adriaan P, Marchiori Elena

机构信息

Division of Medical Chemistry, LACDR, Leiden University, P.O. Box 9502, 2300 RA, Leiden, The Netherlands.

出版信息

Bioinformatics. 2008 Jan 1;24(1):18-25. doi: 10.1093/bioinformatics/btm537. Epub 2007 Nov 17.

DOI:10.1093/bioinformatics/btm537
PMID:18024975
Abstract

MOTIVATION

Identification of residues that account for protein function specificity is crucial, not only for understanding the nature of functional specificity, but also for protein engineering experiments aimed at switching the specificity of an enzyme, regulator or transporter. Available algorithms generally use multiple sequence alignments to identify residue positions conserved within subfamilies but divergent in between. However, many biological examples show a much subtler picture than simple intra-group conservation versus inter-group divergence.

RESULTS

We present multi-RELIEF, a novel approach for identifying specificity residues that is based on RELIEF, a state-of-the-art Machine-Learning technique for feature weighting. It estimates the expected 'local' functional specificity of residues from an alignment divided in multiple classes. Optionally, 3D structure information is exploited by increasing the weight of residues that have high-weight neighbors. Using ROC curves over a large body of experimental reference data, we show that (a) multi-RELIEF identifies specificity residues for the seven test sets used, (b) incorporating structural information improves prediction for specificity of interaction with small molecules and (c) comparison of multi-RELIEF with four other state-of-the-art algorithms indicates its robustness and best overall performance.

AVAILABILITY

A web-server implementation of multi-RELIEF is available at www.ibi.vu.nl/programs/multirelief. Matlab source code of the algorithm and data sets are available on request for academic use.

摘要

动机

识别决定蛋白质功能特异性的残基至关重要,这不仅有助于理解功能特异性的本质,还对旨在改变酶、调节因子或转运蛋白特异性的蛋白质工程实验具有重要意义。现有的算法通常使用多序列比对来识别亚家族内保守但亚家族间不同的残基位置。然而,许多生物学实例显示出的情况比简单的组内保守与组间差异更为微妙。

结果

我们提出了多RELIEF方法,这是一种基于RELIEF(一种用于特征加权的先进机器学习技术)来识别特异性残基的新方法。它从划分为多个类别的比对中估计残基的预期“局部”功能特异性。可选择地,通过增加具有高权重邻居的残基权重来利用三维结构信息。通过对大量实验参考数据使用ROC曲线,我们表明:(a)多RELIEF方法能够识别所使用的七个测试集的特异性残基;(b)纳入结构信息可改善与小分子相互作用特异性的预测;(c)将多RELIEF方法与其他四种先进算法进行比较表明其具有稳健性和最佳的整体性能。

可用性

多RELIEF方法的网络服务器实现可在www.ibi.vu.nl/programs/multirelief获取。算法的Matlab源代码和数据集可根据学术使用需求提供。

相似文献

1
Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting.多RELIEF:一种使用机器学习方法进行特征加权,从多序列比对中识别特异性决定残基的方法。
Bioinformatics. 2008 Jan 1;24(1):18-25. doi: 10.1093/bioinformatics/btm537. Epub 2007 Nov 17.
2
An iterative refinement algorithm for consistency based multiple structural alignment methods.一种基于一致性的多结构比对方法的迭代优化算法。
Bioinformatics. 2006 Sep 1;22(17):2087-93. doi: 10.1093/bioinformatics/btl351. Epub 2006 Jun 29.
3
SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.SVM-HUSTLE——一种用于成对蛋白质远程同源性检测的迭代半监督机器学习方法。
Bioinformatics. 2008 Mar 15;24(6):783-90. doi: 10.1093/bioinformatics/btn028. Epub 2008 Feb 1.
4
Predicting protein structure classes from function predictions.从功能预测中预测蛋白质结构类别。
Bioinformatics. 2004 Mar 22;20(5):770-6. doi: 10.1093/bioinformatics/btg483. Epub 2004 Jan 29.
5
Automated discovery of 3D motifs for protein function annotation.用于蛋白质功能注释的3D基序的自动发现。
Bioinformatics. 2006 Mar 15;22(6):723-30. doi: 10.1093/bioinformatics/btk038. Epub 2006 Jan 12.
6
Support vector machines for prediction of dihedral angle regions.用于预测二面角区域的支持向量机
Bioinformatics. 2006 Dec 15;22(24):3009-15. doi: 10.1093/bioinformatics/btl489. Epub 2006 Sep 27.
7
Multiple alignment by sequence annealing.通过序列退火进行多序列比对。
Bioinformatics. 2007 Jan 15;23(2):e24-9. doi: 10.1093/bioinformatics/btl311.
8
Bayesian search of functionally divergent protein subgroups and their function specific residues.功能趋异蛋白质亚组及其功能特异性残基的贝叶斯搜索
Bioinformatics. 2006 Oct 15;22(20):2466-74. doi: 10.1093/bioinformatics/btl411. Epub 2006 Jul 26.
9
Detecting protein dissimilarities in multiple alignments using Bayesian variable selection.使用贝叶斯变量选择检测多序列比对中的蛋白质差异。
Bioinformatics. 2007 Jan 15;23(2):245-6. doi: 10.1093/bioinformatics/btl566. Epub 2006 Nov 14.
10
PROMALS: towards accurate multiple sequence alignments of distantly related proteins.PROMALS:用于实现远缘相关蛋白质准确多序列比对
Bioinformatics. 2007 Apr 1;23(7):802-8. doi: 10.1093/bioinformatics/btm017. Epub 2007 Jan 31.

引用本文的文献

1
Unraveling viral drug targets: a deep learning-based approach for the identification of potential binding sites.解析病毒药物靶点:基于深度学习的潜在结合位点鉴定方法。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad459.
2
Recognition of fold- and function-specific sites in the ligand-binding domain of the thyroid hormone receptor-like family.识别甲状腺激素受体样家族配体结合域中的折叠和功能特定位点。
Front Endocrinol (Lausanne). 2022 Sep 29;13:981090. doi: 10.3389/fendo.2022.981090. eCollection 2022.
3
Predicting the Specificity- Determining Positions of Receptor Tyrosine Kinase Axl.
预测受体酪氨酸激酶Axl的特异性决定位点
Front Mol Biosci. 2021 Jun 14;8:658906. doi: 10.3389/fmolb.2021.658906. eCollection 2021.
4
Spatiotemporal identification of druggable binding sites using deep learning.利用深度学习对可药物结合位点进行时空识别。
Commun Biol. 2020 Oct 27;3(1):618. doi: 10.1038/s42003-020-01350-0.
5
MAGA: A Supervised Method to Detect Motifs From Annotated Groups in Alignments.MAGA:一种从比对中的注释组检测基序的监督方法。
Evol Bioinform Online. 2020 Apr 29;16:1176934320916199. doi: 10.1177/1176934320916199. eCollection 2020.
6
Exploring the computational methods for protein-ligand binding site prediction.探索蛋白质-配体结合位点预测的计算方法。
Comput Struct Biotechnol J. 2020 Feb 17;18:417-426. doi: 10.1016/j.csbj.2020.02.008. eCollection 2020.
7
Deep Analysis of Residue Constraints (DARC): identifying determinants of protein functional specificity.深度解析残基约束(DARC):鉴定蛋白质功能特异性的决定因素。
Sci Rep. 2020 Feb 3;10(1):1691. doi: 10.1038/s41598-019-55118-6.
8
Relief-based feature selection: Introduction and review.基于缓解的特征选择:介绍与综述。
J Biomed Inform. 2018 Sep;85:189-203. doi: 10.1016/j.jbi.2018.07.014. Epub 2018 Jul 18.
9
Recognition of sites of functional specialisation in all known eukaryotic protein kinase families.识别所有已知真核蛋白激酶家族中功能特化的位点。
PLoS Comput Biol. 2018 Feb 13;14(2):e1005975. doi: 10.1371/journal.pcbi.1005975. eCollection 2018 Feb.
10
Inferring joint sequence-structural determinants of protein functional specificity.推断蛋白质功能特异性的关节序列结构决定因素。
Elife. 2018 Jan 16;7:e29880. doi: 10.7554/eLife.29880.