Predicting metal-binding sites from protein sequence.

Affiliation

University of Trento, Trento.

Publication information

IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):203-13. doi: 10.1109/TCBB.2011.94. Epub 2011 May 16.

DOI: 10.1109/TCBB.2011.94
PMID: 21606549

Abstract

Prediction of binding sites from sequence can significantly help toward determining the function of uncharacterized proteins on a genomic scale. The task is highly challenging due to the enormous amount of alternative candidate configurations. Previous research has only considered this prediction problem starting from 3D information. When starting from sequence alone, only methods that predict the bonding state of selected residues are available. The sole exception consists of pattern-based approaches, which rely on very specific motifs and cannot be applied to discover truly novel sites. We develop new algorithmic ideas based on structured-output learning for determining transition-metal-binding sites coordinated by cysteines and histidines. The inference step (retrieving the best scoring output) is intractable for general output types (i.e., general graphs). However, under the assumption that no residue can coordinate more than one metal ion, we prove that metal binding has the algebraic structure of a matroid, allowing us to employ a very efficient greedy algorithm. We test our predictor in a highly stringent setting where the training set consists of protein chains belonging to SCOP folds different from the ones used for accuracy estimation. In this setting, our predictor achieves 56 percent precision and 60 percent recall in the identification of ligand-ion bonds.
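The greedy inference idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm or code: it assumes a simplified setting in which every candidate residue-ion bond already has a real-valued score, and the only constraint is that a residue coordinates at most one ion (a partition matroid, for which greedy selection by decreasing score is optimal). The names `greedy_assign`, `precision_recall`, the residue labels, and all scores are hypothetical. The helper at the end shows how bond-level precision and recall, the metrics reported in the abstract, would be computed.

```python
from typing import Dict, Set, Tuple

# A candidate bond: (residue label, ion index). Both are illustrative placeholders.
Bond = Tuple[str, int]


def greedy_assign(scores: Dict[Bond, float]) -> Set[Bond]:
    """Select bonds by decreasing score so that each residue is used at most once.

    Assumption: the constraint "one ion per residue" is the only one enforced,
    which makes the feasible sets a partition matroid.
    """
    chosen: Set[Bond] = set()
    used_residues: Set[str] = set()
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    for (residue, ion), score in ranked:
        if score <= 0:
            break  # remaining bonds cannot increase the total score
        if residue in used_residues:
            continue  # adding this bond would violate the constraint
        chosen.add((residue, ion))
        used_residues.add(residue)
    return chosen


def precision_recall(predicted: Set[Bond], true: Set[Bond]) -> Tuple[float, float]:
    """Bond-level precision and recall; returns 0.0 when a denominator is empty."""
    tp = len(predicted & true)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(true) if true else 0.0
    return precision, recall


# Made-up scores for four residues and two candidate ions.
example_scores: Dict[Bond, float] = {
    ("C10", 0): 2.0,
    ("C10", 1): 1.5,
    ("H25", 0): 0.8,
    ("H40", 1): -0.3,
    ("C52", 1): 1.1,
}
predicted_bonds = greedy_assign(example_scores)
true_bonds = {("C10", 0), ("H25", 0), ("C52", 0)}
precision, recall = precision_recall(predicted_bonds, true_bonds)
```

In this toy case the greedy pass keeps three bonds, two of which match the made-up ground truth. The predictor described in the abstract scores whole structured outputs with a learned model, so the sketch covers only the selection step under the stated simplification.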

Similar articles

1
Predicting metal-binding sites from protein sequence.
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):203-13. doi: 10.1109/TCBB.2011.94. Epub 2011 May 16.
2
Identifying cysteines and histidines in transition-metal-binding sites using support vector machines and neural networks.
Proteins. 2006 Nov 1;65(2):305-16. doi: 10.1002/prot.21135.
3
Prediction of 3D metal binding sites from translated gene sequences based on remote-homology templates.
Proteins. 2009 Aug 1;76(2):365-74. doi: 10.1002/prot.22352.
4
MIonSite: Ligand-specific prediction of metal ion-binding sites via enhanced AdaBoost algorithm with protein sequence information.
Anal Biochem. 2019 Feb 1;566:75-88. doi: 10.1016/j.ab.2018.11.009. Epub 2018 Nov 9.
5
PSSM-based prediction of DNA binding sites in proteins.
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
6
MetalDetector: a web server for predicting metal-binding sites and disulfide bridges in proteins from sequence.
Bioinformatics. 2008 Sep 15;24(18):2094-5. doi: 10.1093/bioinformatics/btn371. Epub 2008 Jul 16.
7
Applying the Naïve Bayes classifier with kernel density estimation to the prediction of protein-protein interaction sites.
Bioinformatics. 2010 Aug 1;26(15):1841-8. doi: 10.1093/bioinformatics/btq302. Epub 2010 Jun 6.
8
Predicting ligand binding residues and functional sites using multipositional correlations with graph theoretic clustering and kernel CCA.
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):992-1001. doi: 10.1109/TCBB.2011.136.
9
Using structural motif descriptors for sequence-based binding site prediction.
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S5. doi: 10.1186/1471-2105-8-S4-S5.
10
In silico identification of putative metal binding motifs.
Bioinformatics. 2007 Feb 1;23(3):267-71. doi: 10.1093/bioinformatics/btl617. Epub 2006 Dec 5.

Cited by

1
Computational approaches for design and redesign of metal-binding sites on proteins.
Biosci Rep. 2017 Mar 27;37(2). doi: 10.1042/BSR20160179. Print 2017 Apr 28.
2
An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins.
PLoS One. 2012;7(11):e49716. doi: 10.1371/journal.pone.0049716. Epub 2012 Nov 14.