基于远程同源模板的翻译基因序列中三维金属结合位点的预测。

Prediction of 3D metal binding sites from translated gene sequences based on remote-homology templates.

机构信息

Department of Plant Sciences, Weizmann Institute of Science, Rehovot, Israel.

出版信息

Proteins. 2009 Aug 1;76(2):365-74. doi: 10.1002/prot.22352.

PMID:19173310

Abstract

Database-scale analysis was performed to determine whether structural models, based on remote homologues, are effective in predicting 3D transition metal binding sites in proteins directly from translated gene sequences. The extent by which side chain modeling alone reduces sensitivity and selectivity is shown to be <10%. Surprisingly, selectivity was not dependent on the level of sequence homology between template and target, or on the presence of a metal ion in the structural template. Applying a modification of the CHED algorithm (Babor et al., Proteins 2008;70:208-217) and machine learning filters, a selectivity of approximately 90% was achieved for protein sequences using unrelated structural templates over a sequence identity range of 18-100%. Below approximately 18% identity, the number of analyzable target-template pairs and predictability of metal binding sites falls off sharply. A full third of structural templates were found to have target partners only in the remote homology range of 18-30%. In this range, nonmetal-binding templates are calculated to be the majority and serve to predict with 50% sensitivity at the geometric level. Overall, sensitivity at the geometric level for targets having templates in the 18-30% sequence identity range is 73%, with an average of one false positive site per true site. Protein sequences described as "unknown" in the UniProt database and composed largely of unidentified genome project sequences were studied and metal binding sites predicted. A web server for prediction of metal binding sites from protein sequence is provided.

摘要

进行了数据库规模的分析，以确定基于远程同源物的结构模型是否能够有效地直接从翻译后的基因序列预测蛋白质中的三维过渡金属结合位点。结果表明，仅通过侧链建模降低敏感性和选择性的程度<10%。令人惊讶的是，选择性不依赖于模板和目标之间的序列同源性水平，也不依赖于结构模板中是否存在金属离子。应用 CHED 算法（Babor 等人，Proteins 2008；70：208-217）和机器学习滤波器的修改版，使用不相关的结构模板，在 18-100%的序列同一性范围内，对蛋白质序列的选择性约为 90%。在大约 18%的同一性以下，可分析的目标-模板对的数量和金属结合位点的可预测性急剧下降。发现有三分之一的结构模板只有在 18-30%的远程同源范围内才有目标伴侣。在这个范围内，非金属结合模板被计算为大多数，并以 50%的敏感性在几何水平上进行预测。总体而言，在 18-30%序列同一性范围内具有模板的目标的几何水平敏感性为 73%，每个真实位点平均有一个假阳性位点。研究了在 UniProt 数据库中被描述为“未知”的蛋白质序列，这些序列主要由未识别的基因组项目序列组成，并预测了金属结合位点。提供了一个用于从蛋白质序列预测金属结合位点的网络服务器。

相似文献

Prediction of 3D metal binding sites from translated gene sequences based on remote-homology templates.

Proteins. 2009 Aug 1;76(2):365-74. doi: 10.1002/prot.22352.

Exploiting 3D structural templates for detection of metal-binding sites in protein structures.

Proteins. 2008 Mar;70(4):1206-18. doi: 10.1002/prot.21601.

Beyond the Twilight Zone: automated prediction of structural properties of proteins by recursive neural networks and remote homology information.

Proteins. 2009 Oct;77(1):181-90. doi: 10.1002/prot.22429.

Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles.

Int J Biol Macromol. 2008 Aug 15;43(2):198-208. doi: 10.1016/j.ijbiomac.2008.05.004. Epub 2008 May 21.

Flexibility of metal binding sites in proteins on a database scale.

Proteins. 2005 May 1;59(2):221-30. doi: 10.1002/prot.20431.

Metalloproteomics: high-throughput structural and functional annotation of proteins in structural genomics.

Structure. 2005 Oct;13(10):1473-86. doi: 10.1016/j.str.2005.07.014.

Protein structure prediction of CASP5 comparative modeling and fold recognition targets using consensus alignment approach and 3D assessment.

Proteins. 2003;53 Suppl 6:410-7. doi: 10.1002/prot.10548.

Protein-binding site prediction based on three-dimensional protein modeling.

Proteins. 2009;77 Suppl 9:152-6. doi: 10.1002/prot.22572.

N-terminal N-myristoylation of proteins: prediction of substrate proteins from amino acid sequence.

J Mol Biol. 2002 Apr 5;317(4):541-57. doi: 10.1006/jmbi.2002.5426.

Structural analysis of metal sites in proteins: non-heme iron sites as a case study.

J Mol Biol. 2009 May 1;388(2):356-80. doi: 10.1016/j.jmb.2009.02.052. Epub 2009 Mar 2.

引用本文的文献

MIBPred: Ensemble Learning-Based Metal Ion-Binding Protein Classifier.

ACS Omega. 2024 Feb 8;9(7):8439-8447. doi: 10.1021/acsomega.3c09587. eCollection 2024 Feb 20.

Identifying Metal Binding Sites in Proteins Using Homologous Structures, the MADE Approach.

J Chem Inf Model. 2023 Aug 28;63(16):5204-5219. doi: 10.1021/acs.jcim.3c00558. Epub 2023 Aug 9.

Learning to Identify Physiological and Adventitious Metal-Binding Sites in the Three-Dimensional Structures of Proteins by Following the Hints of a Deep Neural Network.

J Chem Inf Model. 2022 Jun 27;62(12):2951-2960. doi: 10.1021/acs.jcim.2c00522. Epub 2022 Jun 9.

mebipred: identifying metal-binding potential in protein sequence.

Bioinformatics. 2022 Jul 11;38(14):3532-3540. doi: 10.1093/bioinformatics/btac358.

A Comprehensive Review of Computation-Based Metal-Binding Prediction Approaches at the Residue Level.

Biomed Res Int. 2022 Mar 31;2022:8965712. doi: 10.1155/2022/8965712. eCollection 2022.

The Mechanism of Metal Homeostasis in Plants: A New View on the Synergistic Regulation Pathway of Membrane Proteins, Lipids and Metal Ions.

Membranes (Basel). 2021 Dec 15;11(12):984. doi: 10.3390/membranes11120984.

Identification of Disease-Related 2-Oxoglutarate/Fe (II)-Dependent Oxygenase Based on Reduced Amino Acid Cluster Strategy.

Front Cell Dev Biol. 2021 Jul 16;9:707938. doi: 10.3389/fcell.2021.707938. eCollection 2021.

The Case for Enzymatic Competitive Metal Affinity Methods.

ACS Catal. 2020 Feb 7;10(3):2298-2307. doi: 10.1021/acscatal.9b04831. Epub 2020 Jan 17.

Bioinformatics of Metalloproteins and Metalloproteomes.

Molecules. 2020 Jul 24;25(15):3366. doi: 10.3390/molecules25153366.

Simple Coordination Geometry Descriptors Allow to Accurately Predict Metal-Binding Sites in Proteins.

ACS Omega. 2019 Feb 19;4(2):3726-3731. doi: 10.1021/acsomega.8b03457. eCollection 2019 Feb 28.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于远程同源模板的翻译基因序列中三维金属结合位点的预测。

Prediction of 3D metal binding sites from translated gene sequences based on remote-homology templates.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献