Unleashing the power of meta-threading for evolution/structure-based function inference of proteins.

Author information

Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA; Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, USA.

Publication information

Front Genet. 2013 Jun 19;4:118. doi: 10.3389/fgene.2013.00118. eCollection 2013.

DOI:10.3389/fgene.2013.00118

PMID:23802014

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3686302/

Abstract

Protein threading is widely used in the prediction of protein structure and the subsequent functional annotation. Most threading approaches employ similar criteria for the template identification for use in both protein structure and function modeling. Using structure similarity alone might result in a high false positive rate in protein function inference, which suggests that selecting functional templates should be subject to a different set of constraints. In this study, we extend the functionality of eThread, a recently developed approach to meta-threading, focusing on the optimal selection of functional templates. We optimized the selection of template proteins to cover a broad spectrum of protein molecular function: ligand, metal, inorganic cluster, protein, and nucleic acid binding. In large-scale benchmarks, we demonstrate that the recognition rates in identifying templates that bind molecular partners in similar locations are very high, typically 70-80%, at the expense of a relatively low false positive rate. eThread also provides useful insights into the chemical properties of binding molecules and the structural features of binding. For instance, the sensitivity in recognizing similar protein-binding interfaces is 58% at only 18% false positive rate. Furthermore, in comparative analysis, we demonstrate that meta-threading supported by machine learning outperforms single-threading approaches in functional template selection. We show that meta-threading effectively detects many facets of protein molecular function, even in a low-sequence identity regime. The enhanced version of eThread is freely available as a webserver and stand-alone software at http://www.brylinski.org/ethread.
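
The abstract reports performance as a sensitivity paired with a false positive rate (for example, 58% sensitivity at an 18% false positive rate for protein-binding interfaces). As a minimal sketch of how these two quantities relate to the outcome counts of a binary template classifier, the snippet below computes both from a confusion table. The class name, function names, and counts are hypothetical and chosen only to reproduce the quoted percentages; they are not taken from the eThread benchmark.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Outcome counts for a binary 'functional template' classifier."""
    tp: int  # functionally similar templates that were selected
    fn: int  # functionally similar templates that were missed
    fp: int  # dissimilar templates that were wrongly selected
    tn: int  # dissimilar templates that were correctly rejected


def sensitivity(c: Confusion) -> float:
    """True positive rate: TP / (TP + FN)."""
    return c.tp / (c.tp + c.fn)


def false_positive_rate(c: Confusion) -> float:
    """False positive rate: FP / (FP + TN)."""
    return c.fp / (c.fp + c.tn)


# Hypothetical counts (assumption, not benchmark data): 100 true
# functional templates and 100 non-functional ones.
example = Confusion(tp=58, fn=42, fp=18, tn=82)

print(f"sensitivity         = {sensitivity(example):.2f}")
print(f"false positive rate = {false_positive_rate(example):.2f}")
```

Because the two rates have different denominators (positives versus negatives), a method can raise its sensitivity without changing its false positive rate only if it ranks true functional templates above dissimilar ones more reliably.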

Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0cd/3686302/5096233e6902/fgene-04-00118-g001.jpg

Similar articles

Prediction of protein-protein interaction sites from weakly homologous template structures using meta-threading and machine learning.

J Mol Recognit. 2015 Jan;28(1):35-48. doi: 10.1002/jmr.2410.

eThread: a highly optimized machine learning-based approach to meta-threading and the modeling of protein tertiary structures.

PLoS One. 2012;7(11):e50200. doi: 10.1371/journal.pone.0050200. Epub 2012 Nov 21.

eFindSite: improved prediction of ligand binding sites in protein models using meta-threading, machine learning and auxiliary ligands.

J Comput Aided Mol Des. 2013 Jun;27(6):551-67. doi: 10.1007/s10822-013-9663-5. Epub 2013 Jul 10.

Does inclusion of residue-residue contact information boost protein threading?

Proteins. 2019 Jul;87(7):596-606. doi: 10.1002/prot.25684. Epub 2019 Mar 28.

Developing eThread pipeline using SAGA-pilot abstraction for large-scale structural bioinformatics.

Biomed Res Int. 2014;2014:348725. doi: 10.1155/2014/348725. Epub 2014 Jun 9.

The utility of artificially evolved sequences in protein threading and fold recognition.

J Theor Biol. 2013 Jul 7;328:77-88. doi: 10.1016/j.jtbi.2013.03.018. Epub 2013 Mar 27.

eVolver: an optimization engine for evolving protein sequences to stabilize the respective structures.

BMC Res Notes. 2013 Jul 31;6:303. doi: 10.1186/1756-0500-6-303.

LOMETS2: improved meta-threading server for fold-recognition and structure-based function annotation for distant-homology proteins.

Nucleic Acids Res. 2019 Jul 2;47(W1):W429-W436. doi: 10.1093/nar/gkz384.

DisCovER: distance- and orientation-based covariational threading for weakly homologous proteins.

Proteins. 2022 Feb;90(2):579-588. doi: 10.1002/prot.26254. Epub 2021 Oct 11.

Cited by

How much metagenome data is needed for protein structure prediction: The advantages of targeted approach from the ecological and evolutionary perspectives.

Imeta. 2022 Mar 6;1(1):e9. doi: 10.1002/imt2.9. eCollection 2022 Mar.

Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.

Brief Bioinform. 2020 Jul 15;21(4):1437-1447. doi: 10.1093/bib/bbz081.

Binding site matching in rational drug design: algorithms and applications.

Brief Bioinform. 2019 Nov 27;20(6):2167-2184. doi: 10.1093/bib/bby078.

Calculating an optimal box size for ligand docking and virtual screening against experimental and predicted binding pockets.

J Cheminform. 2015 May 15;7:18. doi: 10.1186/s13321-015-0067-5. eCollection 2015.

Developing eThread pipeline using SAGA-pilot abstraction for large-scale structural bioinformatics.

Biomed Res Int. 2014;2014:348725. doi: 10.1155/2014/348725. Epub 2014 Jun 9.

The future of crystallography in drug discovery.

Expert Opin Drug Discov. 2014 Feb;9(2):125-37. doi: 10.1517/17460441.2014.872623. Epub 2013 Dec 28.

Exploring the "dark matter" of a mammalian proteome by protein structure and function modeling.

Proteome Sci. 2013 Dec 9;11(1):47. doi: 10.1186/1477-5956-11-47.

