Proteus：一种用于预测内在无序蛋白质中无序到有序转变结合区域的随机森林分类器。

Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins.

作者信息

Basu Sankar, Söderquist Fredrik, Wallner Björn

机构信息

Bioinformatics Division, Department of Physics, Chemistry and Biology, Linköping University, Linköping, Sweden.

Department of Biochemistry, University of Calcutta, Kolkata, 700019, India.

出版信息

J Comput Aided Mol Des. 2017 May;31(5):453-466. doi: 10.1007/s10822-017-0020-y. Epub 2017 Apr 1.

DOI:10.1007/s10822-017-0020-y

PMID:28365882

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5429364/

Abstract

The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents 'Proteus', a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible 'disorder-to-order' transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus.

摘要

在过去的十五年半里，计算结构生物学界的关注焦点发生了巨大转变，从经典的蛋白质结构预测问题转向了对内在无序蛋白质（IDP）或含有无序区域的蛋白质（IDPR）的可能理解。当前的兴趣在于揭示嵌入在IDP/IDPR氨基酸序列中的无序到有序转变密码。无序蛋白质的特征是具有大量的结构可塑性，这使得它们在与不同伙伴结合时具有混杂性，在细胞活动中具有多功能性，并且在折叠能量景观方面是非典型的，类似于部分折叠的熔球。此外，它们与几种致命的人类疾病（如癌症、心血管疾病和神经退行性疾病）有关，这使得它们成为有吸引力的药物靶点，并且对于从生物化学角度理解这些疾病很重要。对IDP结构集合的研究相当困难，特别是对于瞬时相互作用。当与结构化伙伴结合时，IDPR在复合物中会采用有序构象。经历这种无序到有序转变的残基称为多变残基，通常在短的连续片段中发现，而理解IDP/IDPR作用方式的第一步将是预测这些残基。有一些可用的方法可以从氨基酸序列预测这些多变片段；然而，文献中报道的它们的性能显然还有改进的空间。在此背景下，当前的研究提出了“Proteus”，这是一种随机森林分类器，可预测残基在与潜在伙伴蛋白结合时经历无序到有序转变的可能性。该预测基于仅使用氨基酸序列即可计算的特征。Proteus与现有方法相比具有优势，在独立数据集上预测的真阳性数量是第二好方法的两倍（55%对27%），并且精度更高。当前的研究还揭示了一种可能的“无序到有序”转变共识，虽然尚未完全理清，但嵌入在IDP的氨基酸序列中。还提出了一些指导方针，用于使用Proteus进行涉及IDPR的实际结构建模。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/896c/5429364/805c96ea1c5f/10822_2017_20_Fig1_HTML.jpg

相似文献

Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins.

J Comput Aided Mol Des. 2017 May;31(5):453-466. doi: 10.1007/s10822-017-0020-y. Epub 2017 Apr 1.

Intrinsic disorder-based protein interactions and their modulators.

Curr Pharm Des. 2013;19(23):4191-213. doi: 10.2174/1381612811319230005.

Quantifying Protein Disorder through Measures of Excess Conformational Entropy.

J Phys Chem B. 2016 May 19;120(19):4341-50. doi: 10.1021/acs.jpcb.6b00658. Epub 2016 May 4.

idpr: A package for profiling and analyzing Intrinsically Disordered Proteins in R.

PLoS One. 2022 Apr 18;17(4):e0266929. doi: 10.1371/journal.pone.0266929. eCollection 2022.

Do sequence neighbours of intrinsically disordered regions promote structural flexibility in intrinsically disordered proteins?

J Struct Biol. 2020 Feb 1;209(2):107428. doi: 10.1016/j.jsb.2019.107428. Epub 2019 Nov 20.

Proteins without unique 3D structures: biotechnological applications of intrinsically unstable/disordered proteins.

Biotechnol J. 2015 Mar;10(3):356-66. doi: 10.1002/biot.201400374. Epub 2014 Oct 6.

Predicting Secondary Structure Propensities in IDPs Using Simple Statistics from Three-Residue Fragments.

J Mol Biol. 2020 Sep 4;432(19):5447-5459. doi: 10.1016/j.jmb.2020.07.026. Epub 2020 Aug 6.

Intrinsically disordered proteins and structured proteins with intrinsically disordered regions have different functional roles in the cell.

PLoS One. 2019 Aug 19;14(8):e0217889. doi: 10.1371/journal.pone.0217889. eCollection 2019.

Order, Disorder, and Everything in Between.

Molecules. 2016 Aug 19;21(8):1090. doi: 10.3390/molecules21081090.

Predicting Conformational Disorder.

Methods Mol Biol. 2016;1415:265-99. doi: 10.1007/978-1-4939-3572-7_14.

引用本文的文献

Intrinsic Disorder and Other Malleable Arsenals of Evolved Protein Multifunctionality.

J Mol Evol. 2024 Dec;92(6):669-684. doi: 10.1007/s00239-024-10196-7. Epub 2024 Aug 30.

Leveraging machine learning models for peptide-protein interaction prediction.

RSC Chem Biol. 2024 Mar 13;5(5):401-417. doi: 10.1039/d3cb00208j. eCollection 2024 May 8.

Machine learning model of the catalytic efficiency and substrate specificity of acyl-ACP thioesterase variants generated from natural and directed evolution.

Front Bioeng Biotechnol. 2024 Apr 11;12:1379121. doi: 10.3389/fbioe.2024.1379121. eCollection 2024.

Leveraging Machine Learning Models for Peptide-Protein Interaction Prediction.

ArXiv. 2024 Feb 7:arXiv:2310.18249v2.

Elucidating the functional roles of prokaryotic proteins using big data and artificial intelligence.

FEMS Microbiol Rev. 2023 Jan 16;47(1). doi: 10.1093/femsre/fuad003.

Improving peptide-protein docking with AlphaFold-Multimer using forced sampling.

Front Bioinform. 2022 Sep 26;2:959160. doi: 10.3389/fbinf.2022.959160. eCollection 2022.

InterPepRank: Assessment of Docked Peptide Conformations by a Deep Graph Network.

Front Bioinform. 2021 Oct 25;1:763102. doi: 10.3389/fbinf.2021.763102. eCollection 2021.

Prediction of protein-protein interaction sites in intrinsically disordered proteins.

Front Mol Biosci. 2022 Sep 30;9:985022. doi: 10.3389/fmolb.2022.985022. eCollection 2022.

Capturing a Crucial 'Disorder-to-Order Transition' at the Heart of the Coronavirus Molecular Pathology-Triggered by Highly Persistent, Interchangeable Salt-Bridges.

Vaccines (Basel). 2022 Feb 16;10(2):301. doi: 10.3390/vaccines10020301.

Intrinsically disordered proteins identified in the aggregate proteome serve as biomarkers of neurodegeneration.

Metab Brain Dis. 2022 Jan;37(1):147-152. doi: 10.1007/s11011-021-00791-8. Epub 2021 Aug 4.

本文引用的文献

The alphabet of intrinsic disorder: I. Act like a Pro: On the abundance and roles of proline residues in intrinsically disordered proteins.

Intrinsically Disord Proteins. 2013 Apr 1;1(1):e24360. doi: 10.4161/idp.24360. eCollection 2013 Jan-Dec.

ProQ3D: improved model quality assessments using deep learning.

Bioinformatics. 2017 May 15;33(10):1578-1580. doi: 10.1093/bioinformatics/btw819.

ProQ3: Improved model quality assessments using Rosetta energy terms.

Sci Rep. 2016 Oct 4;6:33509. doi: 10.1038/srep33509.

Globular-disorder transition in proteins: a compromise between hydrophobic and electrostatic interactions?

Phys Chem Chem Phys. 2016 Aug 17;18(33):23207-14. doi: 10.1039/c6cp03185d.

Finding correct protein-protein docking models using ProQDock.

Bioinformatics. 2016 Jun 15;32(12):i262-i270. doi: 10.1093/bioinformatics/btw257.

Conformational Entropy of Intrinsically Disordered Proteins from Amino Acid Triads.

Sci Rep. 2015 Jul 3;5:11740. doi: 10.1038/srep11740.

Intrinsically disordered energy landscapes.

Sci Rep. 2015 May 22;5:10386. doi: 10.1038/srep10386.

Association between intrinsic disorder and serine/threonine phosphorylation in Mycobacterium tuberculosis.

PeerJ. 2015 Jan 8;3:e724. doi: 10.7717/peerj.724. eCollection 2015.

Intrinsically disordered proteins in cellular signalling and regulation.

Nat Rev Mol Cell Biol. 2015 Jan;16(1):18-29. doi: 10.1038/nrm3920.

DISOPRED3: precise disordered region predictions with annotated protein-binding activity.

Bioinformatics. 2015 Mar 15;31(6):857-63. doi: 10.1093/bioinformatics/btu744. Epub 2014 Nov 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Proteus：一种用于预测内在无序蛋白质中无序到有序转变结合区域的随机森林分类器。

Proteus: a random forest classifier to predict disorder-to-order transitioning binding regions in intrinsically disordered proteins.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献