MoRFpred，一种基于序列的计算工具，用于预测和描述蛋白质中短的无序到有序转变的结合区域。

MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins.

机构信息

Department of Electrical and Computer Engineering, University of Alberta, Edmonton, T6G 2V4, Canada.

出版信息

Bioinformatics. 2012 Jun 15;28(12):i75-83. doi: 10.1093/bioinformatics/bts209.

DOI:10.1093/bioinformatics/bts209

PMID:22689782

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3371841/

Abstract

MOTIVATION

Molecular recognition features (MoRFs) are short binding regions located within longer intrinsically disordered regions that bind to protein partners via disorder-to-order transitions. MoRFs are implicated in important processes including signaling and regulation. However, only a limited number of experimentally validated MoRFs is known, which motivates development of computational methods that predict MoRFs from protein chains.

RESULTS

We introduce a new MoRF predictor, MoRFpred, which identifies all MoRF types (α, β, coil and complex). We develop a comprehensive dataset of annotated MoRFs to build and empirically compare our method. MoRFpred utilizes a novel design in which annotations generated by sequence alignment are fused with predictions generated by a Support Vector Machine (SVM), which uses a custom designed set of sequence-derived features. The features provide information about evolutionary profiles, selected physiochemical properties of amino acids, and predicted disorder, solvent accessibility and B-factors. Empirical evaluation on several datasets shows that MoRFpred outperforms related methods: α-MoRF-Pred that predicts α-MoRFs and ANCHOR which finds disordered regions that become ordered when bound to a globular partner. We show that our predicted (new) MoRF regions have non-random sequence similarity with native MoRFs. We use this observation along with the fact that predictions with higher probability are more accurate to identify putative MoRF regions. We also identify a few sequence-derived hallmarks of MoRFs. They are characterized by dips in the disorder predictions and higher hydrophobicity and stability when compared to adjacent (in the chain) residues.

AVAILABILITY

http://biomine.ece.ualberta.ca/MoRFpred/; http://biomine.ece.ualberta.ca/MoRFpred/Supplement.pdf.

摘要

动机

分子识别特征 (MoRFs) 是位于较长的固有无序区域内的短结合区域，通过无序到有序的转变与蛋白质伴侣结合。MoRFs 参与包括信号转导和调节在内的重要过程。然而，已知的实验验证的 MoRF 数量有限，这促使开发了从蛋白质链预测 MoRF 的计算方法。

结果

我们引入了一种新的 MoRF 预测器 MoRFpred，它可以识别所有 MoRF 类型（α、β、卷曲和复杂）。我们开发了一个包含注释 MoRF 的综合数据集来构建和经验比较我们的方法。MoRFpred 采用了一种新颖的设计，其中通过序列比对生成的注释与通过支持向量机 (SVM) 生成的预测融合在一起，SVM 使用一组自定义设计的序列衍生特征。这些特征提供了有关进化特征、选定的氨基酸理化特性以及预测的无序、溶剂可及性和 B 因子的信息。在几个数据集上的经验评估表明，MoRFpred 优于相关方法：预测 α-MoRF 的 α-MoRF-Pred 和当与球状伴侣结合时变得有序的发现无序区域的 ANCHOR。我们表明，我们预测的（新的）MoRF 区域与天然 MoRF 具有非随机的序列相似性。我们利用这一观察结果以及预测概率更高的预测更准确的事实来识别可能的 MoRF 区域。我们还确定了一些 MoRF 的序列衍生特征。与相邻（在链中）残基相比，它们的特征是无序预测中的下降以及更高的疏水性和稳定性。

可用性

http://biomine.ece.ualberta.ca/MoRFpred/；http://biomine.ece.ualberta.ca/MoRFpred/Supplement.pdf。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3d2/3371841/7a5a08c24dd7/bts209f1.jpg

相似文献

MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins.MoRFpred，一种基于序列的计算工具，用于预测和描述蛋白质中短的无序到有序转变的结合区域。

Bioinformatics. 2012 Jun 15;28(12):i75-83. doi: 10.1093/bioinformatics/bts209.

MoRFPred-plus: Computational Identification of MoRFs in Protein Sequences using Physicochemical Properties and HMM profiles.MoRFPred-plus：利用物理化学性质和隐马尔可夫模型轮廓对蛋白质序列中的分子识别特征进行计算识别

J Theor Biol. 2018 Jan 21;437:9-16. doi: 10.1016/j.jtbi.2017.10.015. Epub 2017 Oct 16.

Computational identification of MoRFs in protein sequences.蛋白质序列中分子识别功能区域（MoRFs）的计算识别

Bioinformatics. 2015 Jun 1;31(11):1738-44. doi: 10.1093/bioinformatics/btv060. Epub 2015 Jan 30.

Predicting Functions of Disordered Proteins with MoRFpred.使用MoRFpred预测无序蛋白质的功能

Methods Mol Biol. 2019;1851:337-352. doi: 10.1007/978-1-4939-8736-8_19.

MoRFPred_en: Sequence-based prediction of MoRFs using an ensemble learning strategy.MoRFPred_en：使用集成学习策略基于序列预测莫尔费（MoRFs）。

J Bioinform Comput Biol. 2019 Dec;17(6):1940015. doi: 10.1142/S0219720019400158.

Predicting MoRFs in protein sequences using HMM profiles.使用隐马尔可夫模型（HMM）概况预测蛋白质序列中的分子识别特征（MoRF）。

BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):504. doi: 10.1186/s12859-016-1375-0.

MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation.MFSPSSMpred：基于上下文局部进化保守性识别无序蛋白中的短无序到有序结合区域。

BMC Bioinformatics. 2013 Oct 4;14:300. doi: 10.1186/1471-2105-14-300.

OPAL: prediction of MoRF regions in intrinsically disordered protein sequences.OPAL：预测无序蛋白质序列中的 MoRF 区域。

Bioinformatics. 2018 Jun 1;34(11):1850-1858. doi: 10.1093/bioinformatics/bty032.

Computational Prediction of MoRFs, Short Disorder-to-order Transitioning Protein Binding Regions.MoRFs的计算预测，即短的无序到有序转变的蛋白质结合区域。

Comput Struct Biotechnol J. 2019 Mar 26;17:454-462. doi: 10.1016/j.csbj.2019.03.013. eCollection 2019.

Computational Identification of MoRFs in Protein Sequences Using Hierarchical Application of Bayes Rule.使用贝叶斯规则的分层应用对蛋白质序列中的MoRF进行计算识别。

PLoS One. 2015 Oct 30;10(10):e0141603. doi: 10.1371/journal.pone.0141603. eCollection 2015.

引用本文的文献

The C-terminal end of PLIN1 displays structural disorder.PLIN1的C末端表现出结构无序。

Biochem Biophys Rep. 2025 Feb 28;42:101963. doi: 10.1016/j.bbrep.2025.101963. eCollection 2025 Jun.

The Historical Evolution and Significance of Multiple Sequence Alignment in Molecular Structure and Function Prediction.多重序列比对在分子结构与功能预测中的历史演变及意义

Biomolecules. 2024 Nov 29;14(12):1531. doi: 10.3390/biom14121531.

Integrated analysis of hub genes and intrinsically disordered regions in triple-negative breast cancer.三阴性乳腺癌中枢纽基因与内在无序区域的综合分析

J Genet Eng Biotechnol. 2024 Dec;22(4):100408. doi: 10.1016/j.jgeb.2024.100408. Epub 2024 Aug 16.

Accurate and Fast Prediction of Intrinsic Disorder Using flDPnn.使用 flDPnn 进行精确快速的固有无序预测。

Methods Mol Biol. 2025;2867:201-218. doi: 10.1007/978-1-0716-4196-5_12.

The molecular architecture of the desmosomal outer dense plaque by integrative structural modeling.整合结构建模解析桥粒外致密斑的分子结构。

Protein Sci. 2024 Dec;33(12):e5217. doi: 10.1002/pro.5217.

On the Roles of Protein Intrinsic Disorder in the Origin of Life and Evolution.论蛋白质内在无序在生命起源与进化中的作用

Life (Basel). 2024 Oct 15;14(10):1307. doi: 10.3390/life14101307.

iDLB-Pred: identification of disordered lipid binding residues in protein sequences using convolutional neural network.iDLB-Pred：使用卷积神经网络鉴定蛋白质序列中紊乱脂质结合残基

Sci Rep. 2024 Oct 21;14(1):24724. doi: 10.1038/s41598-024-75700-x.

Intrinsic Disorder and Other Malleable Arsenals of Evolved Protein Multifunctionality.内在无序与进化蛋白质多功能性的其他可塑“武器库”

J Mol Evol. 2024 Dec;92(6):669-684. doi: 10.1007/s00239-024-10196-7. Epub 2024 Aug 30.

Experimental methods to study the structure and dynamics of intrinsically disordered regions in proteins.研究蛋白质内在无序区域的结构与动力学的实验方法。

Curr Res Struct Biol. 2024 Mar 21;7:100138. doi: 10.1016/j.crstbi.2024.100138. eCollection 2024.

Advanced computational approaches to understand protein aggregation.用于理解蛋白质聚集的先进计算方法。

Biophys Rev (Melville). 2024 Apr 24;5(2):021302. doi: 10.1063/5.0180691. eCollection 2024 Jun.

本文引用的文献

SPINE-D: accurate prediction of short and long disordered regions by a single neural-network based method.SPINE-D：基于单个神经网络的方法准确预测短和长的无序区域。

J Biomol Struct Dyn. 2012;29(4):799-813. doi: 10.1080/073911012010525022.

Comprehensive comparative assessment of in-silico predictors of disordered regions.计算预测无序区域的生物信息学方法的综合比较评估

Curr Protein Pept Sci. 2012 Feb;13(1):6-18. doi: 10.2174/138920312799277938.

Structural protein descriptors in 1-dimension and their sequence-based predictions.一维结构蛋白描述符及其基于序列的预测。

Curr Protein Pept Sci. 2011 Sep;12(6):470-89. doi: 10.2174/138920311796957711.

Improved sequence-based prediction of disordered regions with multilayer fusion of multiple information sources.基于多层融合多种信息源的改进序列预测无序区域。

Bioinformatics. 2010 Sep 15;26(18):i489-96. doi: 10.1093/bioinformatics/btq373.

Understanding protein non-folding.理解蛋白质的非折叠状态。

Biochim Biophys Acta. 2010 Jun;1804(6):1231-64. doi: 10.1016/j.bbapap.2010.01.017. Epub 2010 Feb 1.

CD-HIT Suite: a web server for clustering and comparing biological sequences.CD-HIT 套件：用于聚类和比较生物序列的网络服务器。

Bioinformatics. 2010 Mar 1;26(5):680-2. doi: 10.1093/bioinformatics/btq003. Epub 2010 Jan 6.

Epitopia: a web-server for predicting B-cell epitopes.Epitopia：一个用于预测 B 细胞表位的网络服务器。

BMC Bioinformatics. 2009 Sep 14;10:287. doi: 10.1186/1471-2105-10-287.

ANCHOR: web server for predicting protein binding regions in disordered proteins.蛋白质无序区结合位点预测的网络服务器。

Bioinformatics. 2009 Oct 15;25(20):2745-6. doi: 10.1093/bioinformatics/btp518. Epub 2009 Aug 28.

Infrastructure for the life sciences: design and implementation of the UniProt website.生命科学基础设施：UniProt网站的设计与实现

BMC Bioinformatics. 2009 May 8;10:136. doi: 10.1186/1471-2105-10-136.

Prediction of protein binding regions in disordered proteins.无序蛋白质中蛋白质结合区域的预测。

PLoS Comput Biol. 2009 May;5(5):e1000376. doi: 10.1371/journal.pcbi.1000376. Epub 2009 May 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

MoRFpred，一种基于序列的计算工具，用于预测和描述蛋白质中短的无序到有序转变的结合区域。

MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

动机

结果

可用性

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献