CAID2实验中无序结合区域预测的评估。

Evaluation of predictions of disordered binding regions in the CAID2 experiment.

作者信息

Zhang Fuhao, Kurgan Lukasz

机构信息

College of Information Engineering, Northwest A & F University, China.

Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA.

出版信息

Comput Struct Biotechnol J. 2024 Dec 17;27:78-88. doi: 10.1016/j.csbj.2024.12.009. eCollection 2025.

DOI:10.1016/j.csbj.2024.12.009

PMID:39811792

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11732247/

Abstract

A large portion of the Intrinsically Disordered Regions (IDRs) in protein sequences interact with proteins, nucleic acids, and other types of ligands. Correspondingly, dozens of sequence-based predictors of binding IDRs were developed. A recently completed second community-based Critical Assessments of protein Intrinsic Disorder prediction (CAID2) evaluated 32 predictors of binding IDRs. However, CAID2 considered a rather narrow scenario by testing on 78 proteins with binding IDRs and not differentiating between different ligands, in spite that virtually all predictors target IDRs that interact with specific types of ligands. In that scenario, several intrinsic disorder predictors predict binding IDRs with accuracy equivalent to the best predictors of binding IDRs since large majority of IDRs in the 78 test proteins are binding. We substantially extended the CAID2's evaluation by using the entire CAID2 dataset of 348 proteins and considering several arguably more practical scenarios. We assessed whether predictors accurately differentiate binding IDRs from other types of IDRs and how they perform when predicting IDRs that interact with different ligand types. We found that intrinsic disorder predictors cannot accurately identify binding IDRs among other disordered regions, majority of the predictors of binding IDRs are ligand type agnostic (i.e., they cross predict binding in IDRs that interact with ligands that they do not cover), and only a handful of predictors of binding IDRs perform relatively well and generate reasonably low amounts of cross predictions. We also suggest a number of future research directions that would move this active field of research forward.

摘要

蛋白质序列中很大一部分内在无序区域（IDR）会与蛋白质、核酸及其他类型的配体相互作用。相应地，人们开发了几十种基于序列的结合IDR预测工具。最近完成的第二次基于社区的蛋白质内在无序预测关键评估（CAID2）对32种结合IDR预测工具进行了评估。然而，CAID2考虑的情况相当有限，它仅对78个具有结合IDR的蛋白质进行测试，且未区分不同的配体，尽管几乎所有预测工具针对的都是与特定类型配体相互作用的IDR。在这种情况下，一些内在无序预测工具预测结合IDR的准确性与最佳结合IDR预测工具相当，因为78个测试蛋白质中的大多数IDR都是具有结合能力的。我们通过使用包含348个蛋白质的整个CAID2数据集并考虑几种更具实际意义的情况，对CAID2的评估进行了大幅扩展。我们评估了预测工具能否准确区分结合IDR与其他类型的IDR，以及它们在预测与不同配体类型相互作用的IDR时的表现。我们发现，内在无序预测工具无法在其他无序区域中准确识别结合IDR，大多数结合IDR预测工具对配体类型不敏感（即它们会交叉预测与未涵盖配体相互作用的IDR中的结合情况），只有少数结合IDR预测工具表现相对较好，产生的交叉预测数量相对较少。我们还提出了一些未来的研究方向，以推动这个活跃的研究领域向前发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee07/11732247/a6db2a21585c/ga1.jpg

相似文献

Evaluation of predictions of disordered binding regions in the CAID2 experiment.CAID2实验中无序结合区域预测的评估。

Comput Struct Biotechnol J. 2024 Dec 17;27:78-88. doi: 10.1016/j.csbj.2024.12.009. eCollection 2025.

Assessment of Disordered Linker Predictions in the CAID2 Experiment.CAID2 实验中无序连接预测的评估。

Biomolecules. 2024 Feb 28;14(3):287. doi: 10.3390/biom14030287.

Compositional Bias of Intrinsically Disordered Proteins and Regions and Their Predictions.固有无序蛋白质和区域的组成偏倚及其预测。

Biomolecules. 2022 Jun 25;12(7):888. doi: 10.3390/biom12070888.

Computational prediction of disordered binding regions.无序结合区域的计算预测

Comput Struct Biotechnol J. 2023 Feb 10;21:1487-1497. doi: 10.1016/j.csbj.2023.02.018. eCollection 2023.

PredIDR2: Improving accuracy of protein intrinsic disorder prediction by updating deep convolutional neural network and supplementing DisProt data.PredIDR2：通过更新深度卷积神经网络和补充DisProt数据提高蛋白质内在无序预测的准确性。

Int J Biol Macromol. 2025 May;306(Pt 4):141801. doi: 10.1016/j.ijbiomac.2025.141801. Epub 2025 Mar 5.

Computational prediction of functions of intrinsically disordered regions.计算预测无规卷曲区域的功能。

Prog Mol Biol Transl Sci. 2019;166:341-369. doi: 10.1016/bs.pmbts.2019.04.006. Epub 2019 May 20.

CLIP: accurate prediction of disordered linear interacting peptides from protein sequences using co-evolutionary information.CLIP：利用共进化信息从蛋白质序列中准确预测无序线性相互作用肽段。

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac502.

DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning.DeepDISOBind：通过深度多任务学习准确预测 RNA、DNA 和蛋白质结合的无规卷曲残基。

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab521.

Prediction of protein-protein interactions using sequences of intrinsically disordered regions.利用序列的固有无序区域预测蛋白质-蛋白质相互作用。

Proteins. 2023 Jul;91(7):980-990. doi: 10.1002/prot.26486. Epub 2023 Mar 20.

Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins.内在无序预测的比较评估，重点关注蛋白质和核酸结合蛋白。

Biomolecules. 2020 Dec 4;10(12):1636. doi: 10.3390/biom10121636.

引用本文的文献

Empirical Assessment of Sequence-Based Predictions of Intrinsically Disordered Regions Involved in Phase Separation.基于序列的相分离相关内在无序区域预测的实证评估

Biomolecules. 2025 Jul 25;15(8):1079. doi: 10.3390/biom15081079.

本文引用的文献

PredIDR: Accurate prediction of protein intrinsic disorder regions using deep convolutional neural network.PredIDR：使用深度卷积神经网络准确预测蛋白质内在无序区域。

Int J Biol Macromol. 2025 Jan;284(Pt 1):137665. doi: 10.1016/j.ijbiomac.2024.137665. Epub 2024 Nov 19.

flDPnn2: Accurate and Fast Predictor of Intrinsic Disorder in Proteins.flDPnn2：一种准确快速预测蛋白质内无序的方法。

J Mol Biol. 2024 Sep 1;436(17):168605. doi: 10.1016/j.jmb.2024.168605. Epub 2024 May 8.

Improved protein complex prediction with AlphaFold-multimer by denoising the MSA profile.利用 AlphaFold-multimer 对 MSA 谱图进行去噪，提高蛋白质复合物预测能力。

PLoS Comput Biol. 2024 Jul 25;20(7):e1012253. doi: 10.1371/journal.pcbi.1012253. eCollection 2024 Jul.

Taxonomy-specific assessment of intrinsic disorder predictions at residue and region levels in higher eukaryotes, protists, archaea, bacteria and viruses.对高等真核生物、原生生物、古细菌、细菌和病毒中残基和区域水平的内在无序预测进行分类学特异性评估。

Comput Struct Biotechnol J. 2024 Apr 27;23:1968-1977. doi: 10.1016/j.csbj.2024.04.059. eCollection 2024 Dec.

Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。

Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.

Assessment of Disordered Linker Predictions in the CAID2 Experiment.CAID2 实验中无序连接预测的评估。

Biomolecules. 2024 Feb 28;14(3):287. doi: 10.3390/biom14030287.

Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins.用于预测内在无序、无序含量和完全无序蛋白质的AlphaFold2与无序预测器的比较评估

Comput Struct Biotechnol J. 2023 Jun 2;21:3248-3258. doi: 10.1016/j.csbj.2023.06.001. eCollection 2023.

DisoFLAG: accurate prediction of protein intrinsic disorder and its functions using graph-based interaction protein language model.DisoFLAG：基于图的互作蛋白语言模型准确预测蛋白质固有无序及其功能。

BMC Biol. 2024 Jan 2;22(1):3. doi: 10.1186/s12915-023-01803-y.

IDP-LM: Prediction of protein intrinsic disorder and disorder functions based on language models.IDP-LM：基于语言模型的蛋白质固有无序预测和无序功能预测。

PLoS Comput Biol. 2023 Nov 22;19(11):e1011657. doi: 10.1371/journal.pcbi.1011657. eCollection 2023 Nov.

DisProt in 2024: improving function annotation of intrinsically disordered proteins.2024 年的 DisProt：改善无序蛋白质的功能注释。

Nucleic Acids Res. 2024 Jan 5;52(D1):D434-D441. doi: 10.1093/nar/gkad928.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

CAID2实验中无序结合区域预测的评估。

Evaluation of predictions of disordered binding regions in the CAID2 experiment.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献