Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China.
Nat Protoc. 2023 Nov;18(11):3157-3172. doi: 10.1038/s41596-023-00876-x. Epub 2023 Sep 22.
Intrinsic disorder is instrumental for a wide range of protein functions, and its analysis, using computational predictions from primary structures, complements secondary and tertiary structure-based approaches. In this Tutorial, we provide an overview and comparison of 23 publicly available computational tools with complementary parameters useful for intrinsic disorder prediction, partly relying on results from the Critical Assessment of protein Intrinsic Disorder prediction experiment. We consider factors such as accuracy, runtime, availability and the need for functional insights. The selected tools are available as web servers and downloadable programs, offer state-of-the-art predictions and can be used in a high-throughput manner. We provide examples and instructions for the selected tools to illustrate practical aspects related to the submission, collection and interpretation of predictions, as well as the timing and their limitations. We highlight two predictors for intrinsically disordered proteins, flDPnn as accurate and fast and IUPred as very fast and moderately accurate, while suggesting ANCHOR2 and MoRFchibi as two of the best-performing predictors for intrinsically disordered region binding. We link these tools to additional resources, including databases of predictions and web servers that integrate multiple predictive methods. Altogether, this Tutorial provides a hands-on guide to comparatively evaluating multiple predictors, submitting and collecting their own predictions, and reading and interpreting results. It is suitable for experimentalists and computational biologists interested in accurately and conveniently identifying intrinsic disorder, facilitating the functional characterization of the rapidly growing collections of protein sequences.
固有无序在广泛的蛋白质功能中起着重要作用,使用基于原始结构的计算预测对其进行分析,补充了基于二级和三级结构的方法。在本教程中,我们提供了 23 种公开可用的计算工具的概述和比较,这些工具具有互补的参数,可用于固有无序预测,部分依赖于蛋白质固有无序预测实验的关键评估结果。我们考虑了准确性、运行时间、可用性和对功能见解的需求等因素。选定的工具可作为网络服务器和可下载程序使用,提供最先进的预测,并可用于高通量方式。我们为选定的工具提供了示例和说明,以说明与提交、收集和解释预测以及时间和限制相关的实际方面。我们强调了两种用于预测固有无序蛋白质的预测器,flDPnn 具有准确性和快速性,IUPred 具有非常快速和中等准确性,同时建议 ANCHOR2 和 MoRFchibi 作为用于固有无序区域结合的两个性能最佳的预测器之一。我们将这些工具链接到其他资源,包括预测数据库和集成多个预测方法的网络服务器。总之,本教程提供了一个实践指南,用于比较评估多个预测器、提交和收集它们自己的预测,并阅读和解释结果。它适合对准确和方便地识别固有无序、促进快速增长的蛋白质序列功能特征具有兴趣的实验家和计算生物学家。