Department of Biochemistry, ELTE Eötvös Loránd University, Pázmány Péter Stny 1/c, H-1117 Budapest, Hungary.
Biomolecules. 2023 Sep 25;13(10):1442. doi: 10.3390/biom13101442.
Disorder prediction methods that can discriminate between ordered and disordered regions have contributed fundamentally to our understanding of the properties and prevalence of intrinsically disordered proteins (IDPs) in proteomes as well as their functional roles. However, a recent large-scale assessment of the performance of these methods indicated that there is still room for further improvements, necessitating novel approaches to understand the strengths and weaknesses of individual methods. In this study, we compared two methods, IUPred and disorder prediction, based on the pLDDT scores derived from AlphaFold2 (AF2) models. We evaluated these methods using a dataset from the DisProt database, consisting of experimentally characterized disordered regions and subsets associated with diverse experimental methods and functions. IUPred and AF2 provided consistent predictions in 79% of cases for long disordered regions; however, for 15% of these cases, they both suggested order in disagreement with annotations. These discrepancies arose primarily due to weak experimental support, the presence of intermediate states, or context-dependent behavior, such as binding-induced transitions. Furthermore, AF2 tended to predict helical regions with high pLDDT scores within disordered segments, while IUPred had limitations in identifying linker regions. These results provide valuable insights into the inherent limitations and potential biases of disorder prediction methods.
能够区分有序区域和无序区域的无序预测方法极大地促进了我们对蛋白质组中固有无序蛋白 (IDP) 的性质和普遍性及其功能作用的理解。然而,最近对这些方法性能的大规模评估表明,仍有进一步改进的空间,需要新的方法来了解个别方法的优缺点。在这项研究中,我们比较了两种方法,即 IUPred 和 disorder prediction,它们基于 AlphaFold2 (AF2) 模型得出的 pLDDT 分数。我们使用来自 DisProt 数据库的数据集评估了这些方法,该数据集由经过实验表征的无序区域和与各种实验方法和功能相关的子集组成。IUPred 和 AF2 对长无序区域的预测在 79%的情况下是一致的;然而,在这些情况下的 15%,它们都与注释不一致地暗示了有序。这些差异主要是由于实验支持较弱、存在中间状态或上下文相关的行为(如结合诱导的转变)所致。此外,AF2 倾向于在无序片段内预测具有高 pLDDT 分数的螺旋区域,而 IUPred 在识别连接子区域方面存在局限性。这些结果提供了对无序预测方法固有局限性和潜在偏差的宝贵见解。