Department of Biomedical Sciences, University of Padova, Padova, Italy.
Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), Bari, Italy.
Nucleic Acids Res. 2024 Jan 5;52(D1):D434-D441. doi: 10.1093/nar/gkad928.
DisProt (URL: https://disprot.org) is the gold standard database for intrinsically disordered proteins and regions, providing valuable information about their functions. The latest version of DisProt brings significant advancements, including a broader representation of functions and an enhanced curation process. These improvements aim to increase both the quality of annotations and their coverage at the sequence level. Higher coverage has been achieved by adopting additional evidence codes. Quality of annotations has been improved by systematically applying Minimum Information About Disorder Experiments (MIADE) principles and reporting all the details of the experimental setup that could potentially influence the structural state of a protein. The DisProt database now includes new thematic datasets and has expanded the adoption of Gene Ontology terms, resulting in an extensive functional repertoire which is automatically propagated to UniProtKB. Finally, we show that DisProt's curated annotations strongly correlate with disorder predictions inferred from AlphaFold2 pLDDT (predicted Local Distance Difference Test) confidence scores. This comparison highlights the utility of DisProt in explaining apparent uncertainty of certain well-defined predicted structures, which often correspond to folding-upon-binding fragments. Overall, DisProt serves as a comprehensive resource, combining experimental evidence of disorder information to enhance our understanding of intrinsically disordered proteins and their functional implications.
DisProt(网址:https://disprot.org)是一个用于研究无序蛋白质和区域的黄金标准数据库,提供了有关它们功能的有价值信息。DisProt 的最新版本带来了重大进展,包括更广泛的功能表示和改进的注释过程。这些改进旨在提高注释的质量和序列水平的覆盖度。通过采用额外的证据代码,提高了覆盖度。通过系统地应用无序信息最小化实验(MIADE)原则并报告所有可能影响蛋白质结构状态的实验设置的详细信息,提高了注释的质量。DisProt 数据库现在包括新的主题数据集,并扩展了对基因本体术语的采用,从而形成了一个广泛的功能库,该功能库可自动传播到 UniProtKB。最后,我们表明 DisProt 的注释与从 AlphaFold2 pLDDT(预测局部距离差异测试)置信得分推断出的无序预测强烈相关。这种比较突出了 DisProt 在解释某些明确定义的预测结构的明显不确定性方面的实用性,这些结构通常对应于折叠结合片段。总的来说,DisProt 是一个综合性资源,结合了无序信息的实验证据,以增强我们对无序蛋白质及其功能影响的理解。