Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy.
Bioprocessing Technology Institute, A*STAR, Singapore 138668, Singapore.
Int J Mol Sci. 2020 Jun 24;21(12):4496. doi: 10.3390/ijms21124496.
Intrinsically disordered protein regions are commonly defined from missing electron density in X-ray structures. Experimental evidence for long disorder regions (LDRs) of at least 30 residues was so far limited to manually curated proteins. Here, we describe a comprehensive and large-scale analysis of experimental LDRs for 3133 unique proteins, demonstrating an increasing coverage of intrinsic disorder in the Protein Data Bank (PDB) in the last decade. The results suggest that long missing residue regions are a good quality source to annotate intrinsically disordered regions and perform functional analysis in large data sets. The consensus approach used to define LDRs allows to evaluate context dependent disorder and provide a common definition at the protein level.
无规卷曲蛋白质区域通常根据 X 射线结构中的电子密度缺失来定义。到目前为止,至少 30 个残基的长无规卷曲区域 (LDR) 的实验证据仅限于人工编辑的蛋白质。在这里,我们对 3133 个独特蛋白质的实验 LDR 进行了全面的大规模分析,表明在过去十年中,蛋白质数据库 (PDB) 中内在无序的覆盖范围不断增加。结果表明,长缺失残基区域是一个很好的质量来源,可以在大型数据集中标注内在无序区域并进行功能分析。用于定义 LDR 的共识方法允许评估上下文相关的无序,并在蛋白质水平上提供一个通用的定义。