LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France.
Sanofi Recherche & Développement, Integrated Drug Discovery, Molecular Design Sciences, Vitry-sur-Seine, France.
Proteins. 2021 Feb;89(2):218-231. doi: 10.1002/prot.26008. Epub 2020 Oct 12.
Flexible regions in proteins, such as loops, cannot be represented by a single conformation. Instead, conformational ensembles are needed to provide a more global picture. In this context, identifying statistically meaningful conformations within an ensemble generated by loop sampling techniques remains an open problem. The difficulty is primarily related to the lack of structural data about these flexible regions. With the majority of structural data coming from x-ray crystallography and ignoring plasticity, the conception and evaluation of loop scoring methods is challenging. In this work, we compare the performance of various scoring methods on a set of eight protein loops that are known to be flexible. The ability of each method to identify and select all of the known conformations is assessed, and the underlying energy landscapes are produced and projected to visualize the qualitative differences obtained when using the methods. Statistical potentials are found to provide considerable reliability despite their being designed to tradeoff accuracy for lower computational cost. On a large pool of loop models, they are capable of filtering out statistically improbable states while retaining those that resemble known (and thus likely) conformations. However, computationally expensive methods are still required for more precise assessment and structural refinement. The results also highlight the importance of employing several scaffolds for the protein, due to the high influence of small structural rearrangements in the rest of the protein over the modeled energy landscape for the loop.
蛋白质中的柔性区域,如环,不能用单一构象表示。相反,需要构象集合来提供更全局的图像。在这种情况下,在环采样技术生成的集合中识别具有统计学意义的构象仍然是一个未解决的问题。困难主要与这些柔性区域缺乏结构数据有关。由于大多数结构数据来自 X 射线晶体学,并且忽略了可塑性,因此环评分方法的构思和评估具有挑战性。在这项工作中,我们比较了各种评分方法在一组已知具有柔性的八个蛋白质环上的性能。评估了每种方法识别和选择所有已知构象的能力,并生成和投影潜在的能量景观,以可视化使用方法时获得的定性差异。尽管统计势是为了权衡准确性以降低计算成本而设计的,但它们仍提供了相当大的可靠性。在一个大型环模型池中,它们能够过滤掉具有统计学可能性的状态,同时保留那些类似于已知(因此可能)构象的状态。然而,仍然需要更精确的评估和结构细化的计算成本更高的方法。结果还强调了由于蛋白质中其他部分的小结构重排对环的建模能量景观的高影响,因此对蛋白质使用几种支架的重要性。