Ravera Enrico, Sgheri Luca, Parigi Giacomo, Luchinat Claudio
Center for Magnetic Resonance (CERM) and Department of Chemistry "Ugo Schiff", University of Florence, Via L. Sacconi 6, 50019, Sesto Fiorentino, Italy.
Istituto per le Applicazioni del Calcolo, Sezione di Firenze, CNR, Via Madonna del Piano 10, 50019 Sesto Fiorentino, Italy.
Phys Chem Chem Phys. 2016 Feb 17;18(8):5686-701. doi: 10.1039/c5cp04077a.
Conformational heterogeneity is key to the function of many biomacromolecules, but only a few groups have tried to characterize it until recently. Now, thanks to the increased throughput of experimental data and the increased computational power, the problem of the characterization of protein structural variability has become more and more popular. Several groups have devoted their efforts in trying to create quantitative, reliable and accurate protocols for extracting such information from averaged data. We analyze here different approaches, discussing strengths and weaknesses of each. All approaches can roughly be clustered into two groups: those satisfying the maximum entropy principle and those recovering ensembles composed of a restricted number of molecular conformations. In the first case, the solution focuses on the features that are common to all the infinite solutions satisfying the experimental data; in the second case, the reconstructed ensemble shows the conformational regions where a large probability can be placed. The upper limits for conformational probabilities (MaxOcc) can also be calculated. We also give an overview of the mainstream experimental observables, with considerations on the assumptions underlying their usage.
构象异质性是许多生物大分子功能的关键,但直到最近只有少数研究团队尝试对其进行表征。如今,由于实验数据通量的增加以及计算能力的提升,蛋白质结构变异性的表征问题变得越来越热门。几个研究团队致力于创建定量、可靠且准确的方案,以便从平均数据中提取此类信息。我们在此分析不同的方法,讨论每种方法的优缺点。所有方法大致可分为两类:满足最大熵原理的方法和恢复由有限数量分子构象组成的系综的方法。在第一种情况下,解决方案聚焦于满足实验数据的所有无限解所共有的特征;在第二种情况下,重建的系综展示了可放置大概率的构象区域。还可以计算构象概率的上限(最大占有率)。我们还概述了主流的实验可观测量,并考虑了其使用背后的假设。