Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
PLoS Comput Biol. 2022 Mar 16;18(3):e1009178. doi: 10.1371/journal.pcbi.1009178. eCollection 2022 Mar.
Proteins are typically represented by discrete atomic coordinates providing an accessible framework to describe different conformations. However, in some fields proteins are more accurately represented as near-continuous surfaces, as these are imprinted with geometric (shape) and chemical (electrostatics) features of the underlying protein structure. Protein surfaces are dependent on their chemical composition and, ultimately determine protein function, acting as the interface that engages in interactions with other molecules. In the past, such representations were utilized to compare protein structures on global and local scales and have shed light on functional properties of proteins. Here we describe RosettaSurf, a surface-centric computational design protocol, that focuses on the molecular surface shape and electrostatic properties as means for protein engineering, offering a unique approach for the design of proteins and their functions. The RosettaSurf protocol combines the explicit optimization of molecular surface features with a global scoring function during the sequence design process, diverging from the typical design approaches that rely solely on an energy scoring function. With this computational approach, we attempt to address a fundamental problem in protein design related to the design of functional sites in proteins, even when structurally similar templates are absent in the characterized structural repertoire. Surface-centric design exploits the premise that molecular surfaces are, to a certain extent, independent of the underlying sequence and backbone configuration, meaning that different sequences in different proteins may present similar surfaces. We benchmarked RosettaSurf on various sequence recovery datasets and showcased its design capabilities by generating epitope mimics that were biochemically validated. Overall, our results indicate that the explicit optimization of surface features may lead to new routes for the design of functional proteins.
蛋白质通常由离散的原子坐标表示,为描述不同构象提供了一个可访问的框架。然而,在某些领域,蛋白质更准确地表示为近似连续的表面,因为这些表面上印刻着蛋白质结构的几何(形状)和化学(静电)特征。蛋白质表面取决于其化学成分,并最终决定蛋白质的功能,作为与其他分子相互作用的界面。过去,这些表示形式被用于在全局和局部尺度上比较蛋白质结构,并揭示了蛋白质的功能特性。在这里,我们描述了 RosettaSurf,这是一种基于表面的计算设计协议,专注于分子表面形状和静电特性,作为蛋白质工程的手段,为蛋白质及其功能的设计提供了一种独特的方法。RosettaSurf 协议将分子表面特征的显式优化与序列设计过程中的全局评分函数相结合,与仅依赖能量评分函数的典型设计方法不同。通过这种计算方法,我们试图解决与蛋白质功能位点设计相关的蛋白质设计中的一个基本问题,即使在特征结构库中不存在结构相似的模板。基于表面的设计利用了这样一个前提,即分子表面在一定程度上独立于底层序列和骨架构象,这意味着不同蛋白质中的不同序列可能呈现出相似的表面。我们在各种序列恢复数据集上对 RosettaSurf 进行了基准测试,并通过生成生物化学验证的表位模拟物展示了其设计能力。总的来说,我们的结果表明,表面特征的显式优化可能为功能性蛋白质的设计开辟新途径。