Wu Nicholas C, Olson C Anders, Sun Ren
Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, California, 90095.
Molecular Biology Institute, University of California, Los Angeles, California, 90095.
Protein Sci. 2016 Feb;25(2):530-9. doi: 10.1002/pro.2840. Epub 2015 Dec 8.
The effect of a mutation on protein stability is traditionally measured by genetic construction, expression, purification, and physical analysis using low-throughput methods. This process is tedious and limits the number of mutants able to be examined in a single study. In contrast, functional fitness effects can be measured in a high-throughput manner by various deep mutational scanning tools. Using protein GB 1, we have recently demonstrated the feasibility of estimating the mutational stability effect ( ΔΔG) of single-substitution based on the functional fitness profile of all double-substitutions. The principle is to identify genetic backgrounds that have an exhausted stability margin. The functional effect of an additional substitution on these genetic backgrounds can then be used to compute the mutational ΔΔG based on the biophysical relationship between functional fitness and thermodynamic stability. However, to identify such genetic backgrounds, the approach described in our previous study required a benchmark dataset, which is a set of known mutational ΔΔG. In this study, a benchmark-independent approach is developed. The genetic backgrounds of interest are identified using k-means clustering with the integration of structural information. We further demonstrated that a reasonable approximation of ΔΔG can also be obtained without taking structural information into account. In summary, this study describes a novel method for computing ΔΔG from double-substitution functional fitness profiles alone, without relying on any known mutational ΔΔG as a benchmark.
传统上,通过基因构建、表达、纯化以及使用低通量方法进行物理分析来测量突变对蛋白质稳定性的影响。这个过程很繁琐,并且限制了在单个研究中能够检测的突变体数量。相比之下,功能适应性效应可以通过各种深度突变扫描工具以高通量方式进行测量。利用蛋白质GB 1,我们最近证明了基于所有双取代的功能适应性概况来估计单取代的突变稳定性效应(ΔΔG)的可行性。其原理是识别具有耗尽稳定性余量的遗传背景。然后,可以基于功能适应性与热力学稳定性之间的生物物理关系,利用这些遗传背景上额外取代的功能效应来计算突变ΔΔG。然而,为了识别这样的遗传背景,我们先前研究中描述的方法需要一个基准数据集,即一组已知的突变ΔΔG。在本研究中,开发了一种不依赖基准的方法。利用k均值聚类并整合结构信息来识别感兴趣的遗传背景。我们进一步证明,在不考虑结构信息的情况下也可以获得ΔΔG的合理近似值。总之,本研究描述了一种仅从双取代功能适应性概况计算ΔΔG的新方法,而无需依赖任何已知的突变ΔΔG作为基准。