Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, 9500 Euclid Avenue, Cleveland, OH 44195, USA.
BioFolD Unit, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy.
Nucleic Acids Res. 2022 Jul 5;50(W1):W222-W227. doi: 10.1093/nar/gkac325.
Estimating the functional effect of single amino acid variants in proteins is fundamental for predicting the change in the thermodynamic stability, measured as the difference in the Gibbs free energy of unfolding, between the wild-type and the variant protein (ΔΔG). Here, we present the web-server of the DDGun method, which was previously developed for the ΔΔG prediction upon amino acid variants. DDGun is an untrained method based on basic features derived from evolutionary information. It is antisymmetric, as it predicts opposite ΔΔG values for direct (A → B) and reverse (B → A) single and multiple site variants. DDGun is available in two versions, one based on only sequence information and the other one based on sequence and structure information. Despite being untrained, DDGun reaches prediction performances comparable to those of trained methods. Here we make DDGun available as a web server. For the web server version, we updated the protein sequence database used for the computation of the evolutionary features, and we compiled two new data sets of protein variants to do a blind test of its performances. On these blind data sets of single and multiple site variants, DDGun confirms its prediction performance, reaching an average correlation coefficient between experimental and predicted ΔΔG of 0.45 and 0.49 for the sequence-based and structure-based versions, respectively. Besides being used for the prediction of ΔΔG, we suggest that DDGun should be adopted as a benchmark method to assess the predictive capabilities of newly developed methods. Releasing DDGun as a web-server, stand-alone program and docker image will facilitate the necessary process of method comparison to improve ΔΔG prediction.
估算蛋白质中单个氨基酸变异的功能效应对于预测热力学稳定性的变化至关重要,这可以通过比较野生型和变异型蛋白质之间的自由能差异(ΔΔG)来衡量。在这里,我们介绍了 DDGun 方法的网络服务器,该方法之前是为预测氨基酸变异的 ΔΔG 而开发的。DDGun 是一种基于从进化信息中提取的基本特征的无模型方法。它是反对称的,因为它预测直接(A → B)和反向(B → A)单突变和多突变的 ΔΔG 值相反。DDGun 有两个版本,一个仅基于序列信息,另一个基于序列和结构信息。尽管是无模型的,但 DDGun 达到了与训练方法相当的预测性能。在这里,我们将 DDGun 作为网络服务器提供。对于网络服务器版本,我们更新了用于计算进化特征的蛋白质序列数据库,并编译了两个新的蛋白质变异数据集,以对其性能进行盲测。在这些单突变和多突变的盲测数据集上,DDGun 确认了其预测性能,序列和结构版本的实验和预测 ΔΔG 之间的平均相关系数分别为 0.45 和 0.49。除了用于预测 ΔΔG 之外,我们还建议将 DDGun 作为基准方法来评估新开发方法的预测能力。将 DDGun 作为网络服务器、独立程序和 Docker 映像发布将有助于进行方法比较,以改进 ΔΔG 预测。