Rasouli Ali, Pickard Frank C, Sur Sreyoshi, Grossfield Alan, Işık Bennett Mehtap
Moderna, Inc., 325 Binney Street, Cambridge, Massachusetts 02142, United States.
Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, University of Illinois, Urbana, Illinois 61801, United States.
J Chem Inf Model. 2025 Jan 13;65(1):223-239. doi: 10.1021/acs.jcim.4c01505. Epub 2024 Dec 19.
Alchemical free energy calculations are widely used to predict the binding affinity of small molecule ligands to protein targets; however, the application of these methods to RNA targets has not been deeply explored. We systematically investigated how modeling decisions affect the performance of absolute binding free energy calculations for a relatively simple RNA model system: theophylline-binding RNA aptamer with theophylline and five analogs. The goal of this investigation was 2-fold: (1) understanding the performance levels we can expect from absolute free energy calculations for a simple RNA complex and (2) learning about practical modeling considerations that impact the success of RNA-binding predictions, which may be different from the best practices established for protein targets. We learned that magnesium ion (Mg) placement is a critical decision that impacts affinity predictions. When information regarding Mg positions is lacking, implementing RNA backbone restraints is an alternative way of stabilizing the RNA structure that recapitulates prediction accuracy. Since mistakes in Mg placement can be detrimental, omitting magnesium ions entirely and using RNA backbone restraints are attractive as a risk-mitigating approach. We found that predictions are sensitive to modeling experimental buffer conditions correctly, including salt type and ionic strength. We explored the effects of sampling in the alchemical protocol, choice of the ligand force field (GAFF2/OpenFF Sage), and water model (TIP3P/OPC) on predictions, which allowed us to give practical advice for the application of free energy methods to RNA targets. By capturing experimental buffer conditions and implementing RNA backbone restraints, we were able to compute binding affinities accurately (mean absolute error (MAE) = 2.2 kcal/mol, Pearson's correlation coefficient = 0.9, Kendall's τ = 0.7). We believe there is much to learn about how to apply free energy calculations for RNA targets and how to enhance their performance in prospective predictions. This study is an important first step for learning best practices and special considerations for RNA-ligand free energy calculations. Future studies will consider increasingly complicated ligands and diverse RNA systems and help the development of general protocols for therapeutically relevant RNA targets.
炼金术自由能计算被广泛用于预测小分子配体与蛋白质靶点的结合亲和力;然而,这些方法在RNA靶点上的应用尚未得到深入探索。我们系统地研究了建模决策如何影响一个相对简单的RNA模型系统的绝对结合自由能计算性能:茶碱结合RNA适配体与茶碱及五个类似物。这项研究的目标有两个:(1)了解对于一个简单的RNA复合物,我们可以从绝对自由能计算中期望得到的性能水平;(2)了解影响RNA结合预测成功的实际建模注意事项,这些可能与针对蛋白质靶点确立的最佳实践不同。我们了解到镁离子(Mg)的位置是一个影响亲和力预测的关键决策。当缺乏关于Mg位置的信息时,实施RNA主链约束是稳定RNA结构的另一种方法,其能重现预测准确性。由于Mg位置错误可能有害,完全省略镁离子并使用RNA主链约束作为一种降低风险的方法很有吸引力。我们发现预测对正确建模实验缓冲条件很敏感,包括盐类型和离子强度。我们探索了炼金术协议中的采样、配体力场(GAFF2/OpenFF Sage)的选择以及水模型(TIP3P/OPC)对预测的影响,这使我们能够为将自由能方法应用于RNA靶点提供实用建议。通过捕捉实验缓冲条件并实施RNA主链约束,我们能够准确计算结合亲和力(平均绝对误差(MAE) = 2.2千卡/摩尔,皮尔逊相关系数 = 0.9,肯德尔τ = 0.7)。我们相信,关于如何将自由能计算应用于RNA靶点以及如何在前瞻性预测中提高其性能,还有很多需要学习的地方。这项研究是了解RNA - 配体自由能计算的最佳实践和特殊注意事项的重要第一步。未来的研究将考虑越来越复杂的配体和多样的RNA系统,并有助于开发针对治疗相关RNA靶点的通用协议。