Department of Chemistry, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador A1B 3T4, Canada.
J Chem Inf Model. 2020 Dec 28;60(12):6258-6268. doi: 10.1021/acs.jcim.0c00904. Epub 2020 Dec 2.
Many drug molecules contain biaryl fragments, resulting in a torsional barrier corresponding to rotation around the bond linking the aryls. The potential energy surfaces of these torsions vary significantly because of steric and electronic effects, ultimately affecting the relative stability of the molecular conformations in the protein-bound and solution states. Simulations of protein-ligand binding require accurate computational models to represent the intramolecular interactions to provide accurate predictions of the structure and dynamics of binding. In this article, we compare four force fields [generalized AMBER force field (GAFF), open force field (OpenFF), CHARMM general force field (CGenFF), optimized potentials for liquid simulations (OPLS)] and two neural network potentials (ANI-2x and ANI-1ccx) for their ability to predict the torsional potential energy surfaces of 88 biaryls extracted from drug fragments. The root mean square deviation (rmsd) over the full potential energy surface and the mean absolute deviation of the torsion rotational barrier height (MADB) relative to high-level ab initio reference data (CCSD(T1)*) were used as the measure of accuracy. Uncertainties in these metrics due to the composition of the data set were estimated using bootstrap analysis. In comparison to high-level ab initio data, ANI-1ccx was most accurate for predicting the barrier height (rmsd: 0.5 ± 0.0 kcal/mol, MADB: 0.8 ± 0.1 kcal/mol), followed closely by ANI-2x (rmsd: 0.5 ± 0.0 kcal/mol, MADB: 1.0 ± 0.2 kcal/mol), then CGenFF (rmsd: 0.8 ± 0.1 kcal/mol, MADB: 1.3 ± 0.1 kcal/mol) and OpenFF (rmsd: 0.7 ± 0.1 kcal/mol, MADB: 1.3 ± 0.1 kcal/mol), then GAFF (rmsd: 1.2 ± 0.2 kcal/mol, MADB: 2.6 ± 0.5 kcal/mol), and finally OPLS (rmsd: 3.6 ± 0.3 kcal/mol, MADB: 3.6 ± 0.3 kcal/mol). Significantly, the neural network potentials (NNPs) are systematically more accurate and more reliable than any of the force fields. As a practical example, the NNP/molecular mechanics method was used to simulate the isomerization of ozanimod, a drug used for multiple sclerosis. Multinanosecond molecular dynamics (MD) simulations in an explicit aqueous solvent were performed, as well as umbrella sampling and adaptive biasing force-enhanced sampling techniques. The rate constant for this isomerization calculated using transition state theory was 4.30 × 10 ns, which is consistent with direct MD simulations.
许多药物分子含有联芳片段,导致对应于芳基之间键的旋转的扭转势垒。这些扭转的势能面因空间和电子效应而有很大差异,最终影响蛋白质结合态和溶液态下分子构象的相对稳定性。蛋白质-配体结合的模拟需要准确的计算模型来表示分子内相互作用,以提供结合结构和动力学的准确预测。在本文中,我们比较了四种力场[广义 AMBER 力场(GAFF)、开放式力场(OpenFF)、CHARMM 通用力场(CGenFF)、优化液体模拟势能(OPLS)]和两种神经网络势(ANI-2x 和 ANI-1ccx),以评估它们预测 88 个从药物片段中提取的联芳基扭转势能表面的能力。均方根偏差(rmsd)在整个势能面和相对于高精度从头算参考数据(CCSD(T1)*)的扭转旋转势垒高度的平均绝对偏差(MADB)被用作准确性的度量。使用自举分析估计了由于数据集组成而导致这些指标的不确定性。与高精度从头算数据相比,ANI-1ccx 预测势垒高度最准确(rmsd:0.5 ± 0.0 kcal/mol,MADB:0.8 ± 0.1 kcal/mol),紧随其后的是 ANI-2x(rmsd:0.5 ± 0.0 kcal/mol,MADB:1.0 ± 0.2 kcal/mol),然后是 CGenFF(rmsd:0.8 ± 0.1 kcal/mol,MADB:1.3 ± 0.1 kcal/mol)和 OpenFF(rmsd:0.7 ± 0.1 kcal/mol,MADB:1.3 ± 0.1 kcal/mol),然后是 GAFF(rmsd:1.2 ± 0.2 kcal/mol,MADB:2.6 ± 0.5 kcal/mol),最后是 OPLS(rmsd:3.6 ± 0.3 kcal/mol,MADB:3.6 ± 0.3 kcal/mol)。值得注意的是,神经网络势(NNP)比任何力场都更准确、更可靠。作为一个实际的例子,使用 NNP/分子力学方法模拟了用于多发性硬化症的药物 ozanimod 的异构化。在明水中进行了多纳秒分子动力学(MD)模拟,以及伞状采样和自适应偏置力增强采样技术。使用过渡态理论计算的这种异构化的速率常数为 4.30 × 10 ns,与直接 MD 模拟一致。