Department of Medicinal Chemistry, University of Michigan, 428 Church Street, Ann Arbor, Michigan 48109-1056, United States.
J Chem Inf Model. 2021 Mar 22;61(3):1287-1299. doi: 10.1021/acs.jcim.0c01002. Epub 2021 Feb 18.
Protein dynamics play an important role in small molecule binding and can pose a significant challenge in the identification of potential binding sites. Cryptic binding sites have been defined as sites which require significant rearrangement of the protein structure to become physically accessible to a ligand. Mixed-solvent MD (MixMD) is a computational protocol which maps the surface of the protein using molecular dynamics (MD) of the unbound protein solvated in a 5% box of probe molecules with explicit water. This method has successfully identified known active and allosteric sites which did not require reorganization. In this study, we apply the MixMD protocol to identify known cryptic sites of 12 proteins characterized by a wide range of conformational changes. Of these 12 proteins, three require reorganization of side chains, five require loop movements, and four require movement of more significant structures such as whole helices. In five cases, we find that standard MixMD simulations are able to map the cryptic binding sites with at least one probe type. In two cases (guanylate kinase and TIE-2), accelerated MD, which increases sampling of torsional angles, was necessary to achieve mapping of portions of the cryptic binding site missed by standard MixMD. For more complex systems where movement of a helix or domain is necessary, MixMD was unable to map the binding site even with accelerated dynamics, possibly due to the limited timescale (100 ns for individual simulations). In general, similar conformational dynamics are observed in water-only simulations and those with probe molecules. This could imply that the probes are not driving opening events but rather take advantage of mapping sites that spontaneously open as part of their inherent conformational behavior. Finally, we show that docking to an ensemble of conformations from the standard MixMD simulations performs better than docking the apo crystal structure in nine cases and even better than half of the bound crystal structures. Poorer performance was seen in docking to ensembles of conformations from the accelerated MixMD simulations.
蛋白质动力学在小分子结合中起着重要作用,并且在识别潜在结合位点方面可能会带来重大挑战。隐匿结合位点被定义为需要蛋白质结构发生重大重排才能使配体物理上可接近的位点。混合溶剂 MD(MixMD)是一种计算方案,该方案使用未结合的蛋白质在 5%探针分子盒中的分子动力学(MD)对蛋白质表面进行映射,其中包含显式水。该方法已成功识别了不需要重组的已知活性和变构位点。在这项研究中,我们应用 MixMD 方案来识别 12 种具有广泛构象变化特征的蛋白质的已知隐匿位点。在这 12 种蛋白质中,有三种需要侧链重组,五种需要环运动,四种需要更大结构的运动,如整个螺旋。在五种情况下,我们发现标准的 MixMD 模拟能够用至少一种探针类型来映射隐匿结合位点。在两种情况下(鸟苷酸激酶和 TIE-2),需要加速 MD 来增加扭转角的采样,以实现标准 MixMD 错过的隐匿结合位点的部分映射。对于更复杂的系统,需要螺旋或结构域的运动,即使使用加速动力学,MixMD 也无法映射结合位点,这可能是由于时间限制(单个模拟 100ns)。一般来说,在仅用水和含有探针分子的模拟中观察到类似的构象动力学。这可能意味着探针不是在驱动开口事件,而是利用作为其固有构象行为一部分自发打开的位点进行映射。最后,我们表明,在 9 种情况下,从标准 MixMD 模拟的构象集合中对接比对接 apo 晶体结构的性能更好,甚至比一半的结合晶体结构的性能更好。在对接加速 MixMD 模拟的构象集合时,性能较差。