Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States.
Medical Scientist Training Program, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States.
J Chem Theory Comput. 2023 Jul 25;19(14):4355-4363. doi: 10.1021/acs.jctc.2c01189. Epub 2023 Mar 22.
Cryptic pockets, or pockets absent in ligand-free, experimentally determined structures, hold great potential as drug targets. However, cryptic pocket openings are often beyond the reach of conventional biomolecular simulations because certain cryptic pocket openings involve slow motions. Here, we investigate whether AlphaFold can be used to accelerate cryptic pocket discovery either by generating structures with open pockets directly or generating structures with partially open pockets that can be used as starting points for simulations. We use AlphaFold to generate ensembles for 10 known cryptic pocket examples, including five that were deposited after AlphaFold's training data were extracted from the PDB. We find that in 6 out of 10 cases AlphaFold samples the open state. For plasmepsin II, an aspartic protease from the causative agent of malaria, AlphaFold only captures a partial pocket opening. As a result, we ran simulations from an ensemble of AlphaFold-generated structures and show that this strategy samples cryptic pocket opening, even though an equivalent amount of simulations launched from a ligand-free experimental structure fails to do so. Markov state models (MSMs) constructed from the AlphaFold-seeded simulations quickly yield a free energy landscape of cryptic pocket opening that is in good agreement with the same landscape generated with well-tempered metadynamics. Taken together, our results demonstrate that AlphaFold has a useful role to play in cryptic pocket discovery but that many cryptic pockets may remain difficult to sample using AlphaFold alone.
隐匿口袋,或在无配体的实验确定结构中不存在的口袋,具有作为药物靶点的巨大潜力。然而,隐匿口袋的开口通常超出了传统生物分子模拟的范围,因为某些隐匿口袋的开口涉及缓慢的运动。在这里,我们研究了 AlphaFold 是否可以通过直接生成带有开口的口袋结构或生成带有部分开口的口袋结构来加速隐匿口袋的发现,这些结构可以作为模拟的起点。我们使用 AlphaFold 为 10 个已知的隐匿口袋示例生成了集合,其中 5 个是在 AlphaFold 的训练数据从 PDB 中提取后被存入的。我们发现,在 10 个案例中有 6 个是 AlphaFold 采样到的开放状态。对于疟原虫病原体的天冬氨酸蛋白酶 plasmepsin II,AlphaFold 只捕获了部分口袋开口。因此,我们从 AlphaFold 生成的结构集合中运行了模拟,并表明这种策略可以采样隐匿口袋的开口,尽管从无配体的实验结构中启动相同数量的模拟无法做到这一点。从 AlphaFold 播种的模拟中构建的马尔可夫状态模型(MSM)可以快速生成隐匿口袋开口的自由能景观,与使用温和的元动力学生成的相同景观非常吻合。总之,我们的结果表明,AlphaFold 在隐匿口袋的发现中具有有用的作用,但许多隐匿口袋可能仍然难以仅使用 AlphaFold 进行采样。