Xu Tianxu, Wang Minjun, Liu Xiaoqian, Feng Dawei, Zhu Yanjuan, Fan Zhe, Rao Shurong, Lu Jing
Department, Institution:Key Laboratory of Molecular Pharmacology and Drug Evaluation, Ministry of Education, Collaborative Innovation Center of Advanced Drug Delivery System and Biotech Drugs in Universities of Shandong, School of Pharmacy, Yantai University, No. 30, Qingquan Road, Laishan District, Yantai, 264005, China.
Mol Inform. 2022 Dec;41(12):e2200088. doi: 10.1002/minf.202200088. Epub 2022 Oct 6.
Designing molecules with specific scaffolds can facilitate the discovery and optimization of lead compounds. Some scaffold-based molecular generation models have been developed using deep-learning methods based on specific scaffolds, although incorporating scaffold generalization is expected to achieve scaffold hopping. Moreover, most of the existing models focus on the 2D shape of the scaffold and overlook the stereochemical properties of the compound, especially for natural products. In this study, we optimized the scaffold-based molecular generation model designed by Lim et al. (Chemical Science 2020, 11, 1153-1164). Real-time ultrafast shape recognition with pharmacophore constraints (USRCAT) was introduced into the model to search for molecules similar to the 3D conformation and pharmacophore of the input scaffold sourced from the training set; the searched molecules were then used as new scaffolds to execute scaffold hopping. The optimized model could generate new molecules with the same chirality as the input scaffold. Furthermore, the probability distribution of the molecular structure and various physicochemical properties were analyzed to evaluate the model's generation capability. We thus believe that the optimized model can provide a basis for medicinal chemists to explore a wider chemical space toward optimization of the lead compounds and to screen the virtual compound library.
设计具有特定骨架的分子有助于先导化合物的发现和优化。尽管期望通过纳入骨架泛化来实现骨架跳跃,但已经使用基于特定骨架的深度学习方法开发了一些基于骨架的分子生成模型。此外,现有的大多数模型都侧重于骨架的二维形状,而忽略了化合物的立体化学性质,尤其是对于天然产物。在本研究中,我们优化了Lim等人(《化学科学》,2020年,11卷,1153 - 1164页)设计的基于骨架的分子生成模型。将具有药效团约束的实时超快形状识别(USRCAT)引入该模型,以搜索与源自训练集的输入骨架的三维构象和药效团相似的分子;然后将搜索到的分子用作新的骨架来进行骨架跳跃。优化后的模型可以生成与输入骨架具有相同手性的新分子。此外,分析了分子结构和各种物理化学性质的概率分布,以评估模型的生成能力。因此,我们认为优化后的模型可以为药物化学家探索更广阔的化学空间以优化先导化合物和筛选虚拟化合物库提供基础。