Sauer Susanne, Matter Hans, Hessler Gerhard, Grebner Christoph
Synthetic Molecular Design, Integrated Drug Discovery, Sanofi, Frankfurt, Germany.
Front Chem. 2022 Oct 19;10:1012507. doi: 10.3389/fchem.2022.1012507. eCollection 2022.
The identification and optimization of promising lead molecules is essential for drug discovery. Recently, artificial intelligence (AI) based generative methods provided complementary approaches for generating molecules under specific design constraints of relevance in drug design. The goal of our study is to incorporate protein 3D information directly into generative design by flexible docking plus an adapted protein-ligand scoring function, thereby moving towards automated structure-based design. First, the protein-ligand scoring function RFXscore integrating individual scoring terms, ligand descriptors, and combined terms was derived using the PDBbind database and internal data. Next, design results for different workflows are compared to solely ligand-based reward schemes. Our newly proposed, optimal workflow for structure-based generative design is shown to produce promising results, especially for those exploration scenarios, where diverse structures fitting to a protein binding site are requested. Best results are obtained using docking followed by RFXscore, while, depending on the exact application scenario, it was also found useful to combine this approach with other metrics that bias structure generation into "drug-like" chemical space, such as target-activity machine learning models, respectively.
识别和优化有前景的先导分子对于药物发现至关重要。最近,基于人工智能(AI)的生成方法为在药物设计相关的特定设计约束下生成分子提供了补充方法。我们研究的目标是通过灵活对接和适配的蛋白质-配体评分函数将蛋白质三维信息直接纳入生成设计中,从而迈向基于结构的自动化设计。首先,使用PDBbind数据库和内部数据推导了整合个体评分项、配体描述符和组合项的蛋白质-配体评分函数RFXscore。接下来,将不同工作流程的设计结果与仅基于配体的奖励方案进行比较。我们新提出的基于结构的生成设计的最优工作流程显示出产生了有前景的结果,特别是对于那些需要多种适合蛋白质结合位点结构的探索场景。使用对接后接RFXscore可获得最佳结果,同时,根据具体应用场景,还发现将这种方法与其他将结构生成偏向“类药物”化学空间的指标(如目标活性机器学习模型)相结合也很有用。