Nicolaou Christos A, Humblet Christine, Hu Hong, Martin Eva M, Dorsey Frank C, Castle Thomas M, Burton Keith Ian, Hu Haitao, Hendle Jorg, Hickey Michael J, Duerksen Joel, Wang Jibo, Erickson Jon A
Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, Indiana 46285, United States.
ACS Med Chem Lett. 2019 Feb 4;10(3):278-286. doi: 10.1021/acsmedchemlett.8b00488. eCollection 2019 Mar 14.
Increasing the success rate and throughput of drug discovery will require efficiency improvements throughout the process that is currently used in the pharmaceutical community, including the crucial step of identifying hit compounds to act as drivers for subsequent optimization. Hit identification can be carried out through large compound collection screening and often involves the generation and testing of many hypotheses based on available knowledge. In practice, hypothesis generation can involve the selection of promising chemical structures from compound collections using predictive models built from previous screening/assay results. Available physical collections, typically used during hit identification, are of the order of 10 compounds but represent only a small fraction of the small molecule drug-like chemical space. In an effort to survey a larger portion of chemical space and eliminate inefficiencies during hit identification, we introduce a new process, termed Idea2Data (I2D) that tightly integrates computational and experimental components of the drug discovery process. I2D provides the ability to connect a vast virtual collection of compounds readily synthesizable on automated synthesis systems with computational predictive models for the identification of promising structures. This new paradigm enables researchers to process billions of virtual molecules and select structures that can be prepared on automated systems and made available for biological testing, allowing for timely hypothesis testing and follow-up. Since its introduction, I2D has positively impacted several portfolio efforts through identification of new chemical scaffolds and functionalization of existing scaffolds. In this Innovations paper, we describe the I2D process and present an application for the discovery of new ULK inhibitors.
提高药物发现的成功率和通量将需要在制药界目前使用的整个过程中提高效率,包括识别命中化合物这一关键步骤,这些命中化合物将作为后续优化的驱动因素。命中化合物的识别可以通过大型化合物库筛选来进行,并且通常涉及基于现有知识生成和测试许多假设。在实践中,假设生成可能涉及使用根据先前筛选/测定结果建立的预测模型从化合物库中选择有前景的化学结构。在命中化合物识别过程中通常使用的现有实体化合物库大约有10种化合物,但仅代表类药物小分子化学空间的一小部分。为了探索更大一部分化学空间并消除命中化合物识别过程中的低效率,我们引入了一种新的流程,称为Idea2Data(I2D),它紧密集成了药物发现过程的计算和实验组件。I2D能够将可在自动化合成系统上轻松合成的大量虚拟化合物库与用于识别有前景结构的计算预测模型相连接。这种新的模式使研究人员能够处理数十亿个虚拟分子,并选择可以在自动化系统上制备并用于生物学测试的结构,从而实现及时的假设测试和跟进。自推出以来,I2D通过识别新的化学骨架和对现有骨架进行功能化,对多项项目工作产生了积极影响。在这篇创新论文中,我们描述了I2D流程,并展示了一个发现新型ULK抑制剂的应用实例。