Department of Systems Biosciences for Drug Discovery, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan.
Mol Syst Biol. 2011 Mar 1;7:472. doi: 10.1038/msb.2011.5.
The discovery of novel bioactive molecules advances our systems-level understanding of biological processes and is crucial for innovation in drug development. For this purpose, the emerging field of chemical genomics is currently focused on accumulating large assay data sets describing compound-protein interactions (CPIs). Although new target proteins for known drugs have recently been identified through mining of CPI databases, using these resources to identify novel ligands remains unexplored. Herein, we demonstrate that machine learning of multiple CPIs can not only assess drug polypharmacology but can also efficiently identify novel bioactive scaffold-hopping compounds. Through a machine-learning technique that uses multiple CPIs, we have successfully identified novel lead compounds for two pharmaceutically important protein families, G-protein-coupled receptors and protein kinases. These novel compounds were not identified by existing computational ligand-screening methods in comparative studies. The results of this study indicate that data derived from chemical genomics can be highly useful for exploring chemical space, and this systems biology perspective could accelerate drug discovery processes.
新型生物活性分子的发现促进了我们对生物过程的系统水平理解,对于药物开发的创新至关重要。为此,化学基因组学这一新兴领域目前专注于积累大量描述化合物-蛋白质相互作用(CPI)的测定数据集。尽管最近通过挖掘 CPI 数据库已经发现了已知药物的新靶标蛋白,但利用这些资源来识别新型配体仍未得到探索。在此,我们证明了通过机器学习多种 CPI 不仅可以评估药物多药性,还可以有效地识别新型生物活性骨架跳跃化合物。通过使用多种 CPI 的机器学习技术,我们成功地为两个具有重要药用价值的蛋白家族(G 蛋白偶联受体和蛋白激酶)鉴定出新型先导化合物。在比较研究中,这些新型化合物没有被现有的计算配体筛选方法所识别。本研究的结果表明,化学基因组学衍生的数据对于探索化学空间非常有用,这种系统生物学的观点可能会加速药物发现过程。