Arús-Pous Josep, Awale Mahendra, Probst Daniel, Reymond Jean-Louis
Department of Chemistry and Biochemistry, National Center for Competence in Research NCCR TransCure, University of Bern, Freiestrasse 3, CH-3012 Bern.
Department of Chemistry and Biochemistry, National Center for Competence in Research NCCR TransCure, University of Bern, Freiestrasse 3, CH-3012 Bern;, Email:
Chimia (Aarau). 2019 Dec 18;73(12):1018-1023. doi: 10.2533/chimia.2019.1018.
Chemical space is a concept to organize molecular diversity by postulating that different molecules occupy different regions of a mathematical space where the position of each molecule is defined by its properties. Our aim is to develop methods to explicitly explore chemical space in the area of drug discovery. Here we review our implementations of machine learning in this project, including our use of deep neural networks to enumerate the GDB13 database from a small sample set, to generate analogs of drugs and natural products after training with fragment-size molecules, and to predict the polypharmacology of molecules after training with known bioactive compounds from ChEMBL. We also discuss visualization methods for big data as means to keep track and learn from machine learning results. Computational tools discussed in this review are freely available at and .
化学空间是一个通过假定不同分子占据数学空间的不同区域来组织分子多样性的概念,其中每个分子的位置由其性质定义。我们的目标是开发在药物发现领域中明确探索化学空间的方法。在此,我们回顾我们在该项目中机器学习的实现,包括我们使用深度神经网络从小样本集枚举GDB13数据库、在用片段大小的分子训练后生成药物和天然产物的类似物,以及在用来自ChEMBL的已知生物活性化合物训练后预测分子的多药理学。我们还讨论了大数据的可视化方法,作为跟踪和从机器学习结果中学习的手段。本综述中讨论的计算工具可在 和 免费获取。