Schrödinger Inc., 120 West 45th Street, New York, NY 10036, USA.
Bioorg Med Chem. 2012 Sep 15;20(18):5379-87. doi: 10.1016/j.bmc.2012.03.037. Epub 2012 Mar 24.
Compound libraries comprise an integral component of drug discovery in the pharmaceutical and biotechnology industries. While in-house libraries often contain millions of molecules, this number pales in comparison to the accessible space of drug-like molecules. Therefore, care must be taken when adding new compounds to an existing library in order to ensure that unexplored regions in the chemical space are filled efficiently while not needlessly increasing the library size. In this work, we present an automated method to fill holes in an existing library using compounds from an external source and apply it to commercially available fragment libraries. The method, called Canvas HF, uses distances computed from 2D chemical fingerprints and selects compounds that fill vacuous regions while not suffering from the problem of selecting only compounds at the edge of the chemical space. We show that the method is robust with respect to different databases and the number of requested compounds to retrieve. We also present an extension of the method where chemical properties can be considered simultaneously with the selection process to bias the compounds toward a desired property space without imposing hard property cutoffs. We compare the results of Canvas HF to those obtained with a standard sphere exclusion method and with random compound selection and find that Canvas HF performs favorably. Overall, the method presented here offers an efficient and effective hole-filling strategy to augment compound libraries with compounds from external sources. The method does not have any fit parameters and therefore it should be applicable in most hole-filling applications.
化合物库是制药和生物技术行业药物发现的一个组成部分。虽然内部库通常包含数百万种分子,但与可药用分子的可用空间相比,这个数量相形见绌。因此,在向现有库中添加新化合物时必须小心谨慎,以确保在有效填补化学空间中未探索区域的同时,不会不必要地增加库的大小。在这项工作中,我们提出了一种使用外部来源的化合物来填补现有库中空白的自动化方法,并将其应用于商业上可用的片段库。该方法称为 Canvas HF,使用从二维化学指纹计算的距离,并选择填充空洞区域的化合物,而不会出现仅选择化学空间边缘化合物的问题。我们表明,该方法对不同的数据库和请求化合物的数量具有鲁棒性,可以检索。我们还提出了该方法的扩展,其中可以同时考虑化学性质和选择过程,以便在不施加硬性性质截止值的情况下将化合物偏向所需的性质空间。我们将 Canvas HF 的结果与标准球体排除方法以及随机化合物选择的结果进行比较,发现 Canvas HF 的性能较好。总体而言,这里提出的方法提供了一种有效的化合物库扩充策略,可以用外部来源的化合物来填充化合物库。该方法没有任何拟合参数,因此应该适用于大多数填补空白的应用。