Singh Narender, Guha Rajarshi, Giulianotti Marc A, Pinilla Clemencia, Houghten Richard A, Medina-Franco Jose L
Torrey Pines Institute for Molecular Studies, 11350 SW Village Parkway, Port St. Lucie, Florida 34987, USA.
J Chem Inf Model. 2009 Apr;49(4):1010-24. doi: 10.1021/ci800426u.
A multiple criteria approach is presented, that is used to perform a comparative analysis of four recently developed combinatorial libraries to drugs, Molecular Libraries Small Molecule Repository (MLSMR) and natural products. The compound databases were assessed in terms of physicochemical properties, scaffolds, and fingerprints. The approach enables the analysis of property space coverage, degree of overlap between collections, scaffold and structural diversity, and overall structural novelty. The degree of overlap between combinatorial libraries and drugs was assessed using the R-NN curve methodology, which measures the density of chemical space around a query molecule embedded in the chemical space of a target collection. The combinatorial libraries studied in this work exhibit scaffolds that were not observed in the drug, MLSMR, and natural products databases. The fingerprint-based comparisons indicate that these combinatorial libraries are structurally different than current drugs. The R-NN curve methodology revealed that a proportion of molecules in the combinatorial libraries is located within the property space of the drugs. However, the R-NN analysis also showed that there are a significant number of molecules in several combinatorial libraries that are located in sparse regions of the drug space.
本文提出了一种多标准方法,用于对最近开发的四个组合文库与药物、分子文库小分子储存库(MLSMR)和天然产物进行比较分析。从物理化学性质、支架和指纹方面对化合物数据库进行了评估。该方法能够分析性质空间覆盖范围、集合之间的重叠程度、支架和结构多样性以及整体结构新颖性。使用R-NN曲线方法评估组合文库与药物之间的重叠程度,该方法测量嵌入目标集合化学空间中的查询分子周围化学空间的密度。本研究中所研究的组合文库展示出在药物、MLSMR和天然产物数据库中未观察到的支架。基于指纹的比较表明,这些组合文库在结构上与现有药物不同。R-NN曲线方法表明,组合文库中的一部分分子位于药物的性质空间内。然而,R-NN分析还表明,几个组合文库中有大量分子位于药物空间的稀疏区域。