Li Qingliang, Bender Andreas, Pei Jianfeng, Lai Luhua
Beijing National Laboratory for Molecular Sciences, State Key Laboratory of Structural Chemistry for Stable and Unstable Species, College of Chemistry and Molecular Engineering, and Center for Theoretical Biology, Peking University, 100871 Beijing, China.
J Chem Inf Model. 2007 Sep-Oct;47(5):1776-86. doi: 10.1021/ci700107y. Epub 2007 Aug 25.
Probabilistic support vector machine (SVM) in combination with ECFP_4 (Extended Connectivity Fingerprints) were applied to establish a druglikeness filter for molecules. Here, the World Drug Index (WDI) and the Available Chemical Directory (ACD) were used as surrogates for druglike and nondruglike molecules, respectively. Compared with published methods using the same data sets, the classifier significantly improved the prediction accuracy, especially when using a larger data set of 341 601 compounds, which further pushed the correct classification rates up to 92.73%. On the other hand, most characteristic features for drugs and nondrugs found by the current method were visualized, which might be useful as guiding fragments for de novo drug design and fragment based drug design.
将概率支持向量机(SVM)与ECFP_4(扩展连接指纹)相结合,用于建立分子的类药筛选器。在此,世界药物索引(WDI)和可用化学目录(ACD)分别用作类药分子和非类药分子的替代物。与使用相同数据集的已发表方法相比,该分类器显著提高了预测准确性,尤其是在使用包含341601种化合物的更大数据集时,这进一步将正确分类率提高到了92.73%。另一方面,当前方法发现的大多数药物和非药物的特征都得到了可视化,这可能作为从头药物设计和基于片段的药物设计的指导片段。