Kortagere Sandhya, Chekmarev Dmitriy, Welsh William J, Ekins Sean
Department of Pharmacology and Environmental Bioinformatics and Computational Toxicology Center (ebCTC), University of Medicine & Dentistry of New Jersey (UMDNJ)-Robert Wood Johnson Medical School, 675 Hoes Lane, Piscataway, New Jersey, 08854, USA.
Pharm Res. 2008 Aug;25(8):1836-45. doi: 10.1007/s11095-008-9584-5. Epub 2008 Apr 16.
The goals of the present study were to apply a generalized regression model and support vector machine (SVM) models with Shape Signatures descriptors, to the domain of blood-brain barrier (BBB) modeling.
The Shape Signatures method is a novel computational tool that was used to generate molecular descriptors utilized with the SVM classification technique with various BBB datasets. For comparison purposes we have created a generalized linear regression model with eight MOE descriptors and these same descriptors were also used to create SVM models.
The generalized regression model was tested on 100 molecules not in the model and resulted in a correlation r2 = 0.65. SVM models with MOE descriptors were superior to regression models, while Shape Signatures SVM models were comparable or better than those with MOE descriptors. The best 2D shape signature models had 10-fold cross validation prediction accuracy between 80-83% and leave-20%-out testing prediction accuracy between 80-82% as well as correctly predicting 84% of BBB+ compounds (n = 95) in an external database of drugs.
Our data indicate that Shape Signatures descriptors can be used with SVM and these models may have utility for predicting blood-brain barrier permeation in drug discovery.
本研究的目标是将具有形状特征描述符的广义回归模型和支持向量机(SVM)模型应用于血脑屏障(BBB)建模领域。
形状特征方法是一种新型计算工具,用于生成与各种BBB数据集的SVM分类技术一起使用的分子描述符。为了进行比较,我们创建了一个具有八个MOE描述符的广义线性回归模型,并且这些相同的描述符也用于创建SVM模型。
在模型中未出现的100个分子上对广义回归模型进行了测试,相关系数r2 = 0.65。具有MOE描述符的SVM模型优于回归模型,而形状特征SVM模型与具有MOE描述符的模型相当或更好。最佳的二维形状特征模型在10倍交叉验证中的预测准确率在80 - 83%之间,留20%法测试的预测准确率在80 - 82%之间,并且在一个外部药物数据库中正确预测了84%的BBB +化合物(n = 95)。
我们的数据表明形状特征描述符可与SVM一起使用,并且这些模型可能在药物发现中预测血脑屏障通透性方面具有实用性。