Kumar Rajnish, Sharma Anju, Varadwaj Pritish Kumar
Department of Bioinformatics, Indian Institute of Information Technology Allahabad, Deoghat, Jhalwa, Allahabad, India .
J Nat Sci Biol Med. 2011 Jul;2(2):168-73. doi: 10.4103/0976-9668.92325.
A computational model for predicting oral bioavailability is very important both in the early stage of drug discovery to select the promising compounds for further optimizations and in later stage to identify candidates for clinical trials. In present study, we propose a support vector machine (SVM)-based kernel learning approach carried out at a set of 511 chemically diverse compounds with known oral bioavailability values.
For each drug, 12 descriptors were calculated. The selection of optimal hyper-plane parameters was performed with 384 training set data and the prediction efficiency of proposed classifier was tested on 127 test set data.
The overall prediction efficiency for the test set came out to be 96.85%. Youden's index and Matthew correlation index were found to be 0.929 and 0.909, respectively. The area under receiver operating curve (ROC) was found to be 0.943 with standard error 0.0253.
The prediction model suggests that while considering chemoinformatics approaches into account, SVM-based prediction of oral bioavailability can be a significantly important tool for drug development and discovery at a preliminary level.
用于预测口服生物利用度的计算模型在药物发现的早期阶段对于选择有前景的化合物进行进一步优化以及在后期阶段识别临床试验候选药物都非常重要。在本研究中,我们提出了一种基于支持向量机(SVM)的核学习方法,该方法在一组511种具有已知口服生物利用度值的化学结构多样的化合物上进行。
对于每种药物,计算12个描述符。使用384个训练集数据进行最优超平面参数的选择,并在127个测试集数据上测试所提出分类器的预测效率。
测试集的总体预测效率为96.85%。尤登指数和马修相关指数分别为0.929和0.909。发现受试者工作特征曲线(ROC)下的面积为0.943,标准误差为0.0253。
该预测模型表明,在考虑化学信息学方法时,基于支持向量机的口服生物利用度预测在初步水平上可以成为药物开发和发现的一个非常重要的工具。