Cui Ying, Chen Qinggang, Li Yaxiao, Tang Ling
Department of Medicine Chemistry, Logistics College of Chinese People's Armed Police Forces, Tianjin, 300309, China.
Tianjin Key Laboratory of Occupational and Environmental Hazards Biomarkers, Tianjin, 300309, China.
Arch Pharm Res. 2017 Feb;40(2):214-230. doi: 10.1007/s12272-016-0876-8. Epub 2016 Dec 27.
Flavonoids exhibit a high affinity for the purified cytosolic NBD (C-terminal nucleotide-binding domain) of P-glycoprotein (P-gp). To explore the affinity of flavonoids for P-gp, quantitative structure-activity relationship (QSAR) models were developed using support vector machines (SVMs). A novel method coupling a modified particle swarm optimization algorithm with random mutation strategy and a genetic algorithm coupled with SVM was proposed to simultaneously optimize the kernel parameters of SVM and determine the subset of optimized features for the first time. Using DRAGON descriptors to represent compounds for QSAR, three subsets (training, prediction and external validation set) derived from the dataset were employed to investigate QSAR. With excluding of the outlier, the correlation coefficient (R) of the whole training set (training and prediction) was 0.924, and the R of the external validation set was 0.941. The root-mean-square error (RMSE) of the whole training set was 0.0588; the RMSE of the cross-validation of the external validation set was 0.0443. The mean Q value of leave-many-out cross-validation was 0.824. With more informations from results of randomization analysis and applicability domain, the proposed model is of good predictive ability, stability.
黄酮类化合物对纯化的P-糖蛋白(P-gp)的胞质核苷结合域(C端核苷酸结合域)具有高亲和力。为了探究黄酮类化合物对P-gp的亲和力,使用支持向量机(SVM)建立了定量构效关系(QSAR)模型。提出了一种将改进的粒子群优化算法与随机突变策略相结合以及将遗传算法与SVM相结合的新方法,首次同时优化SVM的核参数并确定优化特征子集。使用DRAGON描述符来表示用于QSAR的化合物,采用从数据集中导出的三个子集(训练集、预测集和外部验证集)来研究QSAR。排除异常值后,整个训练集(训练集和预测集)的相关系数(R)为0.924,外部验证集的R为0.941。整个训练集的均方根误差(RMSE)为0.0588;外部验证集交叉验证的RMSE为0.0443。留多法交叉验证的平均Q值为0.824。基于来自随机分析结果和适用域的更多信息,所提出的模型具有良好的预测能力和稳定性。