Suppr超能文献

一种黄酮类化合物对P-糖蛋白亲和力的新模型:采用改进粒子群优化算法选择特征的遗传算法-支持向量机

A new model of flavonoids affinity towards P-glycoprotein: genetic algorithm-support vector machine with features selected by a modified particle swarm optimization algorithm.

作者信息

Cui Ying, Chen Qinggang, Li Yaxiao, Tang Ling

机构信息

Department of Medicine Chemistry, Logistics College of Chinese People's Armed Police Forces, Tianjin, 300309, China.

Tianjin Key Laboratory of Occupational and Environmental Hazards Biomarkers, Tianjin, 300309, China.

出版信息

Arch Pharm Res. 2017 Feb;40(2):214-230. doi: 10.1007/s12272-016-0876-8. Epub 2016 Dec 27.

Abstract

Flavonoids exhibit a high affinity for the purified cytosolic NBD (C-terminal nucleotide-binding domain) of P-glycoprotein (P-gp). To explore the affinity of flavonoids for P-gp, quantitative structure-activity relationship (QSAR) models were developed using support vector machines (SVMs). A novel method coupling a modified particle swarm optimization algorithm with random mutation strategy and a genetic algorithm coupled with SVM was proposed to simultaneously optimize the kernel parameters of SVM and determine the subset of optimized features for the first time. Using DRAGON descriptors to represent compounds for QSAR, three subsets (training, prediction and external validation set) derived from the dataset were employed to investigate QSAR. With excluding of the outlier, the correlation coefficient (R) of the whole training set (training and prediction) was 0.924, and the R of the external validation set was 0.941. The root-mean-square error (RMSE) of the whole training set was 0.0588; the RMSE of the cross-validation of the external validation set was 0.0443. The mean Q value of leave-many-out cross-validation was 0.824. With more informations from results of randomization analysis and applicability domain, the proposed model is of good predictive ability, stability.

摘要

黄酮类化合物对纯化的P-糖蛋白(P-gp)的胞质核苷结合域(C端核苷酸结合域)具有高亲和力。为了探究黄酮类化合物对P-gp的亲和力,使用支持向量机(SVM)建立了定量构效关系(QSAR)模型。提出了一种将改进的粒子群优化算法与随机突变策略相结合以及将遗传算法与SVM相结合的新方法,首次同时优化SVM的核参数并确定优化特征子集。使用DRAGON描述符来表示用于QSAR的化合物,采用从数据集中导出的三个子集(训练集、预测集和外部验证集)来研究QSAR。排除异常值后,整个训练集(训练集和预测集)的相关系数(R)为0.924,外部验证集的R为0.941。整个训练集的均方根误差(RMSE)为0.0588;外部验证集交叉验证的RMSE为0.0443。留多法交叉验证的平均Q值为0.824。基于来自随机分析结果和适用域的更多信息,所提出的模型具有良好的预测能力和稳定性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验