Ghaemian Paria, Shayanfar Ali
Biotechnology Research Center and Faculty of Pharmacy, Tabriz University of Medical Sciences, Tabriz, Iran.
Student Research Committee, Tabriz University of Medical Sciences, Tabriz, Iran.
Curr Comput Aided Drug Des. 2019;15(3):212-224. doi: 10.2174/1573409914666181003152042.
Permeability glycoprotein (P-gp) is one of the cell membrane proteins that can push some drugs out of the cell causing drug tolerance and its inhibition can prevent drug resistance.
In this study, we used image-based Quantitative Structure-Activity Relationship (QSAR) models to predict the P-gp inhibitory activity of epigallocatechin and gallocatechin derivatives.
The 2D-chemical structures and their P-gp inhibitory activity were taken from literature. The pixels of images and their Principal Components (PCs) were calculated using MATLAB software. Principle Component Regression (PCR), Artificial Neural Network (ANN) and Support Vector Machine (SVM) approaches were used to develop QSAR models. Statistical parameters included the leave one out cross-validated correlation coefficient (q2) for internal validation of the models and R2 of test set, Root Mean Square Error (RMSE) and Concordance Correlation Coefficient (CCC) were applied for external validation.
Six PCs from image analysis method were selected by stepwise regression for developing linear and non-linear models. Non-linear models i.e. ANN (with the R2 of 0.80 for test set) were chosen as the best for the established QSAR models.
According to the result of the external validation, ANN model based on image analysis method can predict the P-gp inhibitory activity of epigallocatechin and gallocatechin derivatives better than the PCR and SVM models.
通透性糖蛋白(P-糖蛋白)是一种细胞膜蛋白,可将某些药物排出细胞外,导致药物耐受性,对其进行抑制可预防耐药性。
在本研究中,我们使用基于图像的定量构效关系(QSAR)模型来预测表没食子儿茶素和没食子儿茶素衍生物的P-糖蛋白抑制活性。
二维化学结构及其P-糖蛋白抑制活性取自文献。使用MATLAB软件计算图像像素及其主成分(PC)。采用主成分回归(PCR)、人工神经网络(ANN)和支持向量机(SVM)方法建立QSAR模型。统计参数包括用于模型内部验证的留一法交叉验证相关系数(q2)和测试集的R2,均方根误差(RMSE)和一致性相关系数(CCC)用于外部验证。
通过逐步回归从图像分析方法中选择六个主成分来建立线性和非线性模型。非线性模型即人工神经网络(测试集的R2为0.80)被选为已建立的QSAR模型中的最佳模型。
根据外部验证结果,基于图像分析方法的人工神经网络模型在预测表没食子儿茶素和没食子儿茶素衍生物的P-糖蛋白抑制活性方面优于PCR和SVM模型。