Gombar Vijay K, Polli Joseph W, Humphreys Joan E, Wring Stephen A, Serabjit-Singh Cosette S
Department of Drug Metabolism and Pharmacokinetics, Metabolic and Viral Diseases' Center of Excellence for Drug Discovery, GlaxoSmithKline, Research Triangle Park, North Carolina 27709, USA.
J Pharm Sci. 2004 Apr;93(4):957-68. doi: 10.1002/jps.20035.
A quantitative structure-activity relationship (QSAR) model has been developed to predict whether a given compound is a P-glycoprotein (Pgp) substrate or not. The training set consisted of 95 compounds classified as substrates or non-substrates based on the results from in vitro monolayer efflux assays. The two-group linear discriminant model uses 27 statistically significant, information-rich structure quantifiers to compute the probability of a given structure to be a Pgp substrate. Analysis of the descriptors revealed that the ability to partition into membranes, molecular bulk, and the counts and electrotopological values of certain isolated and bonded hydrides are important structural attributes of substrates. The model fits the data with sensitivity of 100% and specificity of 90.6% in the jackknifed cross-validation test. A prediction accuracy of 86.2% was obtained on a test set of 58 compounds. Examination of the eight "mispredicted" compounds revealed two distinct categories. Five mispredictions were explained by experimental limitations of the efflux assay; these compounds had high permeability and/or were inhibitors of calcein-AM transport. Three mispredictions were due to limitations of the chemical space covered by the current model. The Pgp QSAR model provides an in silico screen to aid in compound selection and in vitro efflux assay prioritization.
已开发出一种定量构效关系(QSAR)模型,用于预测给定化合物是否为P-糖蛋白(Pgp)底物。训练集由95种化合物组成,这些化合物根据体外单层流出试验的结果被分类为底物或非底物。两组线性判别模型使用27个具有统计学意义且信息丰富的结构量化指标来计算给定结构成为Pgp底物的概率。对描述符的分析表明,分配到膜中的能力、分子体积以及某些孤立和键合氢化物的计数和电子拓扑值是底物的重要结构属性。在留一法交叉验证测试中,该模型对数据的拟合灵敏度为100%,特异性为90.6%。在一个由58种化合物组成的测试集上获得了86.2%的预测准确率。对8种“预测错误”的化合物进行检查后发现了两个不同的类别。5个预测错误可由流出试验的实验局限性来解释;这些化合物具有高渗透性和/或为钙黄绿素-AM转运的抑制剂。3个预测错误是由于当前模型所涵盖的化学空间的局限性。Pgp QSAR模型提供了一种计算机模拟筛选方法,以帮助化合物选择和体外流出试验的优先级排序。