Ahmadi Mehdi, Shahlaei Mohsen
Student Research Committee, Kermanshah University of Medical Sciences, Kermanshah, I.R. Iran.
Meical Biology Research Center, Kermanshah University of Medical Sciences, Kermanshah, I.R. Iran.
Res Pharm Sci. 2015 Jul-Aug;10(4):307-25.
P2X7 antagonist activity for a set of 49 molecules of the P2X7 receptor antagonists, derivatives of purine, was modeled with the aid of chemometric and artificial intelligence techniques. The activity of these compounds was estimated by means of combination of principal component analysis (PCA), as a well-known data reduction method, genetic algorithm (GA), as a variable selection technique, and artificial neural network (ANN), as a non-linear modeling method. First, a linear regression, combined with PCA, (principal component regression) was operated to model the structure-activity relationships, and afterwards a combination of PCA and ANN algorithm was employed to accurately predict the biological activity of the P2X7 antagonist. PCA preserves as much of the information as possible contained in the original data set. Seven most important PC's to the studied activity were selected as the inputs of ANN box by an efficient variable selection method, GA. The best computational neural network model was a fully-connected, feed-forward model with 7-7-1 architecture. The developed ANN model was fully evaluated by different validation techniques, including internal and external validation, and chemical applicability domain. All validations showed that the constructed quantitative structure-activity relationship model suggested is robust and satisfactory.
借助化学计量学和人工智能技术,对一组49种嘌呤衍生物的P2X7受体拮抗剂分子的P2X7拮抗剂活性进行了建模。这些化合物的活性通过主成分分析(PCA,一种著名的数据降维方法)、遗传算法(GA,一种变量选择技术)和人工神经网络(ANN,一种非线性建模方法)相结合的方式进行估计。首先,将线性回归与PCA相结合(主成分回归)来建立构效关系模型,随后采用PCA和ANN算法的组合来准确预测P2X7拮抗剂的生物活性。PCA尽可能多地保留原始数据集中包含的信息。通过一种有效的变量选择方法GA,选择了对所研究活性最重要的7个主成分作为ANN模型的输入。最佳的计算神经网络模型是一个具有7-7-1架构的全连接前馈模型。所开发的ANN模型通过不同的验证技术进行了全面评估,包括内部和外部验证以及化学适用域。所有验证均表明所提出的构建定量构效关系模型是稳健且令人满意的。