Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.
Acta Pharmacol Sin. 2011 Nov;32(11):1424-30. doi: 10.1038/aps.2011.112. Epub 2011 Oct 3.
To construct a reliable computational model for the classification of agonists and antagonists of 5-HT(1A) receptor.
Support vector machine (SVM), a well-known machine learning method, was employed to build a prediction model, and genetic algorithm (GA) was used to select the most relevant descriptors and to optimize two important parameters, C and r of the SVM model. The overall dataset used in this study comprised 284 ligands of the 5-HT(1A) receptor with diverse structures reported in the literatures.
A SVM model was successfully developed that could be used to predict the probability of a ligand being an agonist or antagonist of the 5-HT(1A) receptor. The predictive accuracy for training and test sets was 0.942 and 0.865, respectively. For compounds with probability estimate higher than 0.7, the predictive accuracy of the model for training and test sets was 0.954 and 0.927, respectively. To further validate our model, the receiver operating characteristic (ROC) curve was plotted, and the Area-Under-the-ROC- Curve (AUC) value was calculated to be 0.883 for training set and 0.906 for test set.
A reliable SVM model was successfully developed that could effectively distinguish agonists and antagonists among the ligands of the 5-HT(1A) receptor. To our knowledge, this is the first effort for the classification of 5-HT(1A) receptor agonists and antagonists based on a diverse dataset. This method may be used to classify the ligands of other members of the GPCR family.
构建一个可靠的计算模型,用于对 5-HT(1A) 受体激动剂和拮抗剂进行分类。
支持向量机(SVM)是一种著名的机器学习方法,用于构建预测模型,遗传算法(GA)用于选择最相关的描述符,并优化 SVM 模型的两个重要参数 C 和 r。本研究使用的总体数据集包括 284 种文献报道的具有不同结构的 5-HT(1A) 受体配体。
成功开发了一种 SVM 模型,可用于预测配体成为 5-HT(1A) 受体激动剂或拮抗剂的概率。训练集和测试集的预测准确率分别为 0.942 和 0.865。对于概率估计值高于 0.7 的化合物,模型对训练集和测试集的预测准确率分别为 0.954 和 0.927。为了进一步验证我们的模型,绘制了接收器工作特征(ROC)曲线,并计算出训练集的 AUC 值为 0.883,测试集的 AUC 值为 0.906。
成功开发了一种可靠的 SVM 模型,可有效区分 5-HT(1A) 受体配体中的激动剂和拮抗剂。据我们所知,这是首次基于多样化数据集对 5-HT(1A) 受体激动剂和拮抗剂进行分类的尝试。该方法可用于对 GPCR 家族其他成员的配体进行分类。