Xue C X, Zhang X Y, Liu M C, Hu Z D, Fan B T
Department of Chemistry, Lanzhou University, Lanzhou, Gansu 73000, PR China.
J Pharm Biomed Anal. 2005 Jul 1;38(3):497-507. doi: 10.1016/j.jpba.2005.01.035. Epub 2005 Mar 17.
Probabilistic neural networks (PNNs) were utilized for the classifications of 102 active compounds from diverse medicinal plants with anticancer activity against human rhinopharyngocele cell line KB. Molecular descriptors calculated from structure alone were used to represent molecular structures. A subset of the calculated descriptors selected using factor correlation analysis and forward stepwise regression was used to construct the prediction models. Linear discriminant analysis (LDA) was also utilized to construct the classification model to compare the results with those obtained by PNNs. The accuracy of the training set, the cross-validation set, and the test set given by PNNs and LDA were 100, 92.3, 90.9% and 71.8, 92.3, 54.5%, respectively, which indicated that the results obtained by PNNs agree well with the experimental values of these compounds and also revealed the superiority of PNNs over LDA approach for the classification of anticancer activities of compounds. The models built in this work would be of potential help in the design of novel and more potent anticancer agents.
概率神经网络(PNNs)被用于对来自多种具有抗人鼻咽癌KB细胞系抗癌活性的药用植物的102种活性化合物进行分类。仅从结构计算得到的分子描述符用于表示分子结构。使用因子相关分析和向前逐步回归选择的计算描述符子集用于构建预测模型。线性判别分析(LDA)也被用于构建分类模型,以便将结果与PNNs得到的结果进行比较。PNNs和LDA给出的训练集、交叉验证集和测试集的准确率分别为100%、92.3%、90.9%和71.8%、92.3%、54.5%,这表明PNNs得到的结果与这些化合物的实验值吻合良好,也揭示了PNNs在化合物抗癌活性分类方面优于LDA方法。本研究建立的模型将对新型、更有效的抗癌药物设计具有潜在帮助。