Guru Gobind Singh College of Pharmacy, Yamunanagar-135001, India.
Comput Biol Med. 2012 Oct;42(10):1026-41. doi: 10.1016/j.compbiomed.2012.08.006. Epub 2012 Sep 8.
Four novel distance based molecular descriptors termed as superpendentic eccentric distance sum indices 1-4 (denoted by:∫P-1EDS, ∫P-2EDS, ∫P-3EDS and ∫P-4EDS) as well as their topochemical counterparts (denoted by:∫cP-1EDS, ∫cP-2EDS, ∫cP-3EDS and ∫cP-4EDS) have been conceptualized and developed in the present study. The sensitivity towards branching, discriminating power, and degeneracy of the proposed novel descriptors were investigated. Utility of these indices was investigated for development of models through decision tree and moving average analysis for the prediction of human corticotropin releasing factor-1 receptor binding affinity of substituted pyrazines. A wide variety of 46 2D and 3D molecular descriptors including proposed indices was employed for development of models through decision tree and moving average analysis. The calculation of most of these descriptors for each compound of the dataset was performed using online E-Dragon software (version 1.0). An in-house computer programme was also employed to calculate additional topological descriptors which did not figure in E-Dragon software. The decision tree classified and correctly predicted the input data with an impressive accuracy of 92% in the training set and 71% during cross-validation. A total of three descriptors, identified by decision tree, were subsequently utilized for development of suitable models using moving average analysis. These models predicted human corticotropin releasing factor-1 receptor binding affinity with an accuracy of ≥85%. The statistical significance of models was assessed through sensitivity, specificity and Matthew's correlation coefficient. High discriminating power, high sensitivity towards branching amalgamated with negligible degeneracy offer proposed descriptors a vast potential for use in the quantitative structure-activity/property/toxicity relationships so as to facilitate drug design.
本研究提出了四个新的基于距离的分子描述符,分别称为超级偏心距离和指数 1-4(记为:∫P-1EDS、∫P-2EDS、∫P-3EDS 和∫P-4EDS)以及它们的拓扑对应物(记为:∫cP-1EDS、∫cP-2EDS、∫cP-3EDS 和∫cP-4EDS)。研究了这些新描述符的分支敏感性、区分能力和简并性。通过决策树和移动平均分析,研究了这些指数在取代吡嗪的人促皮质素释放因子-1 受体结合亲和力预测模型开发中的应用。通过决策树和移动平均分析,利用了广泛的 46 个二维和三维分子描述符,包括提出的描述符,开发模型。数据集的每个化合物的大多数这些描述符的计算是使用在线 E-Dragon 软件(版本 1.0)完成的。还使用内部计算机程序计算了 E-Dragon 软件中未包含的其他拓扑描述符。决策树对输入数据进行分类和正确预测,在训练集中的准确率为 92%,在交叉验证中的准确率为 71%,令人印象深刻。决策树确定了三个描述符,随后使用移动平均分析开发合适的模型。这些模型预测人促皮质素释放因子-1 受体结合亲和力的准确率≥85%。通过敏感性、特异性和马修相关系数评估模型的统计学意义。高区分能力、对分支的高敏感性与可忽略的简并性相结合,为提出的描述符提供了在定量构效关系/性质/毒性关系中广泛应用的潜力,以促进药物设计。