Roy Kunal, Ghosh Gopinath
Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India.
Chem Biol Drug Des. 2008 Nov;72(5):383-94. doi: 10.1111/j.1747-0285.2008.00712.x.
In this communication, we have developed quantitative predictive models using human lethal concentration values of 26 organic compounds including some pharmaceuticals with extended topochemical atom (ETA) indices applying different chemometric tools and compared the extended topochemical atom models with the models developed from non-extended topochemical atom ones. Extended topochemical atom descriptors were also tried in combination with non-extended topochemical atom descriptors to develop better predictive models. The use of extended topochemical atom descriptors along with non-extended topochemical atom ones improved equation statistics and cross-validation quality. The best model with sound statistical quality was developed from partial least squares regression using extended topochemical atom descriptors in combination non-extended topochemical atom ones. Finally, to check true predictability of the ETA parameters, the data set was divided into training (n = 19) and test (n = 7) sets. Partial least squares and genetic partial least squares models were developed from the training set using extended topochemical atom indices and the models were validated using the test set. The extended topochemical atom models developed from different statistical tools suggest that the toxicity increases with bulk, chloro functionality, presence of electronegative atoms within a chain or ring and unsaturation, and decreases with hydroxy functionality and branching. The results suggest that the extended topochemical atom descriptors are sufficiently rich in chemical information to encode the structural features for QSAR/QSPR/QSTR modeling.
在本通讯中,我们利用26种有机化合物(包括一些药物)的人类致死浓度值,运用扩展拓扑化学原子(ETA)指数,使用不同的化学计量工具开发了定量预测模型,并将扩展拓扑化学原子模型与由非扩展拓扑化学原子模型开发的模型进行了比较。还尝试将扩展拓扑化学原子描述符与非扩展拓扑化学原子描述符结合起来,以开发更好的预测模型。使用扩展拓扑化学原子描述符与非扩展拓扑化学原子描述符相结合,提高了方程统计量和交叉验证质量。具有良好统计质量的最佳模型是通过使用扩展拓扑化学原子描述符与非扩展拓扑化学原子描述符相结合的偏最小二乘回归开发的。最后,为了检验ETA参数的真实预测能力,将数据集分为训练集(n = 19)和测试集(n = 7)。使用扩展拓扑化学原子指数从训练集开发偏最小二乘和遗传偏最小二乘模型,并使用测试集对模型进行验证。从不同统计工具开发的扩展拓扑化学原子模型表明,毒性随着体积、氯官能团、链或环内电负性原子的存在以及不饱和度的增加而增加,随着羟基官能团和支化的增加而降低。结果表明,扩展拓扑化学原子描述符在化学信息方面足够丰富,能够为QSAR/QSPR/QSTR建模编码结构特征。