Ivanova A A, Ivanov A A, Oliferenko A A, Palyulin V A, Zefirov N S
Department of Chemistry, Moscow State University, Moscow, 119992 Russia.
SAR QSAR Environ Res. 2005 Jun;16(3):231-46. doi: 10.1080/10659360500037115.
An improved strategy of quantitative structure-property relationship (QSPR) studies of diverse and inhomogeneous organic datasets has been proposed. A molecular connectivity term was successively corrected for different structural features encoded in fragmental descriptors. The so-called solvation index 1chis (a weighted Randic index) was used as a "leading" variable and standardized molecular fragments were employed as "corrective" class-specific variables. Performance of the new approach was illustrated by modelling a dataset of experimental normal boiling points of 833 organic compounds belonging to 20 structural classes. Firstly, separate QSPR models were derived for each class and for eight groups of structurally similar classes. Finally, a general model formed by combining all the classes together was derived (r2=0.957, s=12.9degreesC). The strategy outlined can find application in QSPR analyses of massive, highly diverse databases of organic compounds.
已提出一种用于对多样且不均匀的有机数据集进行定量结构-性质关系(QSPR)研究的改进策略。针对片段描述符中编码的不同结构特征,对分子连接性项进行了连续校正。所谓的溶剂化指数1chis(加权兰迪指数)用作“主导”变量,标准化的分子片段用作“校正”类特异性变量。通过对属于20个结构类别的833种有机化合物的实验正常沸点数据集进行建模,说明了新方法的性能。首先,为每个类别以及八组结构相似的类别推导了单独的QSPR模型。最后,通过将所有类别组合在一起形成了一个通用模型(r2 = 0.957,s = 12.9℃)。所概述的策略可应用于对大量、高度多样的有机化合物数据库的QSPR分析。