Consonni Viviana, Todeschini Roberto, Pavan Manuela
Department of Environmental Sciences, Milano - Bicocca University, P.za della Scienza 1, 20126 Milano, Italy.
J Chem Inf Comput Sci. 2002 May-Jun;42(3):682-92. doi: 10.1021/ci015504a.
Novel molecular descriptors based on a leverage matrix similar to that defined in statistics and usually used for regression diagnostics are presented. This leverage matrix, called Molecular Influence Matrix (MIM), is here proposed as a new molecular representation easily calculated from the spatial coordinates of the molecule atoms in a chosen conformation. The proposed molecular descriptors are called GETAWAY (GEometry, Topology, and Atom-Weights AssemblY) as they try to match 3D-molecular geometry provided by the molecular influence matrix and atom relatedness by molecular topology, with chemical information by using different atomic weightings (atomic mass, polarizability, van der Waals volume, and electronegativity, together with unit weights). A first set of molecular descriptors, called H-GETAWAY, is derived by using only the information provided by the molecular influence matrix, while a second set, called R-GETAWAY, combines this information with geometric interatomic distances in the molecule. The prediction ability in structure-property correlations of the new descriptors was tested by analyzing regressions of these descriptors for selected properties of octanes.
本文提出了基于与统计学中定义的杠杆矩阵类似且通常用于回归诊断的新型分子描述符。这种杠杆矩阵称为分子影响矩阵(MIM),在此被提议作为一种新的分子表示形式,它可以根据所选构象中分子原子的空间坐标轻松计算得出。所提出的分子描述符被称为GETAWAY(几何、拓扑和原子权重组合),因为它们试图通过分子影响矩阵提供的三维分子几何结构、分子拓扑结构的原子相关性以及使用不同的原子权重(原子质量、极化率、范德华体积和电负性以及单位权重)来匹配化学信息。第一组分子描述符称为H-GETAWAY,仅通过分子影响矩阵提供的信息得出,而第二组称为R-GETAWAY,则将此信息与分子中的几何原子间距离相结合。通过分析这些描述符对辛烷选定性质的回归,测试了新描述符在结构-性质相关性方面的预测能力。