Marrero-Ponce Yovani
Department of Pharmacy, Faculty of Chemical-Pharmacy, and Department of Drug Design, Chemical Bioactive Center, Central University of Las Villas, Santa Clara, 54830, Villa Clara, Cuba.
J Chem Inf Comput Sci. 2004 Nov-Dec;44(6):2010-26. doi: 10.1021/ci049950k.
This report describes a new set of molecular descriptors of relevance to QSAR/QSPR studies and drug design, atom linear indices fk(xi). These atomic level chemical descriptors are based on the calculation of linear maps on Rn[fk(xi): Rn--> Rn] in canonical basis. In this context, the kth power of the molecular pseudograph's atom adjacency matrix [Mk(G)] denotes the matrix of fk(xi) with respect to the canonical basis. In addition, a local-fragment (atom-type) formalism was developed. The kth atom-type linear indices are calculated by summing the kth atom linear indices of all atoms of the same atom type in the molecules. Moreover, total (whole-molecule) linear indices are also proposed. This descriptor is a linear functional (linear form) on Rn. That is, the kth total linear indices is a linear map from Rn to the scalar R[ fk(x): Rn --> R]. Thus, the kth total linear indices are calculated by summing the atom linear indices of all atoms in the molecule. The features of the kth total and local linear indices are illustrated by examples of various types of molecular structures, including chain-lengthening, branching, heteroatoms-content, and multiple bonds. Additionally, the linear independence of the local linear indices to other 0D, 1D, 2D, and 3D molecular descriptors is demonstrated by using principal component analysis for 42 very heterogeneous molecules. Much redundancy and overlapping was found among total linear indices and most of the other structural indices presently in use in the QSPR/QSAR practice. On the contrary, the information carried by atom-type linear indices was strikingly different from that codified in most of the 229 0D-3D molecular descriptors used in this study. It is concluded that the local linear indices are an independent indices containing important structural information to be used in QSPR/QSAR and drug design studies. In this sense, atom, atom-type, and total linear indices were used for the prediction of pIC50 values for the cleavage process of a set of flavone derivatives inhibitors of HIV-1 integrase. Quantitative models found are significant from a statistical point of view (R of 0.965, 0.902, and 0.927, respectively) and permit a clear interpretation of the studied properties in terms of the structural features of molecules. A LOO cross-validation procedure revealed that the regression models had a fairly good predictability (q2 of 0.679, 0.543, and 0.721, respectively). The comparison with other approaches reveals good behavior of the method proposed. The approach described in this paper appears to be an excellent alternative or guides for discovery and optimization of new lead compounds.
本报告描述了一组与定量构效关系/定量构性关系(QSAR/QSPR)研究及药物设计相关的新分子描述符——原子线性指数fk(xi)。这些原子水平的化学描述符基于在标准基下Rn上线性映射的计算[fk(xi): Rn→Rn]。在此背景下,分子伪图的原子邻接矩阵的k次幂[Mk(G)]表示fk(xi)相对于标准基的矩阵。此外,还开发了一种局部片段(原子类型)形式体系。第k个原子类型线性指数通过对分子中相同原子类型的所有原子的第k个原子线性指数求和来计算。此外,还提出了总的(全分子)线性指数。该描述符是Rn上的线性泛函(线性形式)。也就是说,第k个总线性指数是从Rn到标量R的线性映射[fk(x): Rn→R]。因此,第k个总线性指数通过对分子中所有原子的原子线性指数求和来计算。通过各种类型分子结构的例子,包括链增长、分支、杂原子含量和多重键,说明了第k个总线性指数和局部线性指数的特征。此外,通过对42个非常不同的分子进行主成分分析,证明了局部线性指数与其他0维、1维、2维及3维分子描述符的线性独立性。在QSAR/QSPR实践中,发现总线性指数与目前使用的大多数其他结构指数之间存在大量冗余和重叠。相反,原子类型线性指数所携带的信息与本研究中使用的229个0维至3维分子描述符中的大多数所编码的信息明显不同。得出的结论是,局部线性指数是独立的指数,包含重要的结构信息,可用于QSAR/QSPR和药物设计研究。从这个意义上说,原子、原子类型和总线性指数被用于预测一组HIV-1整合酶黄酮衍生物抑制剂裂解过程的pIC50值。从统计学角度来看,所发现的定量模型具有显著性(R分别为0.965、0.902和0.927),并且能够根据分子的结构特征对所研究的性质进行清晰的解释。留一法交叉验证程序表明回归模型具有相当好的预测能力(q2分别为0.679、0.543和0.721)。与其他方法的比较表明所提出的方法表现良好。本文所述方法似乎是发现和优化新先导化合物的极佳选择或指导。