Netzeva Tatiana I, Aptula Aynur O, Benfenati Emilio, Cronin Mark T D, Gini Giuseppina, Lessigiarska Iglika, Maran Uko, Vracko Marjan, Schüürmann Gerrit
School of Pharmacy and Chemistry, Liverpool John Moores University, Byrom Street, Liverpool L3 3AF, England.
J Chem Inf Model. 2005 Jan-Feb;45(1):106-14. doi: 10.1021/ci049747p.
The quality of quantitative structure-activity relationship (QSAR) models depends on the quality of their constitutive elements including the biological activity, statistical procedure applied, and the physicochemical and structural descriptors. The aim of this study was to assess the comparative use of ab initio and semiempirical quantum chemical calculations for the development of toxicological QSARs applied to a large and chemically diverse data set. A heterogeneous collection of 568 organic compounds with 96 h acute toxicity measured to the fish fathead minnow (Pimephales promelas) was utilized. A total of 162 descriptors were calculated using the semiempirical AM1 Hamiltonian, and 121 descriptors were compiled using an ab initio (B3LYP/6-31G**) method. The QSARs were derived using multiple linear regression (MLR) and partial least squares (PLS) analyses. Statistically similar models were obtained using AM1 and B3LYP calculated descriptors supported by the use of the logarithm of the octanol-water partition coefficient (log K(ow)). The main difference between the models derived by both MLR and PLS with the two sets of quantum chemical descriptors was concentrated on the type of descriptors selected. It was concluded that for large-scale predictions, irrespective of the mechanism of toxic action, the use of precise but time-consuming ab initio methods does not offer considerable advantage compared to the semiempirical calculations and could be avoided.
定量构效关系(QSAR)模型的质量取决于其构成要素的质量,包括生物活性、所应用的统计程序以及物理化学和结构描述符。本研究的目的是评估从头算和半经验量子化学计算在开发应用于大型且化学性质多样数据集的毒理学QSAR中的比较用途。使用了一组由568种有机化合物组成的异质集合,这些化合物对黑头呆鱼(Pimephales promelas)的96小时急性毒性已测定。使用半经验AM1哈密顿量计算了总共162个描述符,并使用从头算(B3LYP/6-31G**)方法汇编了121个描述符。通过多元线性回归(MLR)和偏最小二乘法(PLS)分析得出QSAR。使用AM1和B3LYP计算的描述符,并借助辛醇-水分配系数的对数(log K(ow)),获得了统计上相似的模型。由MLR和PLS使用两组量子化学描述符得出的模型之间的主要差异集中在所选描述符的类型上。得出的结论是,对于大规模预测,无论毒性作用机制如何,与半经验计算相比,使用精确但耗时的从头算方法并没有明显优势,因此可以避免使用。