Alvarez-Ginarte Yoanna María, Crespo-Otero Rachel, Marrero-Ponce Yovani, Noheda-Marin Pedro, Garcia de la Vega Jose Manuel, Montero-Cabrera Luis Alberto, Ruiz García José Alberto, Caldera-Luzardo José A, Alvarado Ysaias J
Pharmaceutical Chemistry Center, 16042 La Habana, Cuba.
Bioorg Med Chem. 2008 Jun 15;16(12):6448-59. doi: 10.1016/j.bmc.2008.04.001. Epub 2008 Apr 7.
Predictive quantitative structure-activity relationship (QSAR) models of anabolic and androgenic activities for the testosterone and dihydrotestosterone steroid analogues were obtained by means of multiple linear regression using quantum and physicochemical molecular descriptors (MD) as well as a genetic algorithm for the selection of the best subset of variables. Quantitative models found for describing the anabolic (androgenic) activity are significant from a statistical point of view: R(2) of 0.84 (0.72 and 0.70). A leave-one-out cross-validation procedure revealed that the regression models had a fairly good predictability [q(2) of 0.80 (0.60 and 0.59)]. In addition, other QSAR models were developed to predict anabolic/androgenic (A/A) ratios and the best regression equation explains 68% of the variance for the experimental values of AA ratio and has a rather adequate q(2) of 0.51. External validation, by using test sets, was also used in each experiment in order to evaluate the predictive power of the obtained models. The result shows that these QSARs have quite good predictive abilities (R(2) of 0.90, 0.72 (0.55), and 0.53) for anabolic activity, androgenic activity, and A/A ratios, respectively. Last, a Williams plot was used in order to define the domain of applicability of the models as a squared area within +/-2 band for residuals and a leverage threshold of h=0.16. No apparent outliers were detected and the models can be used with high accuracy in this applicability domain. MDs included in our QSAR models allow the structural interpretation of the biological process, evidencing the main role of the shape of molecules, hydrophobicity, and electronic properties. Attempts were made to include lipophilicity (octanol-water partition coefficient (logP)) and electronic (hardness (eta)) values of the whole molecules in the multivariate relations. It was found from the study that the logP of molecules has positive contribution to the anabolic and androgenic activities and high values of eta produce unfavorable effects. The found MDs can also be efficiently used in similarity studies based on cluster analysis. Our model for the anabolic/androgenic ratio (expressed by weight of levator ani muscle, LA, and seminal vesicle, SV, in mice) predicts that the 2-aminomethylene-17alpha-methyl-17beta-hydroxy-5alpha-androstan-3-one (43) compound is the most potent anabolic steroid, and the 17alpha-methyl-2beta,17beta-dihydroxy-5alpha-androstane (31) compound is the least potent one of this series. The approach described in this report is an alternative for the discovery and optimization of leading anabolic compounds among steroids and analogues. It also gives an important role to electron exchange terms of molecular interactions to this kind of steroid activity.
通过使用量子和物理化学分子描述符(MD)以及用于选择最佳变量子集的遗传算法,借助多元线性回归获得了睾酮和双氢睾酮类固醇类似物的合成代谢和雄激素活性的预测性定量构效关系(QSAR)模型。从统计学角度来看,用于描述合成代谢(雄激素)活性的定量模型具有显著性:R(2)为0.84(0.72和0.70)。留一法交叉验证程序表明回归模型具有相当好的预测能力[q(2)为0.80(0.60和0.59)]。此外,还开发了其他QSAR模型来预测合成代谢/雄激素(A/A)比值,最佳回归方程解释了AA比值实验值68%的方差,并且具有相当合适的q(2)值0.51。在每个实验中还使用测试集进行外部验证,以评估所获得模型的预测能力。结果表明,这些QSAR分别对合成代谢活性、雄激素活性和A/A比值具有相当好的预测能力(R(2)分别为0.90、0.72(0.55)和0.53)。最后,使用威廉姆斯图将模型的适用域定义为残差在+/-2范围内的平方区域以及杠杆阈值h = 0.16。未检测到明显的异常值,并且这些模型可以在该适用域内高精度地使用。我们的QSAR模型中包含的MD允许对生物过程进行结构解释,证明了分子形状、疏水性和电子性质的主要作用。尝试将整个分子的亲脂性(辛醇 - 水分配系数(logP))和电子性质(硬度(eta))值纳入多元关系中。研究发现,分子的logP对合成代谢和雄激素活性有正向贡献,而高eta值会产生不利影响。所发现的MD也可以有效地用于基于聚类分析的相似性研究中。我们的合成代谢/雄激素比值模型(以小鼠提肛肌(LA)和精囊(SV)的重量表示)预测,2 - 氨亚甲基 - 17α - 甲基 - 17β - 羟基 - 5α - 雄甾烷 - 3 - 酮(43)化合物是最有效的合成代谢类固醇,而该系列中17α - 甲基 - 2β,17β - 二羟基 - 5α - 雄甾烷(31)化合物是效力最低的。本报告中描述的方法是在类固醇及其类似物中发现和优化领先合成代谢化合物的一种替代方法。它还表明分子相互作用中的电子交换项对这类类固醇活性具有重要作用。