Department of Bioscience and Biotechnology, Sejong University, 209 Neungdong-ro, Kwangjin-gu, Seoul 05006, Korea.
Phys Chem Chem Phys. 2019 Feb 27;21(9):5189-5199. doi: 10.1039/c8cp07002d.
We establish a comprehensive quantitative structure-activity relationship (QSAR) model termed AlphaQ through the machine learning algorithm to associate the fully quantum mechanical molecular descriptors with various biochemical and pharmacological properties. Preliminarily, a novel method for molecular structural alignments was developed in such a way to maximize the quantum mechanical cross correlations among the molecules. Besides the improvement of structural alignments, three-dimensional (3D) distribution of the molecular electrostatic potential was introduced as the unique numerical descriptor for individual molecules. These dual modifications lead to a substantial accuracy enhancement in multifarious 3D-QSAR prediction models of AlphaQ. Most remarkably, AlphaQ has been proven to be applicable to structurally diverse molecules to the extent that it outperforms the conventional QSAR methods in estimating the inhibitory activity against thrombin, the water-cyclohexane distribution coefficient, the permeability across the membrane of the Caco-2 cell, and the metabolic stability in human liver microsomes. Due to the simplicity in model building and the high predictive capability for varying biochemical and pharmacological properties, AlphaQ is anticipated to serve as a valuable screening tool at both early and late stages of drug discovery.
我们通过机器学习算法建立了一个全面的定量构效关系(QSAR)模型,称为 AlphaQ,以将全量子力学分子描述符与各种生化和药理学性质联系起来。首先,开发了一种新的分子结构对齐方法,以最大化分子之间的量子力学交叉相关。除了改进结构对齐外,还引入了分子静电势的三维(3D)分布作为单个分子的独特数值描述符。这两个改进导致 AlphaQ 的各种 3D-QSAR 预测模型的准确性得到了显著提高。最值得注意的是,AlphaQ 已被证明适用于结构多样化的分子,以至于它在估计对凝血酶的抑制活性、水-环己烷分配系数、Caco-2 细胞的膜通透性和人肝微粒体中的代谢稳定性方面优于传统的 QSAR 方法。由于模型构建简单,并且对各种生化和药理学性质具有高预测能力,因此预计 AlphaQ 将成为药物发现早期和后期的有价值的筛选工具。