Raush Eugene, Abagyan Ruben, Totrov Maxim
Molsoft L.L.C., 11199 Sorrento Valley Road, S209, San Diego, California 92121, United States.
Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States.
J Chem Theory Comput. 2024 May 14;20(9):4054-4063. doi: 10.1021/acs.jctc.4c00280. Epub 2024 Apr 26.
We present a neural-network-based high-throughput molecular conformer-generation algorithm. A chemical graph-convolutional network is trained to predict low-energy conformers in internal coordinate representation (bond lengths, bond, and torsion angles), starting from two-dimensional (2D) chemical topology. Generative neural network (NN) architecture performs denoising from torsion space, producing conformer ensembles with populations that are well correlated with torsion energy profiles. Short force-field-based energy minimization is applied to refine final conformers. All computation-intensive stages of the algorithm are GPU-optimized. The procedure (termed GINGER) is benchmarked on a commonly used test set of bioactive three-dimensional (3D) conformers from the PDB. We demonstrate highly competitive results in conformer recovery and throughput rates suitable for giga-scale compound library processing. A web server that allows interactive conformer ensemble generation by GINGER and their viewing is made freely available at https://www.molsoft.com/gingerdemo.html.
我们提出了一种基于神经网络的高通量分子构象异构体生成算法。训练一个化学图卷积网络,从二维(2D)化学拓扑结构开始,预测内部坐标表示(键长、键角和扭转角)中的低能量构象异构体。生成神经网络(NN)架构从扭转空间执行去噪,生成构象异构体集合,其种群与扭转能量分布高度相关。应用基于短力场的能量最小化来优化最终构象异构体。该算法的所有计算密集型阶段都进行了GPU优化。该程序(称为GINGER)在来自PDB的常用生物活性三维(3D)构象异构体测试集上进行了基准测试。我们在构象异构体恢复和适合千兆规模化合物库处理的通量率方面展示了极具竞争力的结果。一个允许通过GINGER交互式生成构象异构体集合并进行查看的网络服务器可在https://www.molsoft.com/gingerdemo.html上免费获得。