Yang Zhijiang, Wang Liangliang, Huang Tengxin, Wang Yunfan, Gao Mingchi, Hou Tingjun, Ding Junjie, Xiao Junhua
State Key Laboratory of Chemistry for NBC Hazards Protection, Beijing, 102205, P. R. China.
Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, P. R. China.
Adv Sci (Weinh). 2025 Sep;12(33):e04867. doi: 10.1002/advs.202504867. Epub 2025 Jun 20.
Recently, various self-supervised learning (SSL) methods based on 3D graph neural networks (GNNs) have been developed to comprehensively represent the structural information of molecules in 3D space; this is essential for discovering new drugs. However, existing methods fail to comprehensively characterize the 3D structures of molecules and neglect the electronic structural information that significantly influences key properties such as molecular reactivity, strong electrostatic interactions, and chemical adsorption. Therefore, here, a novel molecular representation learning method is constructed, Q-GEM, incorporating quantum and geometric structural information enhancement, based on the quantum chemical property database QuanDB and SSL methods. Q-GEM comprises a GNN embedded with the molecular electronic and complete 3D geometrical structural information as well as several well-designed multiscale SSL tasks, achieving superior absolute molecular conformation prediction and conformational discrimination. The Q-GEM achieved state-of-the-art performance in 12 out of 13 prediction tasks on the MoleculeNet dataset, with an average performance improvement of 3.3% and 2.0% for classification and regression prediction tasks, respectively. Moreover, an average performance improvement of 5.2% is achieved in three localized quantum chemical properties, fully demonstrating the excellent performance of Q-GEM in distinguishing molecular electronic structures. The Q-GEM represents a novel, powerful breakthrough for accurate molecular property prediction.
最近,基于三维图神经网络(GNN)的各种自监督学习(SSL)方法已被开发出来,用于全面表示三维空间中分子的结构信息;这对于发现新药至关重要。然而,现有方法未能全面表征分子的三维结构,并且忽略了对分子反应性、强静电相互作用和化学吸附等关键性质有显著影响的电子结构信息。因此,在此构建了一种新颖的分子表示学习方法Q-GEM,它基于量子化学性质数据库QuanDB和SSL方法,融合了量子和几何结构信息增强。Q-GEM包括一个嵌入分子电子和完整三维几何结构信息的GNN以及几个精心设计的多尺度SSL任务,实现了卓越的绝对分子构象预测和构象辨别。Q-GEM在MoleculeNet数据集的13个预测任务中的12个上达到了当前最优性能,分类和回归预测任务的平均性能分别提高了3.3%和2.0%。此外,在三个局部量子化学性质方面平均性能提高了5.2%,充分证明了Q-GEM在区分分子电子结构方面的卓越性能。Q-GEM代表了准确分子性质预测方面的一个新颖而强大的突破。