Hou Xiaoyang, Zhu Tian, Ren Milong, Duan Bo, Zhang Chunming, Bu Dongbo, Sun Shiwei
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Beijing, 100049, China.
School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, 100049, China.
Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae524.
Molecular representation learning is pivotal for advancing deep learning applications in quantum chemistry and drug discovery. Existing methods for molecular representation learning often fall short of fully capturing the intricate interactions within chemical bonds of 2D topological graphs and the multifaceted effects of 3D geometric conformations.
To overcome these challenges, we present a novel contrastive learning strategy for molecular representation learning, named Geometric Triangle Awareness Model (GTAM). This method integrates innovative molecular encoders for both 2D graphs and 3D conformations, enabling the accurate capture of geometric dependencies among edges in graph-based molecular structures. Furthermore, GTAM is bolstered by the development of two contrastive training objectives designed to facilitate the direct transfer of edge information between 2D topological graphs and 3D geometric conformations, enhancing the functionality of the molecular encoders. Through extensive evaluations on a range of 2D and 3D downstream tasks, our model has demonstrated superior performance over existing approaches.
The test code and data of GTAM are available online at https://github.com/StellaHxy/GTAM.
分子表示学习对于推动深度学习在量子化学和药物发现中的应用至关重要。现有的分子表示学习方法往往无法充分捕捉二维拓扑图化学键内的复杂相互作用以及三维几何构象的多方面影响。
为了克服这些挑战,我们提出了一种用于分子表示学习的新型对比学习策略,名为几何三角形感知模型(GTAM)。该方法集成了用于二维图和三维构象的创新分子编码器,能够准确捕捉基于图的分子结构中边之间的几何依赖性。此外,GTAM通过开发两个对比训练目标得到加强,旨在促进二维拓扑图和三维几何构象之间边信息的直接传递,增强分子编码器的功能。通过对一系列二维和三维下游任务的广泛评估,我们的模型已证明其性能优于现有方法。
GTAM的测试代码和数据可在https://github.com/StellaHxy/GTAM上在线获取。