Wang Yifei, Li Yunrui, Liu Lin, Hong Pengyu, Xu Hao
Department of Computer Science, Brandeis University, Waltham, Massachusetts 02453-2728, United States.
Department of Chemistry, Stanford University, Stanford, California 94305, United States.
J Chem Inf Model. 2025 Jul 14;65(13):6547-6557. doi: 10.1021/acs.jcim.5c00430. Epub 2025 Jun 23.
The versatility of multimodal deep learning holds tremendous promise for advancing scientific research and practical applications. As this field continues to evolve, the collective power of cross-modal analysis promises to drive transformative innovations, opening new frontiers in chemical understanding and drug discovery. Hence, we introduce asymmetric contrastive multimodal learning (ACML), a specifically designed approach to enhance molecular understanding and accelerate advancements in drug discovery. ACML harnesses the power of effective asymmetric contrastive learning to seamlessly transfer information from various chemical modalities to molecular graph representations. By combining pretrained chemical unimodal encoders and a shallow-designed graph encoder with 5 layers, ACML facilitates the assimilation of coordinated chemical semantics from different modalities, leading to comprehensive representation learning with efficient training. We demonstrate the effectiveness of this framework through large-scale cross-modality retrieval and isomer discrimination tasks. Additionally, ACML enhances interpretability by revealing chemical semantics in graph presentations and bolsters the expressive power of graph neural networks, as evidenced by improved performance in molecular property prediction tasks from MoleculeNet and Therapeutics Data Commons (TDC). Ultimately, ACML exemplifies its potential to revolutionize molecular representational learning, offering deeper insights into the chemical semantics of diverse modalities and paving the way for groundbreaking advancements in chemical research and drug discovery.
多模态深度学习的多功能性为推进科学研究和实际应用带来了巨大希望。随着这一领域的不断发展,跨模态分析的集体力量有望推动变革性创新,在化学理解和药物发现方面开辟新的前沿领域。因此,我们引入了不对称对比多模态学习(ACML),这是一种专门设计的方法,用于增强分子理解并加速药物发现的进展。ACML利用有效的不对称对比学习的力量,将信息从各种化学模态无缝转移到分子图表示中。通过将预训练的化学单模态编码器和一个设计简单的5层图编码器相结合,ACML促进了来自不同模态的协调化学语义的同化,从而通过高效训练实现全面的表示学习。我们通过大规模跨模态检索和异构体辨别任务证明了该框架的有效性。此外,ACML通过在图表示中揭示化学语义增强了可解释性,并增强了图神经网络的表达能力,这在来自分子网络(MoleculeNet)和治疗数据共享库(TDC)的分子性质预测任务中性能的提升得到了证明。最终,ACML展示了其变革分子表示学习的潜力,为不同模态的化学语义提供了更深入的见解,并为化学研究和药物发现的突破性进展铺平了道路。