• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SMICLR:基于多种分子表示的对比学习用于半监督和无监督表示学习。

SMICLR: Contrastive Learning on Multiple Molecular Representations for Semisupervised and Unsupervised Representation Learning.

机构信息

Institute of Science and Technology, Federal University of São Paulo (Unifesp), 12247-014, São José dos Campos, SP, Brazil.

São Carlos Institute of Chemistry, University of São Paulo, P.O. Box 780, 13560-970, São Carlos, SP, Brazil.

出版信息

J Chem Inf Model. 2022 Sep 12;62(17):3948-3960. doi: 10.1021/acs.jcim.2c00521. Epub 2022 Aug 31.

DOI:10.1021/acs.jcim.2c00521
PMID:36044610
Abstract

Machine learning as a tool for chemical space exploration broadens horizons to work with known and unknown molecules. At its core lies molecular representation, an essential key to improve learning about structure-property relationships. Recently, contrastive frameworks have been showing impressive results for representation learning in diverse domains. Therefore, this paper proposes a contrastive framework that embraces multimodal molecular data. Specifically, our approach jointly trains a graph encoder and an encoder for the simplified molecular-input line-entry system (SMILES) string to perform the contrastive learning objective. Since SMILES is the basis of our method, i.e., we built the molecular graph from the SMILES, we call our framework as SMILES Contrastive Learning (SMICLR). When stacking a nonlinear regressor on the SMICLR's pretrained encoder and fine-tuning the entire model, we reduced the prediction error by, on average, 44% and 25% for the energetic and electronic properties of the QM9 data set, respectively, over the supervised baseline. We further improved our framework's performance when applying data augmentations in each molecular-input representation. Moreover, SMICLR demonstrated competitive representation learning results in an unsupervised setting.

摘要

机器学习作为一种探索化学空间的工具,拓宽了研究已知和未知分子的视野。其核心是分子表示,这是改进结构-性质关系学习的关键。最近,对比框架在不同领域的表示学习中取得了令人瞩目的成果。因此,本文提出了一种结合多模态分子数据的对比框架。具体来说,我们的方法联合训练了一个图编码器和一个简化分子输入行式系统(SMILES)字符串的编码器,以执行对比学习目标。由于 SMILES 是我们方法的基础,即我们从 SMILES 构建分子图,因此我们将我们的框架称为 SMILES 对比学习(SMICLR)。当在 SMICLR 的预训练编码器上堆叠一个非线性回归器并微调整个模型时,与监督基线相比,我们分别将 QM9 数据集的能量和电子性质的预测误差平均降低了 44%和 25%。当在每个分子输入表示中应用数据增强时,我们进一步提高了框架的性能。此外,SMICLR 在无监督设置中表现出了有竞争力的表示学习结果。

相似文献

1
SMICLR: Contrastive Learning on Multiple Molecular Representations for Semisupervised and Unsupervised Representation Learning.SMICLR:基于多种分子表示的对比学习用于半监督和无监督表示学习。
J Chem Inf Model. 2022 Sep 12;62(17):3948-3960. doi: 10.1021/acs.jcim.2c00521. Epub 2022 Aug 31.
2
Investigating Contrastive Pair Learning's Frontiers in Supervised, Semisupervised, and Self-Supervised Learning.探究对比对学习在监督学习、半监督学习和自监督学习中的前沿进展。
J Imaging. 2024 Aug 13;10(8):196. doi: 10.3390/jimaging10080196.
3
Attention-wise masked graph contrastive learning for predicting molecular property.基于注意力机制的掩码图对比学习预测分子性质。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac303.
4
Unsupervised graph-level representation learning with hierarchical contrasts.基于分层对比的无监督图级表示学习
Neural Netw. 2023 Jan;158:359-368. doi: 10.1016/j.neunet.2022.11.019. Epub 2022 Nov 26.
5
CONSMI: Contrastive Learning in the Simplified Molecular Input Line Entry System Helps Generate Better Molecules.CONSMI:简化分子输入线性条目系统中的对比学习有助于生成更好的分子。
Molecules. 2024 Jan 19;29(2):495. doi: 10.3390/molecules29020495.
6
Learning self-supervised molecular representations for drug-drug interaction prediction.学习用于药物-药物相互作用预测的自监督分子表示。
BMC Bioinformatics. 2024 Jan 30;25(1):47. doi: 10.1186/s12859-024-05643-7.
7
MoCL: Data-driven Molecular Fingerprint via Knowledge-aware Contrastive Learning from Molecular Graph.MoCL:通过基于分子图的知识感知对比学习实现的数据驱动分子指纹
KDD. 2021 Aug;2021:3585-3594. doi: 10.1145/3447548.3467186. Epub 2021 Aug 14.
8
Community-CL: An Enhanced Community Detection Algorithm Based on Contrastive Learning.社区CL:一种基于对比学习的增强型社区检测算法。
Entropy (Basel). 2023 May 29;25(6):864. doi: 10.3390/e25060864.
9
GraphCL-DTA: A Graph Contrastive Learning With Molecular Semantics for Drug-Target Binding Affinity Prediction.GraphCL-DTA:一种基于分子语义的图对比学习方法,用于药物-靶标结合亲和力预测。
IEEE J Biomed Health Inform. 2024 Aug;28(8):4544-4552. doi: 10.1109/JBHI.2024.3350666. Epub 2024 Aug 6.
10
KAMPNet: multi-source medical knowledge augmented medication prediction network with multi-level graph contrastive learning.KAMPNet:基于多层次图对比学习的多源医学知识增强药物预测网络。
BMC Med Inform Decis Mak. 2023 Oct 30;23(1):243. doi: 10.1186/s12911-023-02325-x.

引用本文的文献

1
Advancing Drug Discovery with Enhanced Chemical Understanding via Asymmetric Contrastive Multimodal Learning.通过不对称对比多模态学习增强化学理解以推进药物发现
J Chem Inf Model. 2025 Jul 14;65(13):6547-6557. doi: 10.1021/acs.jcim.5c00430. Epub 2025 Jun 23.
2
Integrated multimodal hierarchical fusion and meta-learning for enhanced molecular property prediction.用于增强分子性质预测的集成多模态分层融合与元学习
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf251.
3
MolFCL: predicting molecular properties through chemistry-guided contrastive and prompt learning.
MolFCL:通过化学引导的对比学习和提示学习预测分子性质
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf061.
4
PolyCL: contrastive learning for polymer representation learning explicit and implicit augmentations.PolyCL:用于聚合物表示学习的对比学习 显式和隐式增强
Digit Discov. 2024 Nov 28;4(1):149-160. doi: 10.1039/d4dd00236a. eCollection 2025 Jan 15.
5
Complementary multi-modality molecular self-supervised learning via non-overlapping masking for property prediction.通过非重叠掩蔽进行互补多模态分子自监督学习以进行性质预测。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae256.
6
A focus on molecular representation learning for the prediction of chemical properties.专注于用于化学性质预测的分子表示学习。
Chem Sci. 2024 Mar 25;15(14):5052-5055. doi: 10.1039/d4sc90043j. eCollection 2024 Apr 3.
7
MolFeSCue: enhancing molecular property prediction in data-limited and imbalanced contexts using few-shot and contrastive learning.MolFeSCue:利用少样本和对比学习增强数据有限和不平衡环境下的分子性质预测。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae118.
8
CONSMI: Contrastive Learning in the Simplified Molecular Input Line Entry System Helps Generate Better Molecules.CONSMI:简化分子输入线性条目系统中的对比学习有助于生成更好的分子。
Molecules. 2024 Jan 19;29(2):495. doi: 10.3390/molecules29020495.
9
From intuition to AI: evolution of small molecule representations in drug discovery.从直觉到人工智能:药物发现中小分子表示的演变。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad422.