• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于分子性质预测的基于活性悬崖的对比学习

Activity Cliff-Informed Contrastive Learning for Molecular Property Prediction.

作者信息

Shen Wan Xiang, Cui Chao, Su Xiaorui, Zhang Zaixi, Velez-Arce Alejandro, Wang Jianming, Shi Xiangcheng, Zhang Yanbing, Wu Jie, Chen Yu Zong, Zitnik Marinka

机构信息

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Department of Chemistry, National University of Singapore, 117543, Singapore.

出版信息

Res Sq. 2024 Dec 4:rs.3.rs-2988283. doi: 10.21203/rs.3.rs-2988283/v2.

DOI:10.21203/rs.3.rs-2988283/v2
PMID:39678335
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11643338/
Abstract

Modeling molecular activity and quantitative structure-activity relationships of chemical compounds is critical in drug design. Graph neural networks, which utilize molecular structures as frames, have shown success in assessing the biological activity of chemical compounds, guiding the selection and optimization of candidates for further development. However, current models often overlook activity cliffs (ACs)-cases where structurally similar molecules exhibit different bioactivities-due to latent spaces primarily optimized for structural features. Here, we introduce AC-awareness (ACA), an inductive bias designed to enhance molecular representation learning for activity modeling. The ACA jointly optimizes metric learning in the latent space and task performance in the target space, making models more sensitive to ACs. We develop ACANet, an AC-informed contrastive learning approach that can be integrated with any graph neural network. Experiments on 39 benchmark datasets demonstrate that AC-informed representations of chemical compounds consistently outperform standard models in bioactivity prediction across both regression and classification tasks. AC-informed models show strong performance in predicting pharmacokinetic and safety-relevant molecular properties. ACA paves the way toward activity-informed molecular representations, providing a valuable tool for the early stages of lead compound identification, refinement, and virtual screening.

摘要

对化合物的分子活性和定量构效关系进行建模在药物设计中至关重要。以分子结构为框架的图神经网络在评估化合物的生物活性、指导进一步开发的候选物的选择和优化方面已取得成功。然而,由于潜在空间主要针对结构特征进行优化,当前模型常常忽略活性悬崖(ACs),即结构相似的分子表现出不同生物活性的情况。在此,我们引入了AC感知(ACA),这是一种归纳偏差,旨在增强用于活性建模的分子表示学习。ACA在潜在空间中联合优化度量学习和目标空间中的任务性能,使模型对ACs更加敏感。我们开发了ACANet,一种可与任何图神经网络集成的基于AC的对比学习方法。在39个基准数据集上进行的实验表明,在回归和分类任务的生物活性预测中,化合物的基于AC的表示始终优于标准模型。基于AC的模型在预测药代动力学和安全相关分子性质方面表现出强大性能。ACA为基于活性的分子表示铺平了道路,为先导化合物识别、优化和虚拟筛选的早期阶段提供了有价值的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/52e38bda3ac6/nihpp-rs2988283v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/d3172787072f/nihpp-rs2988283v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/c7caf4f3f467/nihpp-rs2988283v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/eaaa365293ae/nihpp-rs2988283v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/a4c43fe8a707/nihpp-rs2988283v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/8d1ae188f4f5/nihpp-rs2988283v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/0bc49dbb26e9/nihpp-rs2988283v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/52e38bda3ac6/nihpp-rs2988283v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/d3172787072f/nihpp-rs2988283v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/c7caf4f3f467/nihpp-rs2988283v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/eaaa365293ae/nihpp-rs2988283v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/a4c43fe8a707/nihpp-rs2988283v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/8d1ae188f4f5/nihpp-rs2988283v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/0bc49dbb26e9/nihpp-rs2988283v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd34/11643338/52e38bda3ac6/nihpp-rs2988283v2-f0007.jpg

相似文献

1
Activity Cliff-Informed Contrastive Learning for Molecular Property Prediction.用于分子性质预测的基于活性悬崖的对比学习
Res Sq. 2024 Dec 4:rs.3.rs-2988283. doi: 10.21203/rs.3.rs-2988283/v2.
2
OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs.OLB-AC:通过深度图学习和活性悬崖优化配体生物活性。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae365.
3
Exploring QSAR models for activity-cliff prediction.探索用于活性悬崖预测的定量构效关系模型。
J Cheminform. 2023 Apr 17;15(1):47. doi: 10.1186/s13321-023-00708-w.
4
A Topology-Enhanced Multi-Viewed Contrastive Approach for Molecular Graph Representation Learning and Classification.一种用于分子图表示学习和分类的拓扑增强多视图对比方法。
Mol Inform. 2025 Jan;44(1):e202400252. doi: 10.1002/minf.202400252.
5
DGCL: dual-graph neural networks contrastive learning for molecular property prediction.DGCL:用于分子性质预测的双图神经网络对比学习。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae474.
6
Activity cliff-aware reinforcement learning for de novo drug design.用于从头药物设计的活动悬崖感知强化学习
J Cheminform. 2025 Apr 21;17(1):54. doi: 10.1186/s13321-025-01006-3.
7
Molecular property prediction based on graph contrastive learning with partial feature masking.基于部分特征掩码的图对比学习的分子性质预测
J Mol Graph Model. 2025 Jul;138:109014. doi: 10.1016/j.jmgm.2025.109014. Epub 2025 Mar 17.
8
CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning.CMMS-GCL:基于图对比学习的跨模态代谢稳定性预测。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad503.
9
Chemical reaction enhanced graph learning for molecule representation.化学反应增强图学习的分子表示。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae558.
10
An Integrated Fuzzy Neural Network and Topological Data Analysis for Molecular Graph Representation Learning and Property Forecasting.用于分子图表示学习和性质预测的集成模糊神经网络与拓扑数据分析
Mol Inform. 2025 Mar;44(3):e202400335. doi: 10.1002/minf.202400335.

本文引用的文献

1
A BERT-based pretraining model for extracting molecular structural information from a SMILES sequence.一种基于BERT的预训练模型,用于从SMILES序列中提取分子结构信息。
J Cheminform. 2024 Jun 19;16(1):71. doi: 10.1186/s13321-024-00848-7.
2
Chemprop: A Machine Learning Package for Chemical Property Prediction.Chemprop:一个用于化学性质预测的机器学习工具包。
J Chem Inf Model. 2024 Jan 8;64(1):9-17. doi: 10.1021/acs.jcim.3c01250. Epub 2023 Dec 26.
3
A knowledge-guided pre-training framework for improving molecular representation learning.
一种基于知识引导的预训练框架,用于改进分子表示学习。
Nat Commun. 2023 Nov 21;14(1):7568. doi: 10.1038/s41467-023-43214-1.
4
Chemical structure-aware molecular image representation learning.化学结构感知的分子图像表示学习。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad404.
5
Anatomy of Potency Predictions Focusing on Structural Analogues with Increasing Potency Differences Including Activity Cliffs.关注具有递增效价差异的结构类似物的效价预测的解剖学,包括活性悬崖。
J Chem Inf Model. 2023 Nov 27;63(22):7032-7044. doi: 10.1021/acs.jcim.3c01530. Epub 2023 Nov 9.
6
DeepDelta: predicting ADMET improvements of molecular derivatives with deep learning.深度德尔塔:利用深度学习预测分子衍生物的ADMET性质改善情况。
J Cheminform. 2023 Oct 27;15(1):101. doi: 10.1186/s13321-023-00769-x.
7
Limitations of representation learning in small molecule property prediction.小分子性质预测中表示学习的局限性。
Nat Commun. 2023 Oct 13;14(1):6394. doi: 10.1038/s41467-023-41967-3.
8
A systematic study of key elements underlying molecular property prediction.对分子性质预测背后关键要素的系统研究。
Nat Commun. 2023 Oct 13;14(1):6395. doi: 10.1038/s41467-023-41948-6.
9
Computational approaches streamlining drug discovery.计算方法简化药物发现。
Nature. 2023 Apr;616(7958):673-685. doi: 10.1038/s41586-023-05905-z. Epub 2023 Apr 26.
10
Exploring QSAR models for activity-cliff prediction.探索用于活性悬崖预测的定量构效关系模型。
J Cheminform. 2023 Apr 17;15(1):47. doi: 10.1186/s13321-023-00708-w.