双曲空间中的半监督分层药物嵌入。

Semi-supervised Hierarchical Drug Embedding in Hyperbolic Space.

机构信息

Intelligent Systems Program, School of Computing and Information, University of Pittsburgh, Pittsburgh, Pennsylvania 15206, United States.

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania 15206, United States.

出版信息

J Chem Inf Model. 2020 Dec 28;60(12):5647-5657. doi: 10.1021/acs.jcim.0c00681. Epub 2020 Nov 3.

DOI:10.1021/acs.jcim.0c00681

PMID:33140969

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7943198/

Abstract

Learning accurate drug representations is essential for tasks such as computational drug repositioning and prediction of drug side effects. A drug hierarchy is a valuable source that encodes knowledge of relations among drugs in a tree-like structure where drugs that act on the same organs, treat the same disease, or bind to the same biological target are grouped together. However, its utility in learning drug representations has not yet been explored, and currently described drug representations cannot place novel molecules in a drug hierarchy. Here, we develop a semi-supervised drug embedding that incorporates two sources of information: (1) underlying chemical grammar that is inferred from chemical structures of drugs and drug-like molecules (unsupervised) and (2) hierarchical relations that are encoded in an expert-crafted hierarchy of approved drugs (supervised). We use the Variational Auto-Encoder (VAE) framework to encode the chemical structures of molecules and use the drug-drug similarity information obtained from the hierarchy to induce the clustering of drugs in hyperbolic space. The hyperbolic space is amenable for encoding hierarchical relations. Both quantitative and qualitative results support that the learned drug embedding can accurately reproduce the chemical structure and recapitulate the hierarchical relations among drugs. Furthermore, our approach can infer the pharmacological properties of novel molecules by retrieving similar drugs from the embedding space. We demonstrate that our drug embedding can predict new uses and discover new side effects of existing drugs. We show that it significantly outperforms comparison methods in both tasks.

摘要

学习准确的药物表示对于计算药物重定位和预测药物副作用等任务至关重要。药物层级结构是一种有价值的资源，它以树状结构编码了药物之间的关系知识，其中作用于相同器官、治疗相同疾病或与相同生物靶点结合的药物被分组在一起。然而，它在学习药物表示方面的应用尚未得到探索，并且目前描述的药物表示法无法将新分子置于药物层级结构中。在这里，我们开发了一种半监督药物嵌入，它结合了两种信息来源：（1）从药物和类药物分子的化学结构中推断出的基本化学语法（无监督）和（2）在专家精心制作的批准药物层级结构中编码的层次关系（监督）。我们使用变分自动编码器（VAE）框架对分子的化学结构进行编码，并使用从层次结构中获得的药物-药物相似性信息来诱导药物在双曲空间中的聚类。双曲空间适合编码层次关系。定量和定性结果都支持所学习的药物嵌入可以准确地再现化学结构，并概括药物之间的层次关系。此外，我们的方法可以通过从嵌入空间中检索相似的药物来推断新分子的药理学特性。我们证明了我们的药物嵌入可以预测现有药物的新用途和发现新的副作用。我们表明，它在这两个任务中的表现都明显优于比较方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e79/7943198/cc4df7762a34/nihms-1672193-f0001.jpg

相似文献

Semi-supervised Hierarchical Drug Embedding in Hyperbolic Space.双曲空间中的半监督分层药物嵌入。

J Chem Inf Model. 2020 Dec 28;60(12):5647-5657. doi: 10.1021/acs.jcim.0c00681. Epub 2020 Nov 3.

Gating-Enhanced Hierarchical Structure Learning in Hyperbolic Space and Multi-scale Neighbor Topology Learning in Euclidean Space for Prediction of Microbe-Drug Associations.用于预测微生物-药物关联的双曲空间中的门控增强层次结构学习和欧几里得空间中的多尺度邻居拓扑学习。

J Chem Inf Model. 2024 Oct 14;64(19):7806-7815. doi: 10.1021/acs.jcim.4c01340. Epub 2024 Sep 26.

Adverse Drug Reaction Predictions Using Stacking Deep Heterogeneous Information Network Embedding Approach.基于堆叠深度异质信息网络嵌入方法的药物不良反应预测。

Molecules. 2018 Dec 4;23(12):3193. doi: 10.3390/molecules23123193.

Drug Repositioning by Integrating Known Disease-Gene and Drug-Target Associations in a Semi-supervised Learning Model.通过在半监督学习模型中整合已知疾病-基因和药物-靶点关联进行药物重新定位

Acta Biotheor. 2018 Dec;66(4):315-331. doi: 10.1007/s10441-018-9325-z. Epub 2018 Apr 26.

Exploring Hierarchical Information in Hyperbolic Space for Self-Supervised Image Hashing.探索双曲空间中的层次信息用于自监督图像哈希

IEEE Trans Image Process. 2024;33:1768-1781. doi: 10.1109/TIP.2024.3371358. Epub 2024 Mar 8.

Hyperbolic hierarchical knowledge graph embeddings for biological entities.用于生物实体的双曲分层知识图谱嵌入

J Biomed Inform. 2023 Nov;147:104503. doi: 10.1016/j.jbi.2023.104503. Epub 2023 Sep 29.

A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration.一种通过异构数据集成进行药物重定位的两层无监督聚类方法。

BMC Bioinformatics. 2018 Apr 11;19(1):129. doi: 10.1186/s12859-018-2123-4.

Molecular descriptor analysis of approved drugs using unsupervised learning for drug repurposing.使用无监督学习对已批准药物进行分子描述符分析，以实现药物再利用。

Comput Biol Med. 2021 Nov;138:104856. doi: 10.1016/j.compbiomed.2021.104856. Epub 2021 Sep 10.

MHADTI: predicting drug-target interactions via multiview heterogeneous information network embedding with hierarchical attention mechanisms.MHADTI：基于层次注意力机制的多视图异质信息网络嵌入预测药物-靶标相互作用

Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac434.

Polypharmacy side effect prediction based on semi-implicit graph variational auto-encoder.基于半隐式图变分自动编码器的药物副作用预测。

J Bioinform Comput Biol. 2024 Aug;22(4):2450020. doi: 10.1142/S0219720024500203. Epub 2024 Sep 12.

引用本文的文献

A promising drug repurposing approach for Alzheimer's treatment: Givinostat improves cognitive behavior and pathological features in APP/PS1 mice.一种用于治疗阿尔茨海默病的有前景的药物重新利用方法：吉维诺司他改善APP/PS1小鼠的认知行为和病理特征。

Redox Biol. 2024 Dec;78:103420. doi: 10.1016/j.redox.2024.103420. Epub 2024 Nov 6.

Weakly supervised video anomaly detection based on hyperbolic space.基于双曲空间的弱监督视频异常检测

Sci Rep. 2024 Nov 1;14(1):26348. doi: 10.1038/s41598-024-77505-4.

Towards explainable interaction prediction: Embedding biological hierarchies into hyperbolic interaction space.迈向可解释的交互预测：将生物层次结构嵌入双曲交互空间。

PLoS One. 2024 Mar 21;19(3):e0300906. doi: 10.1371/journal.pone.0300906. eCollection 2024.

PolyID: Artificial Intelligence for Discovering Performance-Advantaged and Sustainable Polymers.PolyID：用于发现性能优越且可持续聚合物的人工智能。

Macromolecules. 2023 Oct 19;56(21):8547-8557. doi: 10.1021/acs.macromol.3c00994. eCollection 2023 Nov 14.

FLONE: fully Lorentz network embedding for inferring novel drug targets.FLONE：用于推断新型药物靶点的全洛伦兹网络嵌入

Bioinform Adv. 2023 May 24;3(1):vbad066. doi: 10.1093/bioadv/vbad066. eCollection 2023.

Hyperbolic matrix factorization improves prediction of drug-target associations.双曲矩阵分解提高药物-靶标关联预测。

Sci Rep. 2023 Jan 18;13(1):959. doi: 10.1038/s41598-023-27995-5.

De novo Prediction of Cell-Drug Sensitivities Using Deep Learning-based Graph Regularized Matrix Factorization.基于深度学习的图正则化矩阵分解的细胞药物敏感性从头预测。

Pac Symp Biocomput. 2022;27:278-289.

本文引用的文献

VAE-Sim: A Novel Molecular Similarity Measure Based on a Variational Autoencoder.VAE-Sim：一种基于变分自动编码器的新型分子相似性度量方法。

Molecules. 2020 Jul 29;25(15):3446. doi: 10.3390/molecules25153446.

Assessing the impact of generative AI on medicinal chemistry.评估生成式人工智能对药物化学的影响。

Nat Biotechnol. 2020 Feb;38(2):143-145. doi: 10.1038/s41587-020-0418-2.

Efficient multi-objective molecular optimization in a continuous latent space.连续潜在空间中的高效多目标分子优化。

Chem Sci. 2019 Jul 8;10(34):8016-8024. doi: 10.1039/c9sc01928f. eCollection 2019 Sep 14.

Analyzing Learned Molecular Representations for Property Prediction.分析用于性质预测的学习分子表示。

J Chem Inf Model. 2019 Aug 26;59(8):3370-3388. doi: 10.1021/acs.jcim.9b00237. Epub 2019 Aug 13.

Discovering Links Between Side Effects and Drugs Using a Diffusion Based Method.利用基于扩散的方法发现药物副作用之间的关联。

Sci Rep. 2019 Jul 18;9(1):10436. doi: 10.1038/s41598-019-46939-6.

Representation Tradeoffs for Hyperbolic Embeddings.双曲嵌入的表示权衡

Proc Mach Learn Res. 2018;80:4460-4469.

Exploiting machine learning for end-to-end drug discovery and development.利用机器学习进行端到端的药物发现和开发。

Nat Mater. 2019 May;18(5):435-441. doi: 10.1038/s41563-019-0338-z. Epub 2019 Apr 18.

Applications of machine learning in drug discovery and development.机器学习在药物发现和开发中的应用。

Nat Rev Drug Discov. 2019 Jun;18(6):463-477. doi: 10.1038/s41573-019-0024-5.

Drug repurposing: progress, challenges and recommendations.药物重定位：进展、挑战和建议。

Nat Rev Drug Discov. 2019 Jan;18(1):41-58. doi: 10.1038/nrd.2018.168. Epub 2018 Oct 12.

Machine learning for molecular and materials science.机器学习在分子和材料科学中的应用。

Nature. 2018 Jul;559(7715):547-555. doi: 10.1038/s41586-018-0337-2. Epub 2018 Jul 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验