Prediction of drug metabolites using neural machine translation.

Author information

Litsa Eleni E, Das Payel, Kavraki Lydia E

Affiliations

Department of Computer Science, Rice University, Houston, TX, USA

IBM Research AI, IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598, USA

Publication information

Chem Sci. 2020 Sep 24;11(47):12777-12788. doi: 10.1039/d0sc02639e.

DOI:10.1039/d0sc02639e

PMID:34094473

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8162519/

Abstract

Metabolic processes in the human body can alter the structure of a drug, affecting its efficacy and safety. As a result, investigating the metabolic fate of a candidate drug is an essential part of drug design studies. Computational approaches have been developed for the prediction of possible drug metabolites in an effort to assist the traditional and resource-demanding experimental route. Current methodologies are based upon metabolic transformation rules, which are tied to specific enzyme families and therefore lack generalization, and which may additionally involve manual work from experts, limiting scalability. We present a rule-free, end-to-end learning-based method for predicting possible human metabolites of small molecules, including drugs. The metabolite prediction task is approached as a sequence translation problem, with chemical compounds represented using the SMILES notation. We perform transfer learning on a deep learning transformer model for sequence translation, originally trained on chemical reaction data, to predict the outcome of human metabolic reactions. We further build an ensemble model to account for multiple and diverse metabolites. Extensive evaluation reveals that the proposed method generalizes well to different enzyme families, as it can correctly predict metabolites through phase I and phase II drug metabolism as well as other enzymes. Compared to existing rule-based approaches, our method has equivalent performance on the major enzyme families, while it additionally finds metabolites through less common enzymes. Our results indicate that the proposed approach can provide a comprehensive study of drug metabolism that is not restricted to the major enzyme families and does not require the extraction of transformation rules.
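To make the "sequence translation" framing concrete, here is a minimal sketch of how a parent compound and one metabolite, both written as SMILES, could be turned into a space-separated source/target token pair of the kind a sequence-to-sequence model consumes. The regular expression, the function names `tokenize` and `to_translation_pair`, and the paracetamol to NAPQI example are illustrative assumptions and are not taken from the paper; the authors' actual preprocessing may differ.

```python
import re
from typing import List, Tuple

# Simplified SMILES token pattern (an assumption for illustration only):
# bracket atoms, the two-letter halogens Br and Cl, two-digit ring closures
# such as %10, and otherwise any single character.
SMILES_TOKEN = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|.")


def tokenize(smiles: str) -> List[str]:
    """Split a SMILES string into tokens without losing any characters."""
    tokens = SMILES_TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError(f"could not tokenize SMILES losslessly: {smiles!r}")
    return tokens


def to_translation_pair(parent: str, metabolite: str) -> Tuple[str, str]:
    """Build a (source, target) pair of space-separated token strings."""
    return " ".join(tokenize(parent)), " ".join(tokenize(metabolite))


if __name__ == "__main__":
    # Paracetamol and its oxidative metabolite NAPQI, used only as an example.
    source, target = to_translation_pair(
        "CC(=O)Nc1ccc(O)cc1", "CC(=O)N=C1C=CC(=O)C=C1"
    )
    print("source:", source)
    print("target:", target)
```

A trained translation model would read the source line and emit a target line token by token; removing the spaces from the output recovers a SMILES string that can then be checked for chemical validity.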

Figure 1 (image): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c25d/8162519/88bb047409c6/d0sc02639e-f1.jpg
