MEN: leveraging explainable multimodal encoding network for precision prediction of CYP450 inhibitors.

Author information

Atwereboannah Abena Achiaa, Wu Wei-Ping, Al-Antari Mugahed A, Yussif Sophyani B, Ejiyi Chukwuebuka J, Tenagyei Edwin K, Kissanga Grace-Mercure B, Emmanuel Gyarteng S A, Gu Yeong Hyeon, Ahene Emmanuel

Affiliations

School of Computer Science and Engineering, University of Electronic Science and Technology, Chengdu, People's Republic of China.

SipingSoft Co. Ltd., Tianfu Software Park, Chengdu, People's Republic of China.

Publication information

Sci Rep. 2025 Jul 1;15(1):21820. doi: 10.1038/s41598-025-04982-6.

DOI:10.1038/s41598-025-04982-6

PMID:40592952

Abstract

Drug-drug interactions (DDIs) present serious risks in clinical settings, especially for patients who are prescribed multiple medications. A major factor contributing to these interactions is the inhibition of cytochrome P450 (CYP450) enzymes, which are vital for drug metabolism. As a result, reliably identifying compounds that may inhibit CYP450 enzymes is a key step in drug development. However, existing machine learning (ML) methods often fall short in terms of prediction accuracy and biological interpretability. To address this challenge, we introduce a Multimodal Encoder Network (MEN) aimed at improving the prediction of CYP450 inhibitors. This model combines three types of molecular data (chemical fingerprints, molecular graphs, and protein sequences) by applying specialized encoders tailored to each format. Specifically, the Fingerprint Encoder Network (FEN) processes molecular fingerprints, the Graph Encoder Network (GEN) extracts structural features from graph-based representations, and the Protein Encoder Network (PEN) captures sequential patterns from protein sequences. By integrating these diverse data types, MEN can extract complementary information that enhances predictive performance. The encoded outputs from FEN, GEN, and PEN are fused to build a comprehensive feature representation. An explainable AI (XAI) module is incorporated into the model to support biological interpretation, using visualization techniques such as heatmaps. The model was trained and validated using two datasets: chemical structures in SMILES format from PubChem and protein sequences of five CYP450 isoforms (1A2, 2C9, 2C19, 2D6, and 3A4) obtained from the Protein Data Bank (PDB). MEN achieved an average accuracy of 93.7% across all isoforms. The individual encoders performed with accuracies of 80.8% (FEN), 82.3% (GEN), and 81.5% (PEN). Additional performance results include an AUC of 98.5%, sensitivity of 95.9%, specificity of 97.2%, precision of 80.6%, F1-score of 83.4%, and a Matthews correlation coefficient (MCC) of 88.2%. All data and code are available at https://github.com/GracedAbena/MEN-Leveraging-Explainable-Multimodal-Encoding-Network.
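
For readers less familiar with the threshold-based metrics named in the abstract (accuracy, sensitivity, specificity, precision, F1-score, MCC), the following is a minimal sketch of their standard definitions, computed from confusion-matrix counts. The function name `binary_metrics` and the counts in the usage example are hypothetical and chosen only for illustration; they are not taken from the paper or its repository. AUC is omitted because it requires ranked prediction scores rather than counts.

```python
from math import sqrt


def safe_div(num: float, den: float) -> float:
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard binary-classification metrics from confusion-matrix counts.

    Here a "positive" is assumed to mean "inhibitor" (an illustrative choice).
    """
    sensitivity = safe_div(tp, tp + fn)   # recall: true positives found
    specificity = safe_div(tn, tn + fp)   # true negatives found
    precision = safe_div(tp, tp + fp)     # predicted positives that are correct
    accuracy = safe_div(tp + tn, tp + fp + tn + fn)
    f1 = safe_div(2 * precision * sensitivity, precision + sensitivity)
    # Matthews correlation coefficient: uses all four cells of the matrix.
    mcc = safe_div(
        tp * tn - fp * fn,
        sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)),
    )
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical counts for one imbalanced test set (not data from the paper).
metrics = binary_metrics(tp=80, fp=20, tn=880, fn=20)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In this made-up example accuracy is 0.96 while precision and F1 are 0.80, which illustrates how, on an imbalanced test set, accuracy can sit well above precision and F1.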
