一种用于预测P-糖蛋白底物和抑制剂的多模态对比学习框架。

A multimodal contrastive learning framework for predicting P-glycoprotein substrates and inhibitors.

作者信息

Zhang Yixue, Wu Jialu, Kang Yu, Hou Tingjun

机构信息

College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.

Polytechnic Institute of Zhejiang University, Zhejiang University, Hangzhou, 310015, China.

出版信息

J Pharm Anal. 2025 Aug;15(8):101313. doi: 10.1016/j.jpha.2025.101313. Epub 2025 Apr 16.

DOI:10.1016/j.jpha.2025.101313

PMID:40919587

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12409376/

Abstract

P-glycoprotein (P-gp) is a transmembrane protein widely involved in the absorption, distribution, metabolism, excretion, and toxicity (ADMET) of drugs within the human body. Accurate prediction of P-gp inhibitors and substrates is crucial for drug discovery and toxicological assessment. However, existing models rely on limited molecular information, leading to suboptimal model performance for predicting P-gp inhibitors and substrates. To overcome this challenge, we compiled an extensive dataset from public databases and literature, consisting of 5,943 P-gp inhibitors and 4,018 substrates, notable for their high quantity, quality, and structural uniqueness. In addition, we curated two external test sets to validate the model's generalization capability. Subsequently, we developed a multimodal graph contrastive learning (GCL) model for the prediction of P-gp inhibitors and substrates (MC-PGP). This framework integrates three types of features from Simplified Molecular Input Line Entry System (SMILES) sequences, molecular fingerprints, and molecular graphs using an attention-based fusion strategy to generate a unified molecular representation. Furthermore, we employed a GCL approach to enhance structural representations by aligning local and global structures. Extensive experimental results highlight the superior performance of MC-PGP, which achieves improvements in the area under the curve of receiver operating characteristic (AUC-ROC) of 9.82% and 10.62% on the external P-gp inhibitor and external P-gp substrate datasets, respectively, compared with 12 state-of-the-art methods. Furthermore, the interpretability analysis of all three molecular feature types offers comprehensive and complementary insights, demonstrating that MC-PGP effectively identifies key functional groups involved in P-gp interactions. These chemically intuitive insights provide valuable guidance for the design and optimization of drug candidates.

摘要

P-糖蛋白（P-gp）是一种跨膜蛋白，广泛参与人体内药物的吸收、分布、代谢、排泄和毒性（ADMET）过程。准确预测P-gp抑制剂和底物对于药物发现和毒理学评估至关重要。然而，现有模型依赖有限的分子信息，导致在预测P-gp抑制剂和底物时模型性能欠佳。为克服这一挑战，我们从公共数据库和文献中汇编了一个广泛的数据集，其中包括5943种P-gp抑制剂和4018种底物，其数量、质量和结构独特性都很显著。此外，我们策划了两个外部测试集来验证模型的泛化能力。随后，我们开发了一种用于预测P-gp抑制剂和底物的多模态图对比学习（GCL）模型（MC-PGP）。该框架使用基于注意力的融合策略整合来自简化分子输入线性条目系统（SMILES）序列、分子指纹和分子图的三种类型特征，以生成统一的分子表示。此外，我们采用GCL方法通过对齐局部和全局结构来增强结构表示。大量实验结果突出了MC-PGP的卓越性能，与12种最先进的方法相比，它在外部P-gp抑制剂数据集和外部P-gp底物数据集上分别实现了受试者操作特征曲线下面积（AUC-ROC）提高9.82%和10.62%。此外，对所有三种分子特征类型的可解释性分析提供了全面且互补的见解，表明MC-PGP有效地识别了参与P-gp相互作用的关键官能团。这些具有化学直观性的见解为药物候选物的设计和优化提供了有价值的指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36bd/12409376/efa0b2656c73/ga1.jpg

相似文献

A multimodal contrastive learning framework for predicting P-glycoprotein substrates and inhibitors.一种用于预测P-糖蛋白底物和抑制剂的多模态对比学习框架。

J Pharm Anal. 2025 Aug;15(8):101313. doi: 10.1016/j.jpha.2025.101313. Epub 2025 Apr 16.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet：一种用于可解释抗癌肽识别的双池因果扩张卷积网络。

Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

MEN: leveraging explainable multimodal encoding network for precision prediction of CYP450 inhibitors.MEN：利用可解释的多模态编码网络进行CYP450抑制剂的精准预测

Sci Rep. 2025 Jul 1;15(1):21820. doi: 10.1038/s41598-025-04982-6.

EPI-DynFusion: enhancer-promoter interaction prediction model based on sequence features and dynamic fusion mechanisms.EPI-DynFusion：基于序列特征和动态融合机制的增强子-启动子相互作用预测模型。

Front Genet. 2025 Jul 23;16:1614222. doi: 10.3389/fgene.2025.1614222. eCollection 2025.

Short-Term Memory Impairment短期记忆障碍

Development and Validation of a Brain Aging Biomarker in Middle-Aged and Older Adults: Deep Learning Approach.中老年人群脑衰老生物标志物的开发与验证：深度学习方法

JMIR Aging. 2025 Aug 1;8:e73004. doi: 10.2196/73004.

Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果：一种针对特定个体见解的新型验证方法。

Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

本文引用的文献

Molecular property prediction based on graph structure learning.基于图结构学习的分子性质预测。

Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae304.

MS-BACL: enhancing metabolic stability prediction through bond graph augmentation and contrastive learning.MS-BACL：通过键合图增强和对比学习提高代谢稳定性预测。

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae127.

Recent developments of P-glycoprotein inhibitors and its structure-activity relationship (SAR) studies.近年来 P-糖蛋白抑制剂的发展及其构效关系（SAR）研究。

Bioorg Chem. 2024 Feb;143:106997. doi: 10.1016/j.bioorg.2023.106997. Epub 2023 Nov 25.

A systematic study of key elements underlying molecular property prediction.对分子性质预测背后关键要素的系统研究。

Nat Commun. 2023 Oct 13;14(1):6395. doi: 10.1038/s41467-023-41948-6.

Recent Studies of Artificial Intelligence on Drug Absorption.人工智能在药物吸收方面的最新研究。

J Chem Inf Model. 2023 Oct 23;63(20):6198-6211. doi: 10.1021/acs.jcim.3c00960. Epub 2023 Oct 11.

CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning.CMMS-GCL：基于图对比学习的跨模态代谢稳定性预测。

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad503.

Pharmacophoric-constrained heterogeneous graph transformer model for molecular property prediction.用于分子性质预测的药效团约束异构图变换器模型

Commun Chem. 2023 Apr 3;6(1):60. doi: 10.1038/s42004-023-00857-x.

A Perspective on Explanations of Molecular Prediction Models.分子预测模型解释的透视。

J Chem Theory Comput. 2023 Apr 25;19(8):2149-2160. doi: 10.1021/acs.jctc.2c01235. Epub 2023 Mar 27.

HiGNN: A Hierarchical Informative Graph Neural Network for Molecular Property Prediction Equipped with Feature-Wise Attention.HiGNN：一种配备特征级注意力机制的用于分子性质预测的分层信息图神经网络。

J Chem Inf Model. 2023 Jan 9;63(1):43-55. doi: 10.1021/acs.jcim.2c01099. Epub 2022 Dec 14.

Deep learning methods for molecular representation and property prediction.深度学习方法在分子表示和性质预测中的应用。

Drug Discov Today. 2022 Dec;27(12):103373. doi: 10.1016/j.drudis.2022.103373. Epub 2022 Sep 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于预测P-糖蛋白底物和抑制剂的多模态对比学习框架。

A multimodal contrastive learning framework for predicting P-glycoprotein substrates and inhibitors.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献