• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Top-DTI:整合拓扑深度学习与大语言模型用于药物靶点相互作用预测

Top-DTI: Integrating Topological Deep Learning and Large Language Models for Drug Target Interaction Prediction.

作者信息

Talo Muhammed, Bozdag Serdar

机构信息

Department of Computer Science and Engineering, University of North Texas, Denton, TX 76207, USA.

BioDiscovery Institute, University of North Texas, Denton, TX 76207, USA.

出版信息

bioRxiv. 2025 Feb 8:2025.02.07.637146. doi: 10.1101/2025.02.07.637146.

DOI:10.1101/2025.02.07.637146
PMID:39975019
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11839103/
Abstract

MOTIVATION

The accurate prediction of drug-target interactions (DTI) is a crucial step in drug discovery, providing a foundation for identifying novel therapeutics. Traditional drug development is both costly and time-consuming, often spanning over a decade. Computational approaches help narrow the pool of compound candidates, offering significant starting points for experimental validation. In this study, we propose Top-DTI framework for predicting DTI by integrating topological data analysis (TDA) with large language models (LLMs). Top-DTI leverages persistent homology to extract topological features from protein contact maps and drug molecular images. Simultaneously, protein and drug LLMs generate semantically rich embeddings that capture sequential and contextual information from protein sequences and drug SMILES strings. By combining these complementary features, Top-DTI enhances predictive performance and robustness.

RESULTS

Experimental results on the public BioSNAP and Human DTI benchmark datasets demonstrate that the proposed Top-DTI model outperforms state-of-the-art approaches across multiple evaluation metrics, including AUROC, AUPRC, sensitivity, and specificity. Furthermore, the Top-DTI model achieves superior performance in the challenging cold-split scenario, where the test and validation sets contain drugs or targets absent from the training set. This setting simulates real-world scenarios and highlights the robustness of the model. Notably, incorporating topological features alongside LLM embeddings significantly improves predictive performance, underscoring the value of integrating structural and sequence-based representations.

AVAILABILITY

The data and source code of Top-DTI is available at https://github.com/bozdaglab/Top_DTI under Creative Commons Attribution Non Commercial 4.0 International Public License.

摘要

动机

准确预测药物-靶点相互作用(DTI)是药物发现中的关键一步,为识别新型治疗方法奠定基础。传统药物开发成本高且耗时,通常跨越十多年。计算方法有助于缩小化合物候选范围,为实验验证提供重要起点。在本研究中,我们提出了Top-DTI框架,通过将拓扑数据分析(TDA)与大语言模型(LLM)相结合来预测DTI。Top-DTI利用持久同调从蛋白质接触图和药物分子图像中提取拓扑特征。同时,蛋白质和药物大语言模型生成语义丰富的嵌入,捕捉来自蛋白质序列和药物SMILES字符串的序列和上下文信息。通过结合这些互补特征,Top-DTI提高了预测性能和稳健性。

结果

在公共BioSNAP和人类DTI基准数据集上的实验结果表明,所提出的Top-DTI模型在包括AUROC、AUPRC、敏感性和特异性在内的多个评估指标上优于现有方法。此外,Top-DTI模型在具有挑战性的冷分割场景中表现出色,其中测试集和验证集包含训练集中不存在的药物或靶点。此设置模拟了现实世界的场景,突出了模型的稳健性。值得注意的是,将拓扑特征与大语言模型嵌入相结合显著提高了预测性能,强调了整合基于结构和序列的表示的价值。

可用性

Top-DTI的数据和源代码可在https://github.com/bozdaglab/Top_DTI上获取,遵循知识共享署名非商业性4.0国际公共许可证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/72019108d13d/nihpp-2025.02.07.637146v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/25eee87e01b6/nihpp-2025.02.07.637146v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/380311d6586e/nihpp-2025.02.07.637146v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/72019108d13d/nihpp-2025.02.07.637146v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/25eee87e01b6/nihpp-2025.02.07.637146v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/380311d6586e/nihpp-2025.02.07.637146v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04f4/11839103/72019108d13d/nihpp-2025.02.07.637146v1-f0003.jpg

相似文献

1
Top-DTI: Integrating Topological Deep Learning and Large Language Models for Drug Target Interaction Prediction.Top-DTI:整合拓扑深度学习与大语言模型用于药物靶点相互作用预测
bioRxiv. 2025 Feb 8:2025.02.07.637146. doi: 10.1101/2025.02.07.637146.
2
Top-DTI: integrating topological deep learning and large language models for drug-target interaction prediction.Top-DTI:整合拓扑深度学习和大语言模型用于药物-靶点相互作用预测
Bioinformatics. 2025 Jul 1;41(Supplement_1):i133-i141. doi: 10.1093/bioinformatics/btaf183.
3
Use of Large Language Models to Classify Epidemiological Characteristics in Synthetic and Real-World Social Media Posts About Conjunctivitis Outbreaks: Infodemiology Study.利用大语言模型对合成及真实世界社交媒体上有关结膜炎爆发的帖子中的流行病学特征进行分类:信息流行病学研究
J Med Internet Res. 2025 Jul 2;27:e65226. doi: 10.2196/65226.
4
SS-DTI: A deep learning method integrating semantic and structural information for drug-target interaction prediction.SS-DTI:一种整合语义和结构信息用于药物-靶点相互作用预测的深度学习方法。
J Bioinform Comput Biol. 2025 Feb;23(1):2550002. doi: 10.1142/S0219720025500027. Epub 2025 Mar 25.
5
Improving drug-drug interaction prediction via in-context learning and judging with large language models.通过上下文学习和大语言模型判断来改善药物相互作用预测
Front Pharmacol. 2025 Jun 2;16:1589788. doi: 10.3389/fphar.2025.1589788. eCollection 2025.
6
Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction.用于人格特质预测的大语言模型嵌入的心理测量评估
J Med Internet Res. 2025 Jul 8;27:e75347. doi: 10.2196/75347.
7
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
8
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
9
Predicting Affinity Through Homology (PATH): Interpretable Binding Affinity Prediction with Persistent Homology.通过同源性预测亲和力(PATH):基于持久同源性的可解释结合亲和力预测
bioRxiv. 2024 Oct 21:2023.11.16.567384. doi: 10.1101/2023.11.16.567384.
10
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

本文引用的文献

1
Accurate and transferable drug-target interaction prediction with DrugLAMP.使用DrugLAMP进行准确且可转移的药物-靶点相互作用预测。
Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae693.
2
MGNDTI: A Drug-Target Interaction Prediction Framework Based on Multimodal Representation Learning and the Gating Mechanism.MGNDTI:一种基于多模态表示学习和门控机制的药物-靶标相互作用预测框架。
J Chem Inf Model. 2024 Aug 26;64(16):6684-6698. doi: 10.1021/acs.jcim.4c00957. Epub 2024 Aug 13.
3
GSL-DTI: Graph structure learning network for Drug-Target interaction prediction.
GSL-DTI:用于药物-靶标相互作用预测的图结构学习网络。
Methods. 2024 Mar;223:136-145. doi: 10.1016/j.ymeth.2024.01.018. Epub 2024 Feb 14.
4
DLM-DTI: a dual language model for the prediction of drug-target interaction with hint-based learning.DLM-DTI:一种基于提示学习的药物-靶点相互作用预测双语模型。
J Cheminform. 2024 Feb 1;16(1):14. doi: 10.1186/s13321-024-00808-1.
5
Contrastive learning in protein language space predicts interactions between drugs and protein targets.蛋白质语言空间中的对比学习可预测药物与蛋白质靶标之间的相互作用。
Proc Natl Acad Sci U S A. 2023 Jun 13;120(24):e2220778120. doi: 10.1073/pnas.2220778120. Epub 2023 Jun 8.
6
Genomics data analysis via spectral shape and topology.基于谱形状和拓扑的基因组数据分析。
PLoS One. 2023 Apr 26;18(4):e0284820. doi: 10.1371/journal.pone.0284820. eCollection 2023.
7
CoaDTI: multi-modal co-attention based framework for drug-target interaction annotation.CoaDTI:基于多模态协同注意力的药物-靶标相互作用标注框架。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac446.
8
Fine-tuning of BERT Model to Accurately Predict Drug-Target Interactions.微调BERT模型以准确预测药物-靶点相互作用。
Pharmaceutics. 2022 Aug 16;14(8):1710. doi: 10.3390/pharmaceutics14081710.
9
HyperAttentionDTI: improving drug-protein interaction prediction by sequence-based deep learning with attention mechanism.超注意力 DTI:基于注意力机制的序列深度学习提高药物-蛋白相互作用预测
Bioinformatics. 2022 Jan 12;38(3):655-662. doi: 10.1093/bioinformatics/btab715.
10
How Much Does It Cost to Research and Develop a New Drug? A Systematic Review and Assessment.研究和开发一种新药需要多少钱?系统评价和评估。
Pharmacoeconomics. 2021 Nov;39(11):1243-1269. doi: 10.1007/s40273-021-01065-y. Epub 2021 Aug 9.