• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

神经符号化人工智能驱动的糖尿病治疗用二肽基肽酶-4抑制剂发现(NeSyDPP-4):利用神经符号化人工智能方法发现用于糖尿病治疗的二肽基肽酶-4抑制剂

NeSyDPP-4: discovering DPP-4 inhibitors for diabetes treatment with a neuro-symbolic AI approach.

作者信息

Hossain Delower, Saghapour Ehsan, Chen Jake Y

机构信息

Department of Computer Science, The University of Alabama at Birmingham, Birmingham, AL, United States.

System Pharmacology and AI Research Center (SPARC), The University of Alabama at Birmingham, Birmingham, AL, United States.

出版信息

Front Bioinform. 2025 Jul 21;5:1603133. doi: 10.3389/fbinf.2025.1603133. eCollection 2025.

DOI:10.3389/fbinf.2025.1603133
PMID:40761758
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12319772/
Abstract

INTRODUCTION

Diabetes Mellitus (DM) constitutes a global epidemic and is one of the top ten leading causes of mortality (WHO, 2019), projected to rank seventh by 2030. The US National Diabetes Statistics Report (2021) states that 38.4 million Americans have diabetes. Dipeptidyl Peptidase-4 (DPP-4) is an FDA-approved target for the treatment of type 2 diabetes mellitus (T2DM). However, current DPP-4 inhibitors may cause adverse effects, including gastrointestinal issues, severe joint pain (FDA safety warning), nasopharyngitis, hypersensitivity, and nausea. Moreover, the development of novel drugs and the assessment of DPP-4 inhibition are both costly and often impractical. These challenges highlight the urgent need for efficient approaches to facilitate the discovery and optimization of safer and more effective DPP-4 inhibitors.

METHODOLOGY

Quantitative Structure-Activity Relationship (QSAR) modeling is a widely used computational approach for evaluating the properties of chemical substances. In this study, we employed a Neuro-symbolic (NeSy) approach, specifically the Logic Tensor Network (LTN), to develop a DPP-4 QSAR model capable of identifying potential small-molecule inhibitors and predicting bioactivity classification. For comparison, we also implemented baseline models using Deep Neural Networks (DNNs) and Transformers. A total of 6,563 bioactivity records (SMILES-based compounds with IC values) were collected from ChEMBL, PubChem, BindingDB, and GTP. Feature sets used for model training included descriptors (CDK Extended-PaDEL), fingerprints (Morgan), chemical language model embeddings (ChemBERTa-2), LLaMa 3.2 embedding features, and physicochemical properties.

RESULTS

Among all tested configurations, the Neuro-symbolic QSAR model (NeSyDPP-4) performed best using a combination of CDK extended and Morgan fingerprints. The model achieved an accuracy of 0.9725, an F1-score of 0.9723, an ROC AUC of 0.9719, and a Matthews correlation coefficient (MCC) of 0.9446. These results outperformed the baseline DNN and Transformer models, as well as existing state-of-the-art (SOTA) methods. To further validate the robustness of the model, we conducted an external evaluation using the Drug Target Common (DTC) dataset, where NeSyDPP-4 also demonstrated strong performance, with an accuracy of 0.9579, an AUC-ROC of 0.9565, a Matthews Correlation Coefficient (MCC) of 0.9171, and an F1-score of 0.9577.

DISCUSSION

These findings suggest that the NeSyDPP-4 model not only delivered high predictive performance but also demonstrated generalizability to external datasets. This approach presents a cost-effective and reliable alternative to traditional vivo screening, offering valuable support for the identification and classification of biologically active DPP-4 inhibitors in the treatment of type 2 diabetes mellitus (T2DM).

摘要

引言

糖尿病(DM)是一种全球性流行病,是全球十大主要死因之一(世界卫生组织,2019年),预计到2030年将升至第七位。美国国家糖尿病统计报告(2021年)指出,有3840万美国人患有糖尿病。二肽基肽酶-4(DPP-4)是美国食品药品监督管理局(FDA)批准的用于治疗2型糖尿病(T2DM)的靶点。然而,目前的DPP-4抑制剂可能会引起不良反应,包括胃肠道问题、严重关节疼痛(FDA安全警告)、鼻咽炎、过敏反应和恶心。此外,新型药物的研发以及DPP-4抑制作用的评估成本高昂且往往不切实际。这些挑战凸显了迫切需要高效的方法来促进更安全、更有效的DPP-4抑制剂的发现和优化。

方法

定量构效关系(QSAR)建模是一种广泛应用于评估化学物质性质的计算方法。在本研究中,我们采用了一种神经符号(NeSy)方法,即逻辑张量网络(LTN),来开发一个能够识别潜在小分子抑制剂并预测生物活性分类的DPP-4 QSAR模型。为了进行比较,我们还使用深度神经网络(DNN)和Transformer实现了基线模型。总共从ChEMBL、PubChem、BindingDB和GTP收集了6563条生物活性记录(基于SMILES的化合物及其IC值)。用于模型训练的特征集包括描述符(CDK扩展PaDEL)、指纹(摩根指纹)、化学语言模型嵌入(ChemBERTa-2)、LLaMa 3.2嵌入特征和物理化学性质。

结果

在所有测试配置中,神经符号QSAR模型(NeSyDPP-4)使用CDK扩展指纹和摩根指纹的组合表现最佳。该模型的准确率为0.9725,F1分数为0.9723,ROC曲线下面积(AUC)为0.9719,马修斯相关系数(MCC)为0.9446。这些结果优于基线DNN和Transformer模型以及现有的最先进(SOTA)方法。为了进一步验证模型的稳健性,我们使用药物靶点通用(DTC)数据集进行了外部评估,其中NeSyDPP-4也表现出强大的性能,准确率为0.9579,AUC-ROC为0.9565,马修斯相关系数(MCC)为0.9171,F1分数为0.9577。

讨论

这些发现表明,NeSyDPP-4模型不仅具有较高的预测性能,而且对外部数据集具有通用性。这种方法为传统的体内筛选提供了一种经济高效且可靠的替代方案,为2型糖尿病(T2DM)治疗中生物活性DPP-4抑制剂的识别和分类提供了有价值的支持。

相似文献

1
NeSyDPP-4: discovering DPP-4 inhibitors for diabetes treatment with a neuro-symbolic AI approach.神经符号化人工智能驱动的糖尿病治疗用二肽基肽酶-4抑制剂发现(NeSyDPP-4):利用神经符号化人工智能方法发现用于糖尿病治疗的二肽基肽酶-4抑制剂
Front Bioinform. 2025 Jul 21;5:1603133. doi: 10.3389/fbinf.2025.1603133. eCollection 2025.
2
NeSyDPP4-QSAR: A Neuro-Symbolic AI Approach for Potent DPP-4-Inhibitor Discovery in Diabetes Treatment.神经符号化二肽基肽酶4定量构效关系:一种用于糖尿病治疗中高效二肽基肽酶4抑制剂发现的神经符号人工智能方法。
bioRxiv. 2025 Apr 5:2025.03.31.646336. doi: 10.1101/2025.03.31.646336.
3
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Dipeptidyl-peptidase (DPP)-4 inhibitors and glucagon-like peptide (GLP)-1 analogues for prevention or delay of type 2 diabetes mellitus and its associated complications in people at increased risk for the development of type 2 diabetes mellitus.二肽基肽酶(DPP)-4抑制剂和胰高血糖素样肽(GLP)-1类似物用于预防或延缓2型糖尿病高危人群发生2型糖尿病及其相关并发症。
Cochrane Database Syst Rev. 2017 May 10;5(5):CD012204. doi: 10.1002/14651858.CD012204.pub2.
6
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
10
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.

本文引用的文献

1
Towards Data-And Knowledge-Driven AI: A Survey on Neuro-Symbolic Computing.迈向数据与知识驱动的人工智能:神经符号计算综述
IEEE Trans Pattern Anal Mach Intell. 2025 Feb;47(2):878-899. doi: 10.1109/TPAMI.2024.3483273. Epub 2025 Jan 9.
2
StructuralDPPIV: a novel deep learning model based on atom structure for predicting dipeptidyl peptidase-IV inhibitory peptides.结构型 DPPIV:一种基于原子结构的新型深度学习模型,用于预测二肽基肽酶-IV 抑制肽。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae057.
3
A survey on neural-symbolic learning systems.
关于神经符号学习系统的调查。
Neural Netw. 2023 Sep;166:105-126. doi: 10.1016/j.neunet.2023.06.028. Epub 2023 Jul 6.
4
Virtual screening of dipeptidyl peptidase-4 inhibitors using quantitative structure-activity relationship-based artificial intelligence and molecular docking of hit compounds.基于定量构效关系的人工智能和命中化合物的分子对接对二肽基肽酶-4 抑制剂的虚拟筛选。
Comput Biol Chem. 2021 Dec;95:107597. doi: 10.1016/j.compbiolchem.2021.107597. Epub 2021 Oct 30.
5
A novel artificial intelligence protocol to investigate potential leads for diabetes mellitus.一种用于研究糖尿病潜在线索的新型人工智能协议。
Mol Divers. 2021 Aug;25(3):1375-1393. doi: 10.1007/s11030-021-10204-8. Epub 2021 Mar 9.
6
Adverse event profiles of dipeptidyl peptidase-4 inhibitors: data mining of the public version of the FDA adverse event reporting system.二肽基肽酶-4抑制剂的不良事件概况:美国食品药品监督管理局不良事件报告系统公开版本的数据挖掘
BMC Pharmacol Toxicol. 2020 Sep 16;21(1):68. doi: 10.1186/s40360-020-00447-w.
7
MultiPredGO: Deep Multi-Modal Protein Function Prediction by Amalgamating Protein Structure, Sequence, and Interaction Information.MultiPredGO:通过融合蛋白质结构、序列和相互作用信息进行深度多模态蛋白质功能预测。
IEEE J Biomed Health Inform. 2021 May;25(5):1832-1838. doi: 10.1109/JBHI.2020.3022806. Epub 2021 May 11.
8
DeepMiR2GO: Inferring Functions of Human MicroRNAs Using a Deep Multi-Label Classification Model.DeepMiR2GO:使用深度多标签分类模型推断人类 microRNA 的功能。
Int J Mol Sci. 2019 Nov 30;20(23):6046. doi: 10.3390/ijms20236046.
9
Predicting DPP-IV inhibitors with machine learning approaches.运用机器学习方法预测二肽基肽酶-IV抑制剂
J Comput Aided Mol Des. 2017 Apr;31(4):393-402. doi: 10.1007/s10822-017-0009-6. Epub 2017 Feb 2.
10
BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology.2015年的BindingDB:一个用于药物化学、计算化学和系统药理学的公共数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1045-53. doi: 10.1093/nar/gkv1072. Epub 2015 Oct 19.