• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

神经符号化二肽基肽酶4定量构效关系:一种用于糖尿病治疗中高效二肽基肽酶4抑制剂发现的神经符号人工智能方法。

NeSyDPP4-QSAR: A Neuro-Symbolic AI Approach for Potent DPP-4-Inhibitor Discovery in Diabetes Treatment.

作者信息

Hossain Delower, Saghapour Ehsan, Chen Jake Y

出版信息

bioRxiv. 2025 Apr 5:2025.03.31.646336. doi: 10.1101/2025.03.31.646336.

DOI:10.1101/2025.03.31.646336
PMID:40236174
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11996372/
Abstract

Diabetes Mellitus (DM) is a global epidemic and among the top ten leading causes of mortality (WHO, 2019), projected to rank seventh by 2030. The US National Diabetes Statistics Report (2021) states that 38.4 million Americans have diabetes. Dipeptidyl Peptidase-4 (DPP-4) is an FDA-approved target for type 2 diabetes mellitus (T2DM) treatment. However, current DPP-4 inhibitors are associated with adverse effects, including gastrointestinal issues, severe joint pain (FDA safety warning), nasopharyngitis, hypersensitivity, and nausea. Identifying novel inhibitors is crucial. Direct in vivo DPP-4 inhibition assessment is costly and impractical, making in silico IC50 prediction a viable alternative. Quantitative Structure-Activity Relationship (QSAR) modeling is a widely used computational approach for chemical substance assessment. We employ LTN, a neuro-symbolic approach, alongside DNN and transformers as baselines. DPP-4-related data is sourced from PubChem, ChEMBL, BindingDB, and GTP, comprising 6,563 bioactivity records (SMILES-based compounds with IC50 values) after deduplication and thresholding. A diverse set of features including descriptors (CDK Extended-PaDEL), fingerprints (Morgan), chemical language model embeddings (ChemBERTa2), LLaMa 3.2, and physicochemical properties is used to train the NeSyDPP4-QSAR model. The NeSyDPP4-QSAR model yielded the highest accuracy, incorporating CDKextended and Morgan fingerprints, with an accuracy of 0.9725, an F1-score of 0.9723, an ROC AUC of 0.9719, and an MCC of 0.9446. The performance was benchmarked against two standard baseline models: a deep neural network and a transformer. To ensure fair comparisons, DNN models used the equivalent attributes with the same dimension and network configuration as NeSyDPP4-QSAR. Our findings showed that integrating the Neuro-symbolic strategy (neural network-based learning and symbolic reasoning) holds immense potential for discovering drugs that can inhibit diabetes mellitus and classifying biological activities that inhibit it.

摘要

糖尿病(DM)是一种全球性流行病,是十大主要死因之一(世界卫生组织,2019年),预计到2030年将升至第七位。美国国家糖尿病统计报告(2021年)指出,有3840万美国人患有糖尿病。二肽基肽酶-4(DPP-4)是美国食品药品监督管理局(FDA)批准的用于2型糖尿病(T2DM)治疗的靶点。然而,目前的DPP-4抑制剂存在不良反应,包括胃肠道问题、严重关节疼痛(FDA安全警告)、鼻咽炎、过敏反应和恶心。因此,寻找新型抑制剂至关重要。直接在体内评估DPP-4抑制作用成本高昂且不切实际,使得通过计算机模拟预测IC50成为一种可行的替代方法。定量构效关系(QSAR)建模是一种广泛应用于化学物质评估的计算方法。我们采用基于神经符号的方法LTN,并将深度神经网络(DNN)和变压器模型作为基线进行比较。DPP-4相关数据来源于PubChem、ChEMBL、BindingDB和GTP,经过去重和阈值处理后,包含6563条生物活性记录(基于SMILES的化合物及其IC50值)。我们使用了一系列不同的特征,包括描述符(CDK Extended-PaDEL)、指纹(Morgan)、化学语言模型嵌入(ChemBERTa2)、LLaMa 3.2以及物理化学性质,来训练NeSyDPP4-QSAR模型。NeSyDPP4-QSAR模型结合CDKextended和Morgan指纹,获得了最高的准确率,其准确率为0.9725,F1分数为0.9723,ROC曲线下面积(ROC AUC)为0.9719,马修斯相关系数(MCC)为0.9446。该模型的性能以两个标准基线模型为基准进行评估:深度神经网络和变压器模型。为确保公平比较,DNN模型使用了与NeSyDPP4-QSAR相同维度和网络配置的等效属性。我们的研究结果表明,整合神经符号策略(基于神经网络的学习和符号推理)在发现能够抑制糖尿病的药物以及对抑制糖尿病的生物活性进行分类方面具有巨大潜力。

相似文献

1
NeSyDPP4-QSAR: A Neuro-Symbolic AI Approach for Potent DPP-4-Inhibitor Discovery in Diabetes Treatment.神经符号化二肽基肽酶4定量构效关系:一种用于糖尿病治疗中高效二肽基肽酶4抑制剂发现的神经符号人工智能方法。
bioRxiv. 2025 Apr 5:2025.03.31.646336. doi: 10.1101/2025.03.31.646336.
2
NeSyDPP-4: discovering DPP-4 inhibitors for diabetes treatment with a neuro-symbolic AI approach.神经符号化人工智能驱动的糖尿病治疗用二肽基肽酶-4抑制剂发现(NeSyDPP-4):利用神经符号化人工智能方法发现用于糖尿病治疗的二肽基肽酶-4抑制剂
Front Bioinform. 2025 Jul 21;5:1603133. doi: 10.3389/fbinf.2025.1603133. eCollection 2025.
3
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
4
Dipeptidyl-peptidase (DPP)-4 inhibitors and glucagon-like peptide (GLP)-1 analogues for prevention or delay of type 2 diabetes mellitus and its associated complications in people at increased risk for the development of type 2 diabetes mellitus.二肽基肽酶(DPP)-4抑制剂和胰高血糖素样肽(GLP)-1类似物用于预防或延缓2型糖尿病高危人群发生2型糖尿病及其相关并发症。
Cochrane Database Syst Rev. 2017 May 10;5(5):CD012204. doi: 10.1002/14651858.CD012204.pub2.
5
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
6
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.
7
Short-Term Memory Impairment短期记忆障碍
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
9
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
10
Newer agents for blood glucose control in type 2 diabetes: systematic review and economic evaluation.新型 2 型糖尿病血糖控制药物:系统评价和经济评估。
Health Technol Assess. 2010 Jul;14(36):1-248. doi: 10.3310/hta14360.