• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

IDR解码器:一种针对内在无序区域进行合理药物发现的机器学习方法。

IDRdecoder: a machine learning approach for rational drug discovery toward intrinsically disordered regions.

作者信息

Shionyu-Mitusyama Clara, Ohmori Satoshi, Hirata Subaru, Ishida Hirokazu, Shirai Tsuyoshi

机构信息

Department of Bioscience, Nagahama Institute of Bio-Science and Technology, Nagahama, Shiga, Japan.

Faculty of Data Science, Shiga University 1-1-1 Banba, Hikone, Shiga, Japan.

出版信息

Front Bioinform. 2025 Jul 18;5:1627836. doi: 10.3389/fbinf.2025.1627836. eCollection 2025.

DOI:10.3389/fbinf.2025.1627836
PMID:40756902
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12313641/
Abstract

INTRODUCTION

Intrinsically disordered regions (IDRs) of proteins have traditionally been overlooked as drug targets. However, with growing recognition of their crucial role in biological activity and their involvement in various diseases, IDRs have emerged as promising targets for drug discovery. Despite this potential, rational methodologies for IDR-targeted drug discovery remain underdeveloped, primarily due to a lack of reference experimental data.

METHODS

This study explores a machine learning approach to predict IDR functions, drug interaction sites, and interacting molecular substructures within IDR sequences. To address the data gap, stepwise transfer learning was employed. IDRdecoder sequentially generate predictions for IDR classification, interaction sites, and interacting ligand substructures. In the first step, the neural net was trained as autoencoder by using 26,480,862 predicted IDR sequences. Then it was trained against 57,692 ligand-binding PDB sequences with higher IDR tendency via transfer learning for predict ligand interacting sites and ligand types.

RESULTS

IDRdecoder was evaluated against 9 IDR sequences, which were experimentally detailed as drug targets. In the encoding space, specific GO terms related to the hypothesized functions of the evaluation IDR sequences were highly enriched. The model's prediction performance for drug interacting sites and ligand types demonstrated the area under the curve (AUC) of 0.616 and 0.702, respectively. The performance was compared with existing methods including ProteinBERT, and IDRdecoder demonstrated moderately improved performance.

DISCUSSION

IDRdecoder is the first application for predicting drug interaction sites and ligands in IDR sequences. Analysis of the prediction results revealed characteristics beneficial for IDR-drug design; for instance, Tyr and Ala are preferred target sites, while flexible substructures, such as alkyl groups, are favored in ligand molecules.

摘要

引言

蛋白质的内在无序区域(IDR)传统上一直被忽视作为药物靶点。然而,随着人们越来越认识到它们在生物活性中的关键作用以及它们与各种疾病的关联,IDR已成为药物发现的有希望的靶点。尽管有这种潜力,但针对IDR的药物发现的合理方法仍然不够发达,主要是由于缺乏参考实验数据。

方法

本研究探索了一种机器学习方法来预测IDR功能、药物相互作用位点以及IDR序列内的相互作用分子亚结构。为了解决数据缺口,采用了逐步迁移学习。IDRdecoder依次对IDR分类、相互作用位点和相互作用配体亚结构进行预测。在第一步中,通过使用26,480,862个预测的IDR序列将神经网络训练为自动编码器。然后通过迁移学习针对具有更高IDR倾向的57,692个配体结合PDB序列对其进行训练,以预测配体相互作用位点和配体类型。

结果

针对9个作为药物靶点进行了实验详细研究的IDR序列对IDRdecoder进行了评估。在编码空间中,与评估IDR序列的假设功能相关的特定基因本体(GO)术语高度富集。该模型对药物相互作用位点和配体类型的预测性能分别显示曲线下面积(AUC)为0.616和0.702。将该性能与包括ProteinBERT在内的现有方法进行了比较,IDRdecoder表现出适度的性能提升。

讨论

IDRdecoder是预测IDR序列中药物相互作用位点和配体的首个应用。对预测结果的分析揭示了对IDR药物设计有益的特征;例如,酪氨酸(Tyr)和丙氨酸(Ala)是优选的靶点位点,而配体分子中倾向于柔性亚结构,如烷基。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/be15e54bcced/fbinf-05-1627836-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/edad682686ab/fbinf-05-1627836-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/de94405da06e/fbinf-05-1627836-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/9142da59904a/fbinf-05-1627836-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/be15e54bcced/fbinf-05-1627836-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/edad682686ab/fbinf-05-1627836-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/de94405da06e/fbinf-05-1627836-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/9142da59904a/fbinf-05-1627836-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/49ec/12313641/be15e54bcced/fbinf-05-1627836-g004.jpg

相似文献

1
IDRdecoder: a machine learning approach for rational drug discovery toward intrinsically disordered regions.IDR解码器:一种针对内在无序区域进行合理药物发现的机器学习方法。
Front Bioinform. 2025 Jul 18;5:1627836. doi: 10.3389/fbinf.2025.1627836. eCollection 2025.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
4
Structural dynamics of IDR interactions in human SFPQ and implications for liquid-liquid phase separation.人类SFPQ中IDR相互作用的结构动力学及其对液-液相分离的影响
Acta Crystallogr D Struct Biol. 2025 Jul 1;81(Pt 7):357-379. doi: 10.1107/S2059798325005303. Epub 2025 Jun 27.
5
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
6
Short-Term Memory Impairment短期记忆障碍
7
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
8
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
9
Sexual Harassment and Prevention Training性骚扰与预防培训
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Interplay of α-synuclein with Lipid Membranes: Cooperative Adsorption, Membrane Remodeling and Coaggregation.α-突触核蛋白与脂质膜的相互作用:协同吸附、膜重塑和共聚集
JACS Au. 2024 Mar 19;4(4):1250-1262. doi: 10.1021/jacsau.3c00579. eCollection 2024 Apr 22.
2
Dynamic action of an intrinsically disordered protein in DNA compaction that induces mycobacterial dormancy.一种固有无序蛋白在 DNA 紧缩中的动态作用,诱导分枝杆菌休眠。
Nucleic Acids Res. 2024 Jan 25;52(2):816-830. doi: 10.1093/nar/gkad1149.
3
Hormone-induced enhancer assembly requires an optimal level of hormone receptor multivalent interactions.
激素诱导的增强子组装需要激素受体多价相互作用的最佳水平。
Mol Cell. 2023 Oct 5;83(19):3438-3456.e12. doi: 10.1016/j.molcel.2023.08.027. Epub 2023 Sep 21.
4
Fuzzy Drug Targets: Disordered Proteins in the Drug-Discovery Realm.模糊的药物靶点:药物研发领域中的无序蛋白质
ACS Omega. 2023 Mar 8;8(11):9729-9747. doi: 10.1021/acsomega.2c07708. eCollection 2023 Mar 21.
5
Bacteria require phase separation for fitness in the mammalian gut.细菌在哺乳动物肠道中需要相分离才能适应。
Science. 2023 Mar 17;379(6637):1149-1156. doi: 10.1126/science.abn7229. Epub 2023 Mar 16.
6
The Gene Ontology knowledgebase in 2023.2023 版基因本体论知识库。
Genetics. 2023 May 4;224(1). doi: 10.1093/genetics/iyad031.
7
Computational prediction of disordered binding regions.无序结合区域的计算预测
Comput Struct Biotechnol J. 2023 Feb 10;21:1487-1497. doi: 10.1016/j.csbj.2023.02.018. eCollection 2023.
8
Aggregation of Disordered Proteins Associated with Neurodegeneration.与神经退行性疾病相关的无序蛋白质聚集。
Int J Mol Sci. 2023 Feb 8;24(4):3380. doi: 10.3390/ijms24043380.
9
Prediction of protein-protein interaction sites in intrinsically disordered proteins.内在无序蛋白质中蛋白质-蛋白质相互作用位点的预测
Front Mol Biosci. 2022 Sep 30;9:985022. doi: 10.3389/fmolb.2022.985022. eCollection 2022.
10
Drugging p53 in cancer: one protein, many targets.在癌症中靶向 p53:一种蛋白,多个靶点。
Nat Rev Drug Discov. 2023 Feb;22(2):127-144. doi: 10.1038/s41573-022-00571-8. Epub 2022 Oct 10.