• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过深度学习系统地预测降解信号和 E3 泛素连接酶结合。

Systematic prediction of degrons and E3 ubiquitin ligase binding via deep learning.

机构信息

Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China.

Key Laboratory for Neuroscience, Ministry of Education/National Health Commission of China, Peking University, Beijing, 100191, China.

出版信息

BMC Biol. 2022 Jul 14;20(1):162. doi: 10.1186/s12915-022-01364-6.

DOI:10.1186/s12915-022-01364-6
PMID:35836176
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9281121/
Abstract

BACKGROUND

Degrons are short linear motifs, bound by E3 ubiquitin ligase to target protein substrates to be degraded by the ubiquitin-proteasome system. Mutations leading to deregulation of degron functionality disrupt control of protein abundance due to mistargeting of proteins destined for degradation and often result in pathologies. Targeting degrons by small molecules also emerges as an exciting drug design strategy to upregulate the expression of specific proteins. Despite their essential function and disease targetability, reliable identification of degrons remains a conundrum. Here, we developed a deep learning-based model named Degpred that predicts general degrons directly from protein sequences.

RESULTS

We showed that the BERT-based model performed well in predicting degrons singly from protein sequences. Then, we used the deep learning model Degpred to predict degrons proteome-widely. Degpred successfully captured typical degron-related sequence properties and predicted degrons beyond those from motif-based methods which use a handful of E3 motifs to match possible degrons. Furthermore, we calculated E3 motifs using predicted degrons on the substrates in our collected E3-substrate interaction dataset and constructed a regulatory network of protein degradation by assigning predicted degrons to specific E3s with calculated motifs. Critically, we experimentally verified that a predicted SPOP binding degron on CBX6 prompts CBX6 degradation and mediates the interaction with SPOP. We also showed that the protein degradation regulatory system is important in tumorigenesis by surveying degron-related mutations in TCGA.

CONCLUSIONS

Degpred provides an efficient tool to proteome-wide prediction of degrons and binding E3s singly from protein sequences. Degpred successfully captures typical degron-related sequence properties and predicts degrons beyond those from previously used motif-based methods, thus greatly expanding the degron landscape, which should advance the understanding of protein degradation, and allow exploration of uncharacterized alterations of proteins in diseases. To make it easier for readers to access collected and predicted datasets, we integrated these data into the website http://degron.phasep.pro/ .

摘要

背景

降解肽是短的线性基序,与 E3 泛素连接酶结合,将靶蛋白底物靶向到泛素-蛋白酶体系统中降解。导致降解肽功能失调的突变会破坏蛋白质丰度的控制,因为蛋白质的靶向错误,这些蛋白质注定要降解,并且经常导致病理学。通过小分子靶向降解肽也成为一种令人兴奋的药物设计策略,以上调特定蛋白质的表达。尽管它们具有重要的功能和疾病靶向性,但可靠地识别降解肽仍然是一个难题。在这里,我们开发了一种基于深度学习的模型,命名为 Degpred,该模型可以直接从蛋白质序列中预测一般的降解肽。

结果

我们表明,基于 BERT 的模型在从蛋白质序列中单预测降解肽方面表现良好。然后,我们使用深度学习模型 Degpred 在蛋白质组范围内预测降解肽。Degpred 成功地捕获了典型的降解肽相关序列特性,并预测了那些基于基序的方法无法预测的降解肽,这些方法仅使用少数 E3 基序来匹配可能的降解肽。此外,我们在我们收集的 E3-底物相互作用数据集中,使用预测的降解肽在底物上计算 E3 基序,并通过将预测的降解肽分配给具有计算基序的特定 E3,构建蛋白质降解的调控网络。至关重要的是,我们通过在 TCGA 中调查降解肽相关突变,实验验证了预测的 SPOP 结合降解肽在 CBX6 上促使 CBX6 降解并介导与 SPOP 的相互作用。我们还表明,通过调查 TCGA 中的降解肽相关突变,蛋白质降解调控系统在肿瘤发生中很重要。

结论

Degpred 提供了一种从蛋白质序列中单预测降解肽和结合 E3 的有效工具。Degpred 成功地捕获了典型的降解肽相关序列特性,并预测了那些基于先前使用的基序方法无法预测的降解肽,从而大大扩展了降解肽景观,这应该有助于深入了解蛋白质降解,并允许探索疾病中未表征的蛋白质改变。为了方便读者访问收集和预测的数据集,我们将这些数据集成到网站 http://degron.phasep.pro/ 中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/cc6aee5253c0/12915_2022_1364_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/8f10f3fb105e/12915_2022_1364_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/e2a919426f9a/12915_2022_1364_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/b92d2f6582ba/12915_2022_1364_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/2e5aff5ae528/12915_2022_1364_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/7af09c1e22e9/12915_2022_1364_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/87395f10e3c4/12915_2022_1364_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/cc6aee5253c0/12915_2022_1364_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/8f10f3fb105e/12915_2022_1364_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/e2a919426f9a/12915_2022_1364_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/b92d2f6582ba/12915_2022_1364_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/2e5aff5ae528/12915_2022_1364_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/7af09c1e22e9/12915_2022_1364_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/87395f10e3c4/12915_2022_1364_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75bc/9281121/cc6aee5253c0/12915_2022_1364_Fig7_HTML.jpg

相似文献

1
Systematic prediction of degrons and E3 ubiquitin ligase binding via deep learning.通过深度学习系统地预测降解信号和 E3 泛素连接酶结合。
BMC Biol. 2022 Jul 14;20(1):162. doi: 10.1186/s12915-022-01364-6.
2
MetaDegron: multimodal feature-integrated protein language model for predicting E3 ligase targeted degrons.MetaDegron:用于预测 E3 连接酶靶向降解结构域的多模态特征集成蛋白质语言模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae519.
3
Elucidation of E3 ubiquitin ligase specificity through proteome-wide internal degron mapping.通过全蛋白质组内部降解信号区域作图阐明 E3 泛素连接酶的特异性。
Mol Cell. 2023 Sep 21;83(18):3377-3392.e6. doi: 10.1016/j.molcel.2023.08.022.
4
Principles of paralog-specific targeted protein degradation engaging the C-degron E3 KLHDC2.利用 C 降解基序 E3 KLHDC2 进行旁系同源特异性靶向蛋白降解的原理。
Nat Commun. 2024 Oct 12;15(1):8829. doi: 10.1038/s41467-024-52966-3.
5
Protein quality control degron-containing substrates are differentially targeted in the cytoplasm and nucleus by ubiquitin ligases.含蛋白质量控制 degron 的底物通过泛素连接酶在细胞质和细胞核中被靶向。
Genetics. 2021 Mar 3;217(1):1-19. doi: 10.1093/genetics/iyaa031.
6
DegronMD: Leveraging Evolutionary and Structural Features for Deciphering Protein-Targeted Degradation, Mutations, and Drug Response to Degrons.DegronMD:利用进化和结构特征来破译靶向蛋白降解、突变和 Degron 药物反应
Mol Biol Evol. 2023 Dec 1;40(12). doi: 10.1093/molbev/msad253.
7
Molecular basis for C-degron recognition by CRL2 ubiquitin ligase.CRL2 泛素连接酶识别 C-DEGRON 的分子基础。
Proc Natl Acad Sci U S A. 2023 Oct 24;120(43):e2308870120. doi: 10.1073/pnas.2308870120. Epub 2023 Oct 16.
8
DEGRONOPEDIA: a web server for proteome-wide inspection of degrons.DEGRONOPEDIA:一个用于蛋白质组范围内检查 degrons 的网络服务器。
Nucleic Acids Res. 2024 Jul 5;52(W1):W221-W232. doi: 10.1093/nar/gkae238.
9
The Eukaryotic Proteome Is Shaped by E3 Ubiquitin Ligases Targeting C-Terminal Degrons.真核生物蛋白质组由靶向 C 端降解结构域的 E3 泛素连接酶塑造。
Cell. 2018 Jun 14;173(7):1622-1635.e14. doi: 10.1016/j.cell.2018.04.028. Epub 2018 May 17.
10
How the ends signal the end: Regulation by E3 ubiquitin ligases recognizing protein termini.末端如何传递信号:E3 泛素连接酶识别蛋白末端的调控。
Mol Cell. 2022 Apr 21;82(8):1424-1438. doi: 10.1016/j.molcel.2022.02.004. Epub 2022 Mar 4.

引用本文的文献

1
Tissue-specific consequences of tag fusions on protein expression in transgenic mice.标签融合对转基因小鼠蛋白质表达的组织特异性影响。
PLoS Genet. 2025 Aug 25;21(8):e1011830. doi: 10.1371/journal.pgen.1011830. eCollection 2025 Aug.
2
A complete map of human cytosolic degrons and their relevance for disease.人类胞质降解子的完整图谱及其与疾病的关联。
bioRxiv. 2025 May 15:2025.05.10.653233. doi: 10.1101/2025.05.10.653233.
3
A hybrid machine learning model with attention mechanism and multidimensional multivariate feature coding for essential gene prediction.

本文引用的文献

1
Systematic analysis of alterations in the ubiquitin proteolysis system reveals its contribution to driver mutations in cancer.对泛素蛋白水解系统改变的系统分析揭示了其对癌症驱动突变的作用。
Nat Cancer. 2020 Jan;1(1):122-135. doi: 10.1038/s43018-019-0001-2. Epub 2019 Dec 2.
2
Genetic fusions favor tumorigenesis through degron loss in oncogenes.遗传融合通过癌基因去稳定域的丢失促进肿瘤发生。
Nat Commun. 2021 Nov 18;12(1):6704. doi: 10.1038/s41467-021-26871-y.
3
UbiBrowser 2.0: a comprehensive resource for proteome-wide known and predicted ubiquitin ligase/deubiquitinase-substrate interactions in eukaryotic species.
一种具有注意力机制和多维多变量特征编码的混合机器学习模型用于必需基因预测。
BMC Biol. 2025 Apr 24;23(1):108. doi: 10.1186/s12915-025-02209-8.
4
Computational Prediction of Linear Interacting Peptides.线性相互作用肽的计算预测。
Methods Mol Biol. 2025;2867:233-245. doi: 10.1007/978-1-0716-4196-5_14.
5
DescribePROT Database of Residue-Level Protein Structure and Function Annotations.描述 PROT 数据库中的残基水平的蛋白质结构和功能注释。
Methods Mol Biol. 2025;2867:169-184. doi: 10.1007/978-1-0716-4196-5_10.
6
SeqDance: A Protein Language Model for Representing Protein Dynamic Properties.SeqDance:一种用于表示蛋白质动态特性的蛋白质语言模型。
bioRxiv. 2024 Oct 15:2024.10.11.617911. doi: 10.1101/2024.10.11.617911.
7
MetaDegron: multimodal feature-integrated protein language model for predicting E3 ligase targeted degrons.MetaDegron:用于预测 E3 连接酶靶向降解结构域的多模态特征集成蛋白质语言模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae519.
8
Machine learning and multi-omics data reveal driver gene-based molecular subtypes in hepatocellular carcinoma for precision treatment.机器学习和多组学数据揭示了基于驱动基因的肝细胞癌分子亚型,以实现精准治疗。
PLoS Comput Biol. 2024 May 10;20(5):e1012113. doi: 10.1371/journal.pcbi.1012113. eCollection 2024 May.
9
DEGRONOPEDIA: a web server for proteome-wide inspection of degrons.DEGRONOPEDIA:一个用于蛋白质组范围内检查 degrons 的网络服务器。
Nucleic Acids Res. 2024 Jul 5;52(W1):W221-W232. doi: 10.1093/nar/gkae238.
10
Cadmium-induced Carcinogenesis in Respiratory Organs and the Prostate: Insights from Three Perspectives on Toxicogenomic Approach.镉诱导的呼吸器官和前列腺致癌作用:毒理基因组学方法三个视角的见解
J Cancer Prev. 2023 Dec 30;28(4):150-159. doi: 10.15430/JCP.2023.28.4.150.
UbiBrowser 2.0:真核生物中蛋白质组范围内已知和预测的泛素连接酶/去泛素化酶-底物相互作用的综合资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D719-D728. doi: 10.1093/nar/gkab962.
4
Proteome-wide mapping of short-lived proteins in human cells.人类细胞内短寿命蛋白质的蛋白质组范围图谱绘制。
Mol Cell. 2021 Nov 18;81(22):4722-4735.e5. doi: 10.1016/j.molcel.2021.09.015. Epub 2021 Oct 8.
5
Computational strategy for intrinsically disordered protein ligand design leads to the discovery of p53 transactivation domain I binding compounds that activate the p53 pathway.用于内在无序蛋白配体设计的计算策略导致发现激活p53途径的p53反式激活结构域I结合化合物。
Chem Sci. 2020 Dec 28;12(8):3004-3016. doi: 10.1039/d0sc04670a.
6
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences.生物结构和功能源于将无监督学习扩展到 2.5 亿个蛋白质序列。
Proc Natl Acad Sci U S A. 2021 Apr 13;118(15). doi: 10.1073/pnas.2016239118.
7
Mutations of Intrinsically Disordered Protein Regions Can Drive Cancer but Lack Therapeutic Strategies.内在无序蛋白区域的突变可引发癌症,但缺乏治疗策略。
Biomolecules. 2021 Mar 4;11(3):381. doi: 10.3390/biom11030381.
8
Systematic characterization of mutations altering protein degradation in human cancers.系统性鉴定导致人类癌症中蛋白质降解改变的突变。
Mol Cell. 2021 Mar 18;81(6):1292-1308.e11. doi: 10.1016/j.molcel.2021.01.020. Epub 2021 Feb 9.
9
Evaluating Protein Transfer Learning with TAPE.使用TAPE评估蛋白质迁移学习。
Adv Neural Inf Process Syst. 2019 Dec;32:9689-9701.
10
DescribePROT: database of amino acid-level protein structure and function predictions.DescribePROT:氨基酸水平的蛋白质结构和功能预测数据库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D298-D308. doi: 10.1093/nar/gkaa931.