• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将机器学习与基于结构的蛋白质设计相结合,以预测和设计蛋白质的翻译后修饰。

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins.

机构信息

Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany.

Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany.

出版信息

PLoS Comput Biol. 2024 Mar 14;20(3):e1011939. doi: 10.1371/journal.pcbi.1011939. eCollection 2024 Mar.

DOI:10.1371/journal.pcbi.1011939
PMID:38484014
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10965067/
Abstract

Post-translational modifications (PTMs) of proteins play a vital role in their function and stability. These modifications influence protein folding, signaling, protein-protein interactions, enzyme activity, binding affinity, aggregation, degradation, and much more. To date, over 400 types of PTMs have been described, representing chemical diversity well beyond the genetically encoded amino acids. Such modifications pose a challenge to the successful design of proteins, but also represent a major opportunity to diversify the protein engineering toolbox. To this end, we first trained artificial neural networks (ANNs) to predict eighteen of the most abundant PTMs, including protein glycosylation, phosphorylation, methylation, and deamidation. In a second step, these models were implemented inside the computational protein modeling suite Rosetta, which allows flexible combination with existing protocols to model the modified sites and understand their impact on protein stability as well as function. Lastly, we developed a new design protocol that either maximizes or minimizes the predicted probability of a particular site being modified. We find that this combination of ANN prediction and structure-based design can enable the modification of existing, as well as the introduction of novel, PTMs. The potential applications of our work include, but are not limited to, glycan masking of epitopes, strengthening protein-protein interactions through phosphorylation, as well as protecting proteins from deamidation liabilities. These applications are especially important for the design of new protein therapeutics where PTMs can drastically change the therapeutic properties of a protein. Our work adds novel tools to Rosetta's protein engineering toolbox that allow for the rational design of PTMs.

摘要

蛋白质的翻译后修饰(PTMs)在其功能和稳定性中起着至关重要的作用。这些修饰影响蛋白质折叠、信号转导、蛋白质-蛋白质相互作用、酶活性、结合亲和力、聚集、降解等等。迄今为止,已经描述了超过 400 种 PTMs,代表了远超遗传编码氨基酸的化学多样性。这些修饰对蛋白质的成功设计构成了挑战,但也代表了使蛋白质工程工具多样化的主要机会。为此,我们首先训练人工神经网络(ANNs)来预测十八种最丰富的 PTMs,包括蛋白质糖基化、磷酸化、甲基化和脱酰胺。在第二步中,这些模型被实施在计算蛋白质建模套件 Rosetta 中,这允许灵活地与现有协议结合,以模拟修饰位点,并了解它们对蛋白质稳定性和功能的影响。最后,我们开发了一种新的设计方案,该方案要么最大化,要么最小化特定位点被修饰的预测概率。我们发现,这种 ANN 预测和基于结构的设计的组合可以实现现有 PTMs 的修饰,以及引入新的 PTMs。我们工作的潜在应用包括但不限于糖基化掩盖表位、通过磷酸化增强蛋白质-蛋白质相互作用,以及保护蛋白质免受脱酰胺缺陷的影响。这些应用在设计新的蛋白质治疗药物时尤为重要,因为 PTMs 可以极大地改变蛋白质的治疗特性。我们的工作为 Rosetta 的蛋白质工程工具箱添加了新的工具,允许对 PTMs 进行合理设计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/36996e41c38d/pcbi.1011939.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/5dd3184d5f44/pcbi.1011939.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/9770cbf9f324/pcbi.1011939.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/98e4b45e1571/pcbi.1011939.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/614c51a4a3ba/pcbi.1011939.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/36996e41c38d/pcbi.1011939.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/5dd3184d5f44/pcbi.1011939.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/9770cbf9f324/pcbi.1011939.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/98e4b45e1571/pcbi.1011939.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/614c51a4a3ba/pcbi.1011939.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/36996e41c38d/pcbi.1011939.g005.jpg

相似文献

1
Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins.将机器学习与基于结构的蛋白质设计相结合,以预测和设计蛋白质的翻译后修饰。
PLoS Comput Biol. 2024 Mar 14;20(3):e1011939. doi: 10.1371/journal.pcbi.1011939. eCollection 2024 Mar.
2
A machine learning strategy for predicting localization of post-translational modification sites in protein-protein interacting regions.一种用于预测蛋白质-蛋白质相互作用区域中翻译后修饰位点定位的机器学习策略。
BMC Bioinformatics. 2016 Aug 17;17(1):307. doi: 10.1186/s12859-016-1165-8.
3
A Text Mining and Machine Learning Protocol for Extracting Posttranslational Modifications of Proteins from PubMed: A Special Focus on Glycosylation, Acetylation, Methylation, Hydroxylation, and Ubiquitination.一种从 PubMed 中提取蛋白质翻译后修饰的文本挖掘和机器学习协议:特别关注糖基化、乙酰化、甲基化、羟化和泛素化。
Methods Mol Biol. 2022;2496:179-202. doi: 10.1007/978-1-0716-2305-3_10.
4
Current status of PTMs structural databases: applications, limitations and prospects.PTMs 结构数据库的现状:应用、局限性和前景。
Amino Acids. 2022 Apr;54(4):575-590. doi: 10.1007/s00726-021-03119-z. Epub 2022 Jan 12.
5
Novel Post-translational Modifications in Human Serum Albumin.人血清白蛋白的新型翻译后修饰。
Protein Pept Lett. 2022;29(5):473-484. doi: 10.2174/0929866529666220318152509.
6
Post-translational modifications in the Protein Data Bank.蛋白质数据库中的翻译后修饰。
Acta Crystallogr D Struct Biol. 2024 Sep 1;80(Pt 9):647-660. doi: 10.1107/S2059798324007794. Epub 2024 Aug 29.
7
Tau Post-translational Modifications: Dynamic Transformers of Tau Function, Degradation, and Aggregation.tau蛋白的翻译后修饰:tau蛋白功能、降解及聚集的动态转变因素
Front Neurol. 2021 Jan 7;11:595532. doi: 10.3389/fneur.2020.595532. eCollection 2020.
8
Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence.根据氨基酸序列预测蛋白质的翻译后糖基化和磷酸化。
Proteomics. 2004 Jun;4(6):1633-49. doi: 10.1002/pmic.200300771.
9
Proteome-wide profiling and mapping of post translational modifications in human hearts.人类心脏中翻译后修饰的蛋白质组学全面分析和作图。
Sci Rep. 2021 Jan 26;11(1):2184. doi: 10.1038/s41598-021-81986-y.
10
PRISMOID: a comprehensive 3D structure database for post-translational modifications and mutations with functional impact.棱柱体数据库:一个用于翻译后修饰和具有功能影响的突变的综合三维结构数据库。
Brief Bioinform. 2020 May 21;21(3):1069-1079. doi: 10.1093/bib/bbz050.

引用本文的文献

1
Fungi-Kcr: a language model for predicting lysine crotonylation in pathogenic fungal proteins.真菌Kcr:一种用于预测致病真菌蛋白质中赖氨酸巴豆酰化的语言模型。
Front Cell Infect Microbiol. 2025 Jul 15;15:1615443. doi: 10.3389/fcimb.2025.1615443. eCollection 2025.
2
Large Language Model (LLM)-Based Advances in Prediction of Post-translational Modification Sites in Proteins.基于大语言模型(LLM)在蛋白质翻译后修饰位点预测方面的进展。
Methods Mol Biol. 2025;2941:313-355. doi: 10.1007/978-1-0716-4623-6_19.
3
MTPrompt-PTM: A Multi-Task Method for Post-Translational Modification Prediction Using Prompt Tuning on a Structure-Aware Protein Language Model.

本文引用的文献

1
Growing Glycans in Rosetta: Accurate de novo glycan modeling, density fitting, and rational sequon design.在 Rosetta 中生成聚糖:准确从头聚糖建模、密度拟合和合理的序列设计。
PLoS Comput Biol. 2024 Jun 24;20(6):e1011895. doi: 10.1371/journal.pcbi.1011895. eCollection 2024 Jun.
2
Rational Design of Phosphorylation-Responsive Coiled Coil-Peptide Assemblies.磷酸化响应性螺旋-肽组装体的合理设计。
ACS Synth Biol. 2023 Apr 21;12(4):1308-1319. doi: 10.1021/acssynbio.3c00064. Epub 2023 Mar 29.
3
UniProt: the Universal Protein Knowledgebase in 2023.
MTPrompt-PTM:一种基于结构感知蛋白质语言模型的提示调整用于翻译后修饰预测的多任务方法。
Biomolecules. 2025 Jun 9;15(6):843. doi: 10.3390/biom15060843.
4
Self-supervised machine learning methods for protein design improve sampling but not the identification of high-fitness variants.用于蛋白质设计的自监督机器学习方法可改善采样,但无法识别高适应性变体。
Sci Adv. 2025 Feb 14;11(7):eadr7338. doi: 10.1126/sciadv.adr7338. Epub 2025 Feb 12.
5
Artificial Intelligence Transforming Post-Translational Modification Research.人工智能正在改变翻译后修饰研究。
Bioengineering (Basel). 2024 Dec 31;12(1):26. doi: 10.3390/bioengineering12010026.
6
DLBWE-Cys: a deep-learning-based tool for identifying cysteine S-carboxyethylation sites using binary-weight encoding.DLBWE-Cys:一种基于深度学习的工具,用于使用二进制权重编码识别半胱氨酸S-羧乙基化位点。
Front Genet. 2025 Jan 8;15:1464976. doi: 10.3389/fgene.2024.1464976. eCollection 2024.
7
Integrative Multi-PTM Proteomics Reveals Dynamic Global, Redox, Phosphorylation, and Acetylation Regulation in Cytokine-Treated Pancreatic Beta Cells.整合多翻译后修饰蛋白质组学揭示细胞因子处理的胰岛β细胞中的动态全局、氧化还原、磷酸化和乙酰化调控
Mol Cell Proteomics. 2024 Dec;23(12):100881. doi: 10.1016/j.mcpro.2024.100881. Epub 2024 Nov 15.
8
Current computational tools for protein lysine acylation site prediction.当前用于预测蛋白质赖氨酸酰化位点的计算工具。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.
9
Combining Rosetta Sequence Design with Protein Language Model Predictions Using Evolutionary Scale Modeling (ESM) as Restraint.结合罗塞塔序列设计与蛋白质语言模型预测,使用进化规模建模(ESM)作为约束条件。
ACS Synth Biol. 2024 Apr 19;13(4):1085-1092. doi: 10.1021/acssynbio.3c00753. Epub 2024 Apr 3.
UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
4
Deciphering deamidation and isomerization in therapeutic proteins: Effect of neighboring residue.解析治疗性蛋白质中的脱酰胺和异构化:邻近残基的影响。
MAbs. 2022 Jan-Dec;14(1):2143006. doi: 10.1080/19420862.2022.2143006.
5
Differential T cell immune responses to deamidated adeno-associated virus vector.对脱酰胺化腺相关病毒载体的差异性T细胞免疫反应
Mol Ther Methods Clin Dev. 2022 Jan 18;24:255-267. doi: 10.1016/j.omtm.2022.01.005. eCollection 2022 Mar 10.
6
DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction.DeepNGlyPred:一种基于深度神经网络的人类 N 连接糖基化位点预测方法。
Molecules. 2021 Dec 2;26(23):7314. doi: 10.3390/molecules26237314.
7
Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks.通过广泛的自动化基准测试,确保生物大分子建模中的科学可重复性。
Nat Commun. 2021 Nov 29;12(1):6947. doi: 10.1038/s41467-021-27222-7.
8
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
9
dbPTM in 2022: an updated database for exploring regulatory networks and functional associations of protein post-translational modifications.dbPTM 在 2022 年:一个更新的数据库,用于探索蛋白质翻译后修饰的调控网络和功能关联。
Nucleic Acids Res. 2022 Jan 7;50(D1):D471-D479. doi: 10.1093/nar/gkab1017.
10
De novo design of tyrosine and serine kinase-driven protein switches.从头设计酪氨酸和丝氨酸激酶驱动的蛋白开关。
Nat Struct Mol Biol. 2021 Sep;28(9):762-770. doi: 10.1038/s41594-021-00649-8. Epub 2021 Sep 13.