Suppr
超能文献

将机器学习与基于结构的蛋白质设计相结合，以预测和设计蛋白质的翻译后修饰。

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins.

机构信息

Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany.

Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany.

出版信息

PLoS Comput Biol. 2024 Mar 14;20(3):e1011939. doi: 10.1371/journal.pcbi.1011939. eCollection 2024 Mar.

DOI:10.1371/journal.pcbi.1011939

PMID:38484014

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10965067/

Abstract

Post-translational modifications (PTMs) of proteins play a vital role in their function and stability. These modifications influence protein folding, signaling, protein-protein interactions, enzyme activity, binding affinity, aggregation, degradation, and much more. To date, over 400 types of PTMs have been described, representing chemical diversity well beyond the genetically encoded amino acids. Such modifications pose a challenge to the successful design of proteins, but also represent a major opportunity to diversify the protein engineering toolbox. To this end, we first trained artificial neural networks (ANNs) to predict eighteen of the most abundant PTMs, including protein glycosylation, phosphorylation, methylation, and deamidation. In a second step, these models were implemented inside the computational protein modeling suite Rosetta, which allows flexible combination with existing protocols to model the modified sites and understand their impact on protein stability as well as function. Lastly, we developed a new design protocol that either maximizes or minimizes the predicted probability of a particular site being modified. We find that this combination of ANN prediction and structure-based design can enable the modification of existing, as well as the introduction of novel, PTMs. The potential applications of our work include, but are not limited to, glycan masking of epitopes, strengthening protein-protein interactions through phosphorylation, as well as protecting proteins from deamidation liabilities. These applications are especially important for the design of new protein therapeutics where PTMs can drastically change the therapeutic properties of a protein. Our work adds novel tools to Rosetta's protein engineering toolbox that allow for the rational design of PTMs.

摘要

蛋白质的翻译后修饰（PTMs）在其功能和稳定性中起着至关重要的作用。这些修饰影响蛋白质折叠、信号转导、蛋白质-蛋白质相互作用、酶活性、结合亲和力、聚集、降解等等。迄今为止，已经描述了超过 400 种 PTMs，代表了远超遗传编码氨基酸的化学多样性。这些修饰对蛋白质的成功设计构成了挑战，但也代表了使蛋白质工程工具多样化的主要机会。为此，我们首先训练人工神经网络（ANNs）来预测十八种最丰富的 PTMs，包括蛋白质糖基化、磷酸化、甲基化和脱酰胺。在第二步中，这些模型被实施在计算蛋白质建模套件 Rosetta 中，这允许灵活地与现有协议结合，以模拟修饰位点，并了解它们对蛋白质稳定性和功能的影响。最后，我们开发了一种新的设计方案，该方案要么最大化，要么最小化特定位点被修饰的预测概率。我们发现，这种 ANN 预测和基于结构的设计的组合可以实现现有 PTMs 的修饰，以及引入新的 PTMs。我们工作的潜在应用包括但不限于糖基化掩盖表位、通过磷酸化增强蛋白质-蛋白质相互作用，以及保护蛋白质免受脱酰胺缺陷的影响。这些应用在设计新的蛋白质治疗药物时尤为重要，因为 PTMs 可以极大地改变蛋白质的治疗特性。我们的工作为 Rosetta 的蛋白质工程工具箱添加了新的工具，允许对 PTMs 进行合理设计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10b0/10965067/5dd3184d5f44/pcbi.1011939.g001.jpg

相似文献

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins.

PLoS Comput Biol. 2024 Mar 14;20(3):e1011939. doi: 10.1371/journal.pcbi.1011939. eCollection 2024 Mar.

A machine learning strategy for predicting localization of post-translational modification sites in protein-protein interacting regions.

BMC Bioinformatics. 2016 Aug 17;17(1):307. doi: 10.1186/s12859-016-1165-8.

A Text Mining and Machine Learning Protocol for Extracting Posttranslational Modifications of Proteins from PubMed: A Special Focus on Glycosylation, Acetylation, Methylation, Hydroxylation, and Ubiquitination.

Methods Mol Biol. 2022;2496:179-202. doi: 10.1007/978-1-0716-2305-3_10.

Current status of PTMs structural databases: applications, limitations and prospects.

Amino Acids. 2022 Apr;54(4):575-590. doi: 10.1007/s00726-021-03119-z. Epub 2022 Jan 12.

Novel Post-translational Modifications in Human Serum Albumin.

Protein Pept Lett. 2022;29(5):473-484. doi: 10.2174/0929866529666220318152509.

Post-translational modifications in the Protein Data Bank.

Acta Crystallogr D Struct Biol. 2024 Sep 1;80(Pt 9):647-660. doi: 10.1107/S2059798324007794. Epub 2024 Aug 29.

Tau Post-translational Modifications: Dynamic Transformers of Tau Function, Degradation, and Aggregation.

Front Neurol. 2021 Jan 7;11:595532. doi: 10.3389/fneur.2020.595532. eCollection 2020.

Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence.

Proteomics. 2004 Jun;4(6):1633-49. doi: 10.1002/pmic.200300771.

Proteome-wide profiling and mapping of post translational modifications in human hearts.

Sci Rep. 2021 Jan 26;11(1):2184. doi: 10.1038/s41598-021-81986-y.

PRISMOID: a comprehensive 3D structure database for post-translational modifications and mutations with functional impact.

Brief Bioinform. 2020 May 21;21(3):1069-1079. doi: 10.1093/bib/bbz050.

引用本文的文献

Fungi-Kcr: a language model for predicting lysine crotonylation in pathogenic fungal proteins.

Front Cell Infect Microbiol. 2025 Jul 15;15:1615443. doi: 10.3389/fcimb.2025.1615443. eCollection 2025.

Large Language Model (LLM)-Based Advances in Prediction of Post-translational Modification Sites in Proteins.

Methods Mol Biol. 2025;2941:313-355. doi: 10.1007/978-1-0716-4623-6_19.

MTPrompt-PTM: A Multi-Task Method for Post-Translational Modification Prediction Using Prompt Tuning on a Structure-Aware Protein Language Model.

Biomolecules. 2025 Jun 9;15(6):843. doi: 10.3390/biom15060843.

Self-supervised machine learning methods for protein design improve sampling but not the identification of high-fitness variants.

Sci Adv. 2025 Feb 14;11(7):eadr7338. doi: 10.1126/sciadv.adr7338. Epub 2025 Feb 12.

Artificial Intelligence Transforming Post-Translational Modification Research.

Bioengineering (Basel). 2024 Dec 31;12(1):26. doi: 10.3390/bioengineering12010026.

DLBWE-Cys: a deep-learning-based tool for identifying cysteine S-carboxyethylation sites using binary-weight encoding.

Front Genet. 2025 Jan 8;15:1464976. doi: 10.3389/fgene.2024.1464976. eCollection 2024.

Integrative Multi-PTM Proteomics Reveals Dynamic Global, Redox, Phosphorylation, and Acetylation Regulation in Cytokine-Treated Pancreatic Beta Cells.

Mol Cell Proteomics. 2024 Dec;23(12):100881. doi: 10.1016/j.mcpro.2024.100881. Epub 2024 Nov 15.

Current computational tools for protein lysine acylation site prediction.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae469.

Combining Rosetta Sequence Design with Protein Language Model Predictions Using Evolutionary Scale Modeling (ESM) as Restraint.

ACS Synth Biol. 2024 Apr 19;13(4):1085-1092. doi: 10.1021/acssynbio.3c00753. Epub 2024 Apr 3.

本文引用的文献

Growing Glycans in Rosetta: Accurate de novo glycan modeling, density fitting, and rational sequon design.

PLoS Comput Biol. 2024 Jun 24;20(6):e1011895. doi: 10.1371/journal.pcbi.1011895. eCollection 2024 Jun.

Rational Design of Phosphorylation-Responsive Coiled Coil-Peptide Assemblies.

ACS Synth Biol. 2023 Apr 21;12(4):1308-1319. doi: 10.1021/acssynbio.3c00064. Epub 2023 Mar 29.

UniProt: the Universal Protein Knowledgebase in 2023.

Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.

Deciphering deamidation and isomerization in therapeutic proteins: Effect of neighboring residue.

MAbs. 2022 Jan-Dec;14(1):2143006. doi: 10.1080/19420862.2022.2143006.

Differential T cell immune responses to deamidated adeno-associated virus vector.

Mol Ther Methods Clin Dev. 2022 Jan 18;24:255-267. doi: 10.1016/j.omtm.2022.01.005. eCollection 2022 Mar 10.

DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction.

Molecules. 2021 Dec 2;26(23):7314. doi: 10.3390/molecules26237314.

Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks.

Nat Commun. 2021 Nov 29;12(1):6947. doi: 10.1038/s41467-021-27222-7.

AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.

Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.

dbPTM in 2022: an updated database for exploring regulatory networks and functional associations of protein post-translational modifications.

Nucleic Acids Res. 2022 Jan 7;50(D1):D471-D479. doi: 10.1093/nar/gkab1017.

De novo design of tyrosine and serine kinase-driven protein switches.

Nat Struct Mol Biol. 2021 Sep;28(9):762-770. doi: 10.1038/s41594-021-00649-8. Epub 2021 Sep 13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

将机器学习与基于结构的蛋白质设计相结合，以预测和设计蛋白质的翻译后修饰。

Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译