基于数据驱动的酶热稳定性计算设计策略：趋势、观点和展望。

Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects.

机构信息

State Key Laboratory of Microbial Technology, Shandong University, Qingdao 266237, China.

School of Software, Shandong University, Jinan 250101, China.

出版信息

Acta Biochim Biophys Sin (Shanghai). 2023 Mar 25;55(3):343-355. doi: 10.3724/abbs.2023033.

DOI:10.3724/abbs.2023033

PMID:37143326

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10160227/

Abstract

Thermal stability is one of the most important properties of enzymes, which sustains life and determines the potential for the industrial application of biocatalysts. Although traditional methods such as directed evolution and classical rational design contribute greatly to this field, the enormous sequence space of proteins implies costly and arduous experiments. The development of enzyme engineering focuses on automated and efficient strategies because of the breakthrough of high-throughput DNA sequencing and machine learning models. In this review, we propose a data-driven architecture for enzyme thermostability engineering and summarize some widely adopted datasets, as well as machine learning-driven approaches for designing the thermal stability of enzymes. In addition, we present a series of existing challenges while applying machine learning in enzyme thermostability design, such as the data dilemma, model training, and use of the proposed models. Additionally, a few promising directions for enhancing the performance of the models are discussed. We anticipate that the efficient incorporation of machine learning can provide more insights and solutions for the design of enzyme thermostability in the coming years.

摘要

热稳定性是酶的最重要性质之一，它维持着生命的存在，并决定了生物催化剂在工业应用中的潜力。尽管定向进化和经典理性设计等传统方法对此领域贡献巨大，但蛋白质的巨大序列空间意味着需要进行昂贵且艰巨的实验。由于高通量 DNA 测序和机器学习模型的突破，酶工程的发展侧重于自动化和高效的策略。在这篇综述中，我们提出了一种用于酶热稳定性工程的基于数据的架构，并总结了一些广泛采用的数据集，以及用于设计酶热稳定性的机器学习驱动方法。此外，我们还提出了在应用机器学习进行酶热稳定性设计时存在的一系列挑战，例如数据困境、模型训练以及所提出模型的使用。此外，还讨论了增强模型性能的一些有前途的方向。我们预计，在未来几年，机器学习的有效结合将为酶热稳定性设计提供更多的见解和解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bb4f/10160227/a40349c1e1fc/ABBS-2022-496-t1.jpg

相似文献

Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects.

Acta Biochim Biophys Sin (Shanghai). 2023 Mar 25;55(3):343-355. doi: 10.3724/abbs.2023033.

Machine learning-assisted enzyme engineering.

Methods Enzymol. 2020;643:281-315. doi: 10.1016/bs.mie.2020.05.005. Epub 2020 Jun 12.

Unlocking the potential of enzyme engineering via rational computational design strategies.

Biotechnol Adv. 2024 Jul-Aug;73:108376. doi: 10.1016/j.biotechadv.2024.108376. Epub 2024 May 11.

[Progress in the application of artificial intelligence-assisted molecular modification of enzymes].

Sheng Wu Gong Cheng Xue Bao. 2024 Jun 25;40(6):1728-1741. doi: 10.13345/j.cjb.230748.

High-Throughput Screening Techniques for the Selection of Thermostable Enzymes.

J Agric Food Chem. 2024 Feb 28;72(8):3833-3845. doi: 10.1021/acs.jafc.3c07554. Epub 2024 Jan 29.

Overview of strategies for developing high thermostability industrial enzymes: Discovery, mechanism, modification and challenges.

Crit Rev Food Sci Nutr. 2023;63(14):2057-2073. doi: 10.1080/10408398.2021.1970508. Epub 2021 Aug 26.

Revolutionizing enzyme engineering through artificial intelligence and machine learning.

Emerg Top Life Sci. 2021 May 14;5(1):113-125. doi: 10.1042/ETLS20200257.

Rational-Design Engineering to Improve Enzyme Thermostability.

Methods Mol Biol. 2022;2397:159-178. doi: 10.1007/978-1-0716-1826-4_9.

High-throughput screening, next generation sequencing and machine learning: advanced methods in enzyme engineering.

Chem Commun (Camb). 2022 Feb 17;58(15):2455-2467. doi: 10.1039/d1cc04635g.

Machine learning-guided multi-site combinatorial mutagenesis enhances the thermostability of pectin lyase.

Int J Biol Macromol. 2024 Oct;277(Pt 4):134530. doi: 10.1016/j.ijbiomac.2024.134530. Epub 2024 Aug 5.

引用本文的文献

Tailoring industrial enzymes for thermostability and activity evolution by the machine learning-based iCASE strategy.

Nat Commun. 2025 Jan 11;16(1):604. doi: 10.1038/s41467-025-55944-5.

Enhancing Machine-Learning Prediction of Enzyme Catalytic Temperature Optima through Amino Acid Conservation Analysis.

Int J Mol Sci. 2024 Jun 6;25(11):6252. doi: 10.3390/ijms25116252.

Advancing Enzyme's Stability and Catalytic Efficiency through Synergy of Force-Field Calculations, Evolutionary Analysis, and Machine Learning.

ACS Catal. 2023 Sep 11;13(19):12506-12518. doi: 10.1021/acscatal.3c02575. eCollection 2023 Oct 6.

本文引用的文献

A novel thermophilic chitinase directly mined from the marine metagenome using the deep learning tool Preoptem.

Bioresour Bioprocess. 2022 May 16;9(1):54. doi: 10.1186/s40643-022-00543-1.

Learning deep representations of enzyme thermal adaptation.

Protein Sci. 2022 Dec;31(12):e4480. doi: 10.1002/pro.4480.

ProtGPT2 is a deep unsupervised language model for protein design.

Nat Commun. 2022 Jul 27;13(1):4348. doi: 10.1038/s41467-022-32007-7.

Extremophilic lipases for industrial applications: A general review.

Biotechnol Adv. 2022 Nov;60:108002. doi: 10.1016/j.biotechadv.2022.108002. Epub 2022 Jun 7.

NetSurfP-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning.

Nucleic Acids Res. 2022 Jul 5;50(W1):W510-W515. doi: 10.1093/nar/gkac439.

Machine learning-aided engineering of hydrolases for PET depolymerization.

Nature. 2022 Apr;604(7907):662-667. doi: 10.1038/s41586-022-04599-z. Epub 2022 Apr 27.

Large-scale design and refinement of stable proteins using sequence-only models.

PLoS One. 2022 Mar 14;17(3):e0265020. doi: 10.1371/journal.pone.0265020. eCollection 2022.

iThermo: A Sequence-Based Model for Identifying Thermophilic Proteins Using a Multi-Feature Fusion Strategy.

Front Microbiol. 2022 Feb 22;13:790063. doi: 10.3389/fmicb.2022.790063. eCollection 2022.

Engineering Strategies to Overcome the Stability-Function Trade-Off in Proteins.

ACS Synth Biol. 2022 Mar 18;11(3):1030-1039. doi: 10.1021/acssynbio.1c00512. Epub 2022 Mar 8.

TMPpred: A support vector machine-based thermophilic protein identifier.

Anal Biochem. 2022 May 15;645:114625. doi: 10.1016/j.ab.2022.114625. Epub 2022 Feb 23.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于数据驱动的酶热稳定性计算设计策略：趋势、观点和展望。

Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献