• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习引导的蛋白质工程

Machine Learning-Guided Protein Engineering.

作者信息

Kouba Petr, Kohout Pavel, Haddadi Faraneh, Bushuiev Anton, Samusevich Raman, Sedlar Jiri, Damborsky Jiri, Pluskal Tomas, Sivic Josef, Mazurenko Stanislav

机构信息

Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic.

Czech Institute of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic.

出版信息

ACS Catal. 2023 Oct 13;13(21):13863-13895. doi: 10.1021/acscatal.3c02743. eCollection 2023 Nov 3.

DOI:10.1021/acscatal.3c02743
PMID:37942269
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10629210/
Abstract

Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.

摘要

在设计极具潜力的生物催化剂方面,近期的进展越来越多地涉及机器学习方法。这些方法利用现有的实验和模拟数据,以协助发现和注释有前景的酶,并为改进已知靶点提出有益的突变。蛋白质工程的机器学习领域正在蓬勃发展,这得益于近期的成功案例以及其他领域的显著进展。它已经涵盖了诸如理解和预测蛋白质结构与功能、催化效率、对映选择性、蛋白质动力学、稳定性、溶解度、聚集等雄心勃勃的任务。尽管如此,该领域仍在不断发展,有许多挑战需要克服,也有诸多问题需要解决。在这篇综述文章中,我们概述了该领域当前的发展趋势,突出了近期的案例研究,并审视了基于机器学习方法目前存在的局限性。我们强调在将新兴模型用于理性蛋白质设计之前,对其进行全面实验验证的至关重要性。我们阐述了对基本问题的看法,并概述了未来研究的潜在方向。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/dbbdd71bfd2b/cs3c02743_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/ef480cf85080/cs3c02743_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/c1291a6717f2/cs3c02743_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/af9f741cd26c/cs3c02743_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/dbbdd71bfd2b/cs3c02743_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/ef480cf85080/cs3c02743_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/c1291a6717f2/cs3c02743_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/af9f741cd26c/cs3c02743_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2865/10629210/dbbdd71bfd2b/cs3c02743_0004.jpg

相似文献

1
Machine Learning-Guided Protein Engineering.机器学习引导的蛋白质工程
ACS Catal. 2023 Oct 13;13(21):13863-13895. doi: 10.1021/acscatal.3c02743. eCollection 2023 Nov 3.
2
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
3
Unlocking the potential of enzyme engineering via rational computational design strategies.通过合理的计算设计策略挖掘酶工程的潜力。
Biotechnol Adv. 2024 Jul-Aug;73:108376. doi: 10.1016/j.biotechadv.2024.108376. Epub 2024 May 11.
4
Machine learning-assisted enzyme engineering.机器学习辅助酶工程。
Methods Enzymol. 2020;643:281-315. doi: 10.1016/bs.mie.2020.05.005. Epub 2020 Jun 12.
5
Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects.基于数据驱动的酶热稳定性计算设计策略:趋势、观点和展望。
Acta Biochim Biophys Sin (Shanghai). 2023 Mar 25;55(3):343-355. doi: 10.3724/abbs.2023033.
6
Hybrid schemes based on quantum mechanics/molecular mechanics simulations goals to success, problems, and perspectives.基于量子力学/分子力学模拟的混合方案的目标、问题和展望。
Adv Protein Chem Struct Biol. 2011;85:81-142. doi: 10.1016/B978-0-12-386485-7.00003-X.
7
Annual Research Review: Translational machine learning for child and adolescent psychiatry.年度研究综述:儿童和青少年精神病学的转化机器学习。
J Child Psychol Psychiatry. 2022 Apr;63(4):421-443. doi: 10.1111/jcpp.13545. Epub 2022 Jan 17.
8
Current status and emerging frontiers in enzyme engineering: An industrial perspective.酶工程的现状与新兴前沿:工业视角
Heliyon. 2024 Jun 7;10(11):e32673. doi: 10.1016/j.heliyon.2024.e32673. eCollection 2024 Jun 15.
9
Machine Learning in Tissue Engineering.机器学习在组织工程中的应用。
Tissue Eng Part A. 2023 Jan;29(1-2):2-19. doi: 10.1089/ten.TEA.2022.0128. Epub 2022 Sep 26.
10
Improving Enzyme Fitness with Machine Learning.利用机器学习提高酶的适应性。
Chimia (Aarau). 2023 Mar 29;77(3):116-121. doi: 10.2533/chimia.2023.116.

引用本文的文献

1
AlphaCD: a machine learning model capable of highly accurate characterization for 21,335 cytidine deaminases.AlphaCD:一种能够对21335种胞嘧啶脱氨酶进行高精度表征的机器学习模型。
Cell Res. 2025 Aug 18. doi: 10.1038/s41422-025-01164-x.
2
Sequence-only prediction of binding affinity changes: a robust and interpretable model for antibody engineering.结合亲和力变化的仅序列预测:一种用于抗体工程的稳健且可解释的模型。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf446.
3
Characterizing and engineering post-translational modifications with high-throughput cell-free expression.

本文引用的文献

1
MDverse, shedding light on the dark matter of molecular dynamics simulations.MDverse,揭示分子动力学模拟的暗物质。
Elife. 2024 Aug 30;12:RP90061. doi: 10.7554/eLife.90061.
2
Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations.稳定性预测器:一种基于结构的图变换框架,用于识别稳定化突变。
Nat Commun. 2024 Jul 23;15(1):6170. doi: 10.1038/s41467-024-49780-2.
3
Protein structure generation via folding diffusion.通过折叠扩散生成蛋白质结构
利用高通量无细胞表达技术对翻译后修饰进行表征和工程改造。
Nat Commun. 2025 Aug 5;16(1):7215. doi: 10.1038/s41467-025-60526-6.
4
Protein Structure-Function Relationship: A Kernel-PCA Approach for Reaction Coordinate Identification.蛋白质结构-功能关系:一种用于反应坐标识别的核主成分分析方法。
J Chem Theory Comput. 2025 Jul 22;21(14):7122-7130. doi: 10.1021/acs.jctc.5c00483. Epub 2025 Jul 14.
5
From reactants to products: computational methods for biosynthetic pathway design.从反应物到产物:生物合成途径设计的计算方法
Synth Syst Biotechnol. 2025 May 15;10(3):1038-1049. doi: 10.1016/j.synbio.2025.05.005. eCollection 2025 Sep.
6
Semi-supervised prediction of protein fitness for data-driven protein engineering.用于数据驱动蛋白质工程的蛋白质适应性半监督预测
J Cheminform. 2025 May 31;17(1):88. doi: 10.1186/s13321-025-01029-w.
7
Using Machine Learning to Analyze Molecular Dynamics Simulations of Biomolecules.利用机器学习分析生物分子的分子动力学模拟
J Phys Chem B. 2025 Jun 5;129(22):5375-5385. doi: 10.1021/acs.jpcb.4c08824. Epub 2025 May 27.
8
Revealing arginine-cysteine and glycine-cysteine NOS linkages by a systematic re-evaluation of protein structures.通过对蛋白质结构进行系统的重新评估来揭示精氨酸-半胱氨酸和甘氨酸-半胱氨酸一氧化氮合酶连接
Commun Chem. 2025 May 13;8(1):146. doi: 10.1038/s42004-025-01535-w.
9
Neural network conditioned to produce thermophilic protein sequences can increase thermal stability.经过训练以生成嗜热蛋白序列的神经网络可以提高热稳定性。
Sci Rep. 2025 Apr 23;15(1):14124. doi: 10.1038/s41598-025-90828-0.
10
Designing and implementing SMILE: An AI-driven platform for enhancing clinical decision-making in mental health and neurodivergence management.设计与实施SMILE:一个用于加强心理健康和神经差异管理中临床决策的人工智能驱动平台。
Comput Struct Biotechnol J. 2025 Feb 22;27:785-803. doi: 10.1016/j.csbj.2025.02.022. eCollection 2025.
Nat Commun. 2024 Feb 5;15(1):1059. doi: 10.1038/s41467-024-45051-2.
4
Prop3D: A flexible, Python-based platform for machine learning with protein structural properties and biophysical data.Prop3D:一个灵活的、基于 Python 的机器学习平台,用于处理蛋白质结构性质和生物物理数据。
BMC Bioinformatics. 2024 Jan 4;25(1):11. doi: 10.1186/s12859-023-05586-5.
5
EnzymeMap: curation, validation and data-driven prediction of enzymatic reactions.酶图谱:酶促反应的整理、验证及数据驱动预测
Chem Sci. 2023 Nov 22;14(48):14229-14242. doi: 10.1039/d3sc02048g. eCollection 2023 Dec 13.
6
ProGen2: Exploring the boundaries of protein language models.ProGen2:探索蛋白质语言模型的边界。
Cell Syst. 2023 Nov 15;14(11):968-978.e3. doi: 10.1016/j.cels.2023.10.002. Epub 2023 Oct 30.
7
Automated benchmarking of combined protein structure and ligand conformation prediction.自动化综合蛋白质结构和配体构象预测的基准测试。
Proteins. 2023 Dec;91(12):1912-1924. doi: 10.1002/prot.26605. Epub 2023 Oct 26.
8
Advancing Enzyme's Stability and Catalytic Efficiency through Synergy of Force-Field Calculations, Evolutionary Analysis, and Machine Learning.通过力场计算、进化分析和机器学习的协同作用提高酶的稳定性和催化效率
ACS Catal. 2023 Sep 11;13(19):12506-12518. doi: 10.1021/acscatal.3c02575. eCollection 2023 Oct 6.
9
Dataset Distillation: A Comprehensive Review.数据集提炼:全面综述。
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):150-170. doi: 10.1109/TPAMI.2023.3323376. Epub 2023 Dec 5.
10
A Comprehensive Survey of Dataset Distillation.数据集提炼的全面调查。
IEEE Trans Pattern Anal Mach Intell. 2024 Jan;46(1):17-32. doi: 10.1109/TPAMI.2023.3322540. Epub 2023 Dec 5.