• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习辅助酶工程面临的机遇与挑战

Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering.

作者信息

Yang Jason, Li Francesca-Zhoufan, Arnold Frances H

机构信息

Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States.

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125, United States.

出版信息

ACS Cent Sci. 2024 Feb 5;10(2):226-241. doi: 10.1021/acscentsci.3c01275. eCollection 2024 Feb 28.

DOI:10.1021/acscentsci.3c01275
PMID:38435522
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10906252/
Abstract

Enzymes can be engineered at the level of their amino acid sequences to optimize key properties such as expression, stability, substrate range, and catalytic efficiency-or even to unlock new catalytic activities not found in nature. Because the search space of possible proteins is vast, enzyme engineering usually involves discovering an enzyme starting point that has some level of the desired activity followed by directed evolution to improve its "fitness" for a desired application. Recently, machine learning (ML) has emerged as a powerful tool to complement this empirical process. ML models can contribute to (1) starting point discovery by functional annotation of known protein sequences or generating novel protein sequences with desired functions and (2) navigating protein fitness landscapes for fitness optimization by learning mappings between protein sequences and their associated fitness values. In this Outlook, we explain how ML complements enzyme engineering and discuss its future potential to unlock improved engineering outcomes.

摘要

酶可以在其氨基酸序列水平上进行改造,以优化关键特性,如表达、稳定性、底物范围和催化效率,甚至还能开启自然界中未发现的新催化活性。由于可能的蛋白质搜索空间非常庞大,酶工程通常包括找到一个具有一定程度所需活性的酶起始点,然后通过定向进化来提高其在特定应用中的“适应性”。最近,机器学习(ML)已成为补充这一经验过程的强大工具。ML模型可有助于:(1)通过对已知蛋白质序列进行功能注释或生成具有所需功能的新蛋白质序列来发现起始点;(2)通过学习蛋白质序列与其相关适应性值之间的映射关系,在蛋白质适应性景观中导航以优化适应性。在本展望文章中,我们解释了ML如何补充酶工程,并讨论了其未来解锁更好工程成果的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/4d66a9b682f1/oc3c01275_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/265d797ec161/oc3c01275_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/08773d2356de/oc3c01275_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/e021de64bac8/oc3c01275_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/4d66a9b682f1/oc3c01275_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/265d797ec161/oc3c01275_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/08773d2356de/oc3c01275_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/e021de64bac8/oc3c01275_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce3/10906252/4d66a9b682f1/oc3c01275_0004.jpg

相似文献

1
Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering.机器学习辅助酶工程面临的机遇与挑战
ACS Cent Sci. 2024 Feb 5;10(2):226-241. doi: 10.1021/acscentsci.3c01275. eCollection 2024 Feb 28.
2
Data-Driven Protein Engineering for Improving Catalytic Activity and Selectivity.基于数据的蛋白质工程提高催化活性和选择性。
Chembiochem. 2024 Feb 1;25(3):e202300754. doi: 10.1002/cbic.202300754. Epub 2023 Dec 11.
3
High-throughput screening, next generation sequencing and machine learning: advanced methods in enzyme engineering.高通量筛选、下一代测序和机器学习:酶工程的先进方法。
Chem Commun (Camb). 2022 Feb 17;58(15):2455-2467. doi: 10.1039/d1cc04635g.
4
Machine learning-assisted enzyme engineering.机器学习辅助酶工程。
Methods Enzymol. 2020;643:281-315. doi: 10.1016/bs.mie.2020.05.005. Epub 2020 Jun 12.
5
Current status and emerging frontiers in enzyme engineering: An industrial perspective.酶工程的现状与新兴前沿:工业视角
Heliyon. 2024 Jun 7;10(11):e32673. doi: 10.1016/j.heliyon.2024.e32673. eCollection 2024 Jun 15.
6
100th Anniversary of Macromolecular Science Viewpoint: Data-Driven Protein Design.高分子科学观点100周年:数据驱动的蛋白质设计
ACS Macro Lett. 2021 Mar 16;10(3):327-340. doi: 10.1021/acsmacrolett.0c00885. Epub 2021 Feb 8.
7
Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering.机器学习引导的适应性和多样性协同优化促进了酶工程组合文库设计。
Nat Commun. 2024 Jul 29;15(1):6392. doi: 10.1038/s41467-024-50698-y.
8
DeCOIL: Optimization of Degenerate Codon Libraries for Machine Learning-Assisted Protein Engineering.DeCOIL:用于机器学习辅助蛋白质工程的简并密码子文库优化。
ACS Synth Biol. 2023 Aug 18;12(8):2444-2454. doi: 10.1021/acssynbio.3c00301. Epub 2023 Jul 31.
9
Machine learning-assisted directed protein evolution with combinatorial libraries.机器学习辅助的组合文库定向蛋白质进化。
Proc Natl Acad Sci U S A. 2019 Apr 30;116(18):8852-8858. doi: 10.1073/pnas.1901979116. Epub 2019 Apr 12.
10
Machine learning to predict continuous protein properties from binary cell sorting data and map unseen sequence space.机器学习预测连续蛋白质特性从二进制细胞排序数据和映射未见序列空间。
Proc Natl Acad Sci U S A. 2024 Mar 12;121(11):e2311726121. doi: 10.1073/pnas.2311726121. Epub 2024 Mar 7.

引用本文的文献

1
Targeting the selectivity filter to drastically alter the activity and substrate spectrum of a promiscuous metal transporter.针对选择性过滤器以大幅改变一种混杂金属转运蛋白的活性和底物谱。
Chem Sci. 2025 Aug 26. doi: 10.1039/d5sc03700j.
2
Rational protein engineering using an omni-directional multipoint mutagenesis generation pipeline.利用全向多点诱变生成流程进行合理的蛋白质工程。
iScience. 2025 Aug 5;28(9):113273. doi: 10.1016/j.isci.2025.113273. eCollection 2025 Sep 19.
3
Active learning-guided optimization of cell-free biosensors for lead testing in drinking water.

本文引用的文献

1
The genetic architecture of protein stability.蛋白质稳定性的遗传结构。
Nature. 2024 Oct;634(8035):995-1003. doi: 10.1038/s41586-024-07966-0. Epub 2024 Sep 25.
2
The simplicity of protein sequence-function relationships.蛋白质序列与功能关系的简单性。
Nat Commun. 2024 Sep 11;15(1):7953. doi: 10.1038/s41467-024-51895-5.
3
Fine-tuning protein language models boosts predictions across diverse tasks.微调蛋白质语言模型可提高跨多种任务的预测能力。
主动学习引导的用于饮用水中铅检测的无细胞生物传感器优化
bioRxiv. 2025 Aug 22:2025.08.20.671382. doi: 10.1101/2025.08.20.671382.
4
Advancing cell therapies with artificial intelligence and synthetic biology.利用人工智能和合成生物学推动细胞疗法发展。
Curr Opin Biomed Eng. 2025 Jun;34. doi: 10.1016/j.cobme.2025.100580. Epub 2025 Feb 3.
5
Advancing Biocatalysis Education: Sustaining the Future of Industrial Biotechnology.推进生物催化教育:维系工业生物技术的未来
JACS Au. 2025 Jul 16;5(7):2932-2938. doi: 10.1021/jacsau.5c00369. eCollection 2025 Jul 28.
6
Mechanisms and Strategies for Engineering Oxidative Stress Resistance in .工程化抗氧化应激的机制与策略
Chem Bio Eng. 2025 May 29;2(7):409-422. doi: 10.1021/cbe.5c00021. eCollection 2025 Jul 24.
7
Mechanistic modeling or machine learning for detecting variants of concern: Why not both?用于检测关注变体的机制建模或机器学习:为何不两者兼用?
Proc Natl Acad Sci U S A. 2025 Jul 29;122(30):e2513608122. doi: 10.1073/pnas.2513608122. Epub 2025 Jul 21.
8
A generalized platform for artificial intelligence-powered autonomous enzyme engineering.用于人工智能驱动的自主酶工程的通用平台。
Nat Commun. 2025 Jul 1;16(1):5648. doi: 10.1038/s41467-025-61209-y.
9
Review on Advancement of AI in Cell Engineering and Molecular Biology.人工智能在细胞工程和分子生物学中的进展综述
Methods Mol Biol. 2025;2952:169-192. doi: 10.1007/978-1-0716-4690-8_10.
10
From reactants to products: computational methods for biosynthetic pathway design.从反应物到产物:生物合成途径设计的计算方法
Synth Syst Biotechnol. 2025 May 15;10(3):1038-1049. doi: 10.1016/j.synbio.2025.05.005. eCollection 2025 Sep.
Nat Commun. 2024 Aug 28;15(1):7407. doi: 10.1038/s41467-024-51844-2.
4
Neural network extrapolation to distant regions of the protein fitness landscape.神经网络对蛋白质适应度景观的遥远区域进行外推。
Nat Commun. 2024 Jul 30;15(1):6405. doi: 10.1038/s41467-024-50712-3.
5
Zero-shot prediction of mutation effects with multimodal deep representation learning guides protein engineering.基于多模态深度表示学习的零-shot 突变效应预测指导蛋白质工程。
Cell Res. 2024 Sep;34(9):630-647. doi: 10.1038/s41422-024-00989-2. Epub 2024 Jul 5.
6
Unsupervised evolution of protein and antibody complexes with a structure-informed language model.无监督的蛋白质和抗体复合物的进化与结构信息语言模型。
Science. 2024 Jul 5;385(6704):46-53. doi: 10.1126/science.adk8946. Epub 2024 Jul 4.
7
Generative artificial intelligence for de novo protein design.用于全新蛋白质设计的生成式人工智能。
Curr Opin Struct Biol. 2024 Jun;86:102794. doi: 10.1016/j.sbi.2024.102794. Epub 2024 Apr 24.
8
Computational scoring and experimental evaluation of enzymes generated by neural networks.神经网络生成的酶的计算评分与实验评估
Nat Biotechnol. 2025 Mar;43(3):396-405. doi: 10.1038/s41587-024-02214-2. Epub 2024 Apr 23.
9
Flattening the curve-How to get better results with small deep-mutational-scanning datasets.拉平曲线——如何从小规模深度突变扫描数据集获得更好的结果。
Proteins. 2024 Jul;92(7):886-902. doi: 10.1002/prot.26686. Epub 2024 Mar 19.
10
Self-driving laboratories to autonomously navigate the protein fitness landscape.自动驾驶实验室自主探索蛋白质适应性景观。
Nat Chem Eng. 2024 Jan;1(1):97-107. doi: 10.1038/s44286-023-00002-4. Epub 2024 Jan 11.