• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

主动学习辅助的定向进化

Active learning-assisted directed evolution.

作者信息

Yang Jason, Lal Ravi G, Bowden James C, Astudillo Raul, Hameedi Mikhail A, Kaur Sukhvinder, Hill Matthew, Yue Yisong, Arnold Frances H

机构信息

Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, USA.

Division of Engineering and Applied Sciences, California Institute of Technology, Pasadena, CA, USA.

出版信息

Nat Commun. 2025 Jan 16;16(1):714. doi: 10.1038/s41467-025-55987-8.

DOI:10.1038/s41467-025-55987-8
PMID:39821082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11739421/
Abstract

Directed evolution (DE) is a powerful tool to optimize protein fitness for a specific application. However, DE can be inefficient when mutations exhibit non-additive, or epistatic, behavior. Here, we present Active Learning-assisted Directed Evolution (ALDE), an iterative machine learning-assisted DE workflow that leverages uncertainty quantification to explore the search space of proteins more efficiently than current DE methods. We apply ALDE to an engineering landscape that is challenging for DE: optimization of five epistatic residues in the active site of an enzyme. In three rounds of wet-lab experimentation, we improve the yield of a desired product of a non-native cyclopropanation reaction from 12% to 93%. We also perform computational simulations on existing protein sequence-fitness datasets to support our argument that ALDE can be more effective than DE. Overall, ALDE is a practical and broadly applicable strategy to unlock improved protein engineering outcomes.

摘要

定向进化(DE)是一种强大的工具,可针对特定应用优化蛋白质适应性。然而,当突变表现出非加性或上位性时,定向进化可能效率低下。在此,我们提出主动学习辅助定向进化(ALDE),这是一种迭代的机器学习辅助定向进化工作流程,它利用不确定性量化比当前的定向进化方法更有效地探索蛋白质的搜索空间。我们将ALDE应用于一个对定向进化具有挑战性的工程领域:优化一种酶活性位点中的五个上位性残基。在三轮湿实验室实验中,我们将非天然环丙烷化反应所需产物的产率从12%提高到了93%。我们还对现有的蛋白质序列适应性数据集进行了计算模拟,以支持我们的观点,即ALDE比定向进化更有效。总体而言,ALDE是一种实用且广泛适用的策略,可实现更好的蛋白质工程成果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/23337599d78e/41467_2025_55987_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/3fde8738bd44/41467_2025_55987_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/c04d0847d4fd/41467_2025_55987_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/b71de2e9661b/41467_2025_55987_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/84bcab6d6b43/41467_2025_55987_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/23337599d78e/41467_2025_55987_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/3fde8738bd44/41467_2025_55987_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/c04d0847d4fd/41467_2025_55987_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/b71de2e9661b/41467_2025_55987_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/84bcab6d6b43/41467_2025_55987_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d2b/11739421/23337599d78e/41467_2025_55987_Fig5_HTML.jpg

相似文献

1
Active learning-assisted directed evolution.主动学习辅助的定向进化
Nat Commun. 2025 Jan 16;16(1):714. doi: 10.1038/s41467-025-55987-8.
2
Learning Strategies in Protein Directed Evolution.蛋白质定向进化中的学习策略。
Methods Mol Biol. 2022;2461:225-275. doi: 10.1007/978-1-0716-2152-3_15.
3
A combinatorially complete epistatic fitness landscape in an enzyme active site.酶活性位点中的组合完全上位适合度景观。
Proc Natl Acad Sci U S A. 2024 Aug 6;121(32):e2400439121. doi: 10.1073/pnas.2400439121. Epub 2024 Jul 29.
4
Addressing epistasis in the design of protein function.解决蛋白质功能设计中的上位效应。
Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2314999121. doi: 10.1073/pnas.2314999121. Epub 2024 Aug 12.
5
A machine learning approach for reliable prediction of amino acid interactions and its application in the directed evolution of enantioselective enzymes.一种用于可靠预测氨基酸相互作用的机器学习方法及其在对映选择性酶定向进化中的应用。
Sci Rep. 2018 Nov 13;8(1):16757. doi: 10.1038/s41598-018-35033-y.
6
Protein multi-level structure feature-integrated deep learning method for mutational effect prediction.基于蛋白质多层次结构特征的深度学习基因突变效应预测方法。
Biotechnol J. 2024 Aug;19(8):e2400203. doi: 10.1002/biot.202400203.
7
Machine learning-assisted directed protein evolution with combinatorial libraries.机器学习辅助的组合文库定向蛋白质进化。
Proc Natl Acad Sci U S A. 2019 Apr 30;116(18):8852-8858. doi: 10.1073/pnas.1901979116. Epub 2019 Apr 12.
8
Tailoring industrial enzymes for thermostability and activity evolution by the machine learning-based iCASE strategy.通过基于机器学习的iCASE策略定制用于热稳定性和活性进化的工业酶。
Nat Commun. 2025 Jan 11;16(1):604. doi: 10.1038/s41467-025-55944-5.
9
Protein design by directed evolution.通过定向进化进行蛋白质设计。
Annu Rev Biophys. 2008;37:153-73. doi: 10.1146/annurev.biophys.37.032807.125832.
10
Integrating protein language models and automatic biofoundry for enhanced protein evolution.整合蛋白质语言模型与自动化生物铸造厂以促进蛋白质进化。
Nat Commun. 2025 Feb 11;16(1):1553. doi: 10.1038/s41467-025-56751-8.

引用本文的文献

1
Combing Directed Enzyme Evolution with Metabolic Engineering to Develop Efficient Microbial Cell Factories.将定向酶进化与代谢工程相结合以开发高效的微生物细胞工厂。
Chem Bio Eng. 2025 May 1;2(8):449-459. doi: 10.1021/cbe.5c00002. eCollection 2025 Aug 28.
2
Active learning-guided optimization of cell-free biosensors for lead testing in drinking water.主动学习引导的用于饮用水中铅检测的无细胞生物传感器优化
bioRxiv. 2025 Aug 22:2025.08.20.671382. doi: 10.1101/2025.08.20.671382.
3
Mechanistic modeling or machine learning for detecting variants of concern: Why not both?

本文引用的文献

1
Accelerated enzyme engineering by machine-learning guided cell-free expression.通过机器学习引导的无细胞表达实现加速酶工程。
Nat Commun. 2025 Jan 20;16(1):865. doi: 10.1038/s41467-024-55399-0.
2
Benchmarking uncertainty quantification for protein engineering.蛋白质工程中基准不确定性量化
PLoS Comput Biol. 2025 Jan 7;21(1):e1012639. doi: 10.1371/journal.pcbi.1012639. eCollection 2025 Jan.
3
LevSeq: Rapid Generation of Sequence-Function Data for Directed Evolution and Machine Learning.LevSeq:用于定向进化和机器学习的序列-功能数据的快速生成
用于检测关注变体的机制建模或机器学习:为何不两者兼用?
Proc Natl Acad Sci U S A. 2025 Jul 29;122(30):e2513608122. doi: 10.1073/pnas.2513608122. Epub 2025 Jul 21.
4
Advancing genetic engineering with active learning: theory, implementations and potential opportunities.通过主动学习推进基因工程:理论、实现与潜在机遇
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf286.
5
Efficient Searches in Protein Sequence Space Through AI-Driven Iterative Learning.通过人工智能驱动的迭代学习在蛋白质序列空间中进行高效搜索。
Int J Mol Sci. 2025 May 15;26(10):4741. doi: 10.3390/ijms26104741.
6
Unifying sequence-structure coding for advanced protein engineering a multimodal diffusion transformer.用于先进蛋白质工程的统一序列结构编码——一种多模态扩散变换器。
Chem Sci. 2025 May 15. doi: 10.1039/d5sc02055g.
7
Cell-Free Protein Synthesis as a Method to Rapidly Screen Machine Learning-Generated Protease Variants.无细胞蛋白质合成作为一种快速筛选机器学习生成的蛋白酶变体的方法。
ACS Synth Biol. 2025 May 16;14(5):1710-1718. doi: 10.1021/acssynbio.5c00062. Epub 2025 Apr 30.
8
Deaminase-driven random mutation enables efficient DNA mutagenesis for protein evolution.脱氨酶驱动的随机突变可实现用于蛋白质进化的高效DNA诱变。
Chem Sci. 2025 Apr 15. doi: 10.1039/d5sc00861a.
9
Cell-free protein synthesis platforms for accelerating drug discovery.用于加速药物发现的无细胞蛋白质合成平台。
Biotechnol Notes. 2025 Feb 19;6:126-132. doi: 10.1016/j.biotno.2025.02.001. eCollection 2025.
ACS Synth Biol. 2025 Jan 17;14(1):230-238. doi: 10.1021/acssynbio.4c00625. Epub 2024 Dec 24.
4
Rapid in silico directed evolution by a protein language model with EVOLVEpro.通过带有EVOLVEpro的蛋白质语言模型进行快速计算机辅助定向进化。
Science. 2025 Jan 24;387(6732):eadr6006. doi: 10.1126/science.adr6006.
5
Catalyzing the future: recent advances in chemical synthesis using enzymes.催化未来:酶在化学合成中的最新进展。
Curr Opin Chem Biol. 2024 Dec;83:102536. doi: 10.1016/j.cbpa.2024.102536. Epub 2024 Oct 5.
6
Machine learning-guided co-optimization of fitness and diversity facilitates combinatorial library design in enzyme engineering.机器学习引导的适应性和多样性协同优化促进了酶工程组合文库设计。
Nat Commun. 2024 Jul 29;15(1):6392. doi: 10.1038/s41467-024-50698-y.
7
A combinatorially complete epistatic fitness landscape in an enzyme active site.酶活性位点中的组合完全上位适合度景观。
Proc Natl Acad Sci U S A. 2024 Aug 6;121(32):e2400439121. doi: 10.1073/pnas.2400439121. Epub 2024 Jul 29.
8
Enhanced Sequence-Activity Mapping and Evolution of Artificial Metalloenzymes by Active Learning.通过主动学习增强人工金属酶的序列-活性映射及进化
ACS Cent Sci. 2024 May 22;10(7):1357-1370. doi: 10.1021/acscentsci.4c00258. eCollection 2024 Jul 24.
9
Calibrated geometric deep learning improves kinase-drug binding predictions.校准后的几何深度学习改进了激酶-药物结合预测。
Nat Mach Intell. 2023 Dec;5(12):1390-1401. doi: 10.1038/s42256-023-00751-0. Epub 2023 Nov 6.
10
Stereospecific Enzymatic Conversion of Boronic Acids to Amines.硼酸向胺的立体特异性酶促转化
J Am Chem Soc. 2024 Jul 17;146(28):19160-19167. doi: 10.1021/jacs.4c04190. Epub 2024 Jul 3.