• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过主动学习增强人工金属酶的序列-活性映射及进化

Enhanced Sequence-Activity Mapping and Evolution of Artificial Metalloenzymes by Active Learning.

作者信息

Vornholt Tobias, Mutný Mojmír, Schmidt Gregor W, Schellhaas Christian, Tachibana Ryo, Panke Sven, Ward Thomas R, Krause Andreas, Jeschek Markus

机构信息

Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058 Basel, Switzerland.

National Centre of Competence in Research (NCCR) Molecular Systems Engineering, 4056 Basel,Switzerland.

出版信息

ACS Cent Sci. 2024 May 22;10(7):1357-1370. doi: 10.1021/acscentsci.4c00258. eCollection 2024 Jul 24.

DOI:10.1021/acscentsci.4c00258
PMID:39071060
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11273458/
Abstract

Tailored enzymes are crucial for the transition to a sustainable bioeconomy. However, enzyme engineering is laborious and failure-prone due to its reliance on serendipity. The efficiency and success rates of engineering campaigns may be improved by applying machine learning to map the sequence-activity landscape based on small experimental data sets. Yet, it often proves challenging to reliably model large sequence spaces while keeping the experimental effort tractable. To address this challenge, we present an integrated pipeline combining large-scale screening with active machine learning, which we applied to engineer an artificial metalloenzyme (ArM) catalyzing a new-to-nature hydroamination reaction. Combining lab automation and next-generation sequencing, we acquired sequence-activity data for several thousand ArM variants. We then used Gaussian process regression to model the activity landscape and guide further screening rounds. Critical characteristics of our pipeline include the cost-effective generation of information-rich data sets, the integration of an explorative round to improve the model's performance, and the inclusion of experimental noise. Our approach led to an order-of-magnitude boost in the hit rate while making efficient use of experimental resources. Search strategies like this should find broad utility in enzyme engineering and accelerate the development of novel biocatalysts.

摘要

定制酶对于向可持续生物经济的转型至关重要。然而,由于酶工程依赖于偶然性,因此它既费力又容易失败。通过应用机器学习基于小型实验数据集绘制序列-活性图谱,可以提高工程改造活动的效率和成功率。然而,在保持实验工作量可控的同时,可靠地对大型序列空间进行建模往往具有挑战性。为了应对这一挑战,我们提出了一种将大规模筛选与主动机器学习相结合的集成流程,并将其应用于设计一种催化新型氢化胺化反应的人工金属酶(ArM)。结合实验室自动化和下一代测序技术,我们获得了数千个ArM变体的序列-活性数据。然后,我们使用高斯过程回归对活性图谱进行建模,并指导进一步的筛选轮次。我们流程的关键特性包括经济高效地生成信息丰富的数据集、整合探索轮次以提高模型性能以及纳入实验噪声。我们的方法在有效利用实验资源的同时,使命中率提高了一个数量级。这样的搜索策略在酶工程中应具有广泛的用途,并加速新型生物催化剂的开发。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/fd10d6861d59/oc4c00258_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/118f075b228f/oc4c00258_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/3d4880b22972/oc4c00258_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/7d32c3dc2400/oc4c00258_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/af59da4794b6/oc4c00258_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/fd10d6861d59/oc4c00258_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/118f075b228f/oc4c00258_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/3d4880b22972/oc4c00258_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/7d32c3dc2400/oc4c00258_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/af59da4794b6/oc4c00258_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2fa7/11273458/fd10d6861d59/oc4c00258_0005.jpg

相似文献

1
Enhanced Sequence-Activity Mapping and Evolution of Artificial Metalloenzymes by Active Learning.通过主动学习增强人工金属酶的序列-活性映射及进化
ACS Cent Sci. 2024 May 22;10(7):1357-1370. doi: 10.1021/acscentsci.4c00258. eCollection 2024 Jul 24.
2
Systematic engineering of artificial metalloenzymes for new-to-nature reactions.用于新型天然反应的人工金属酶的系统工程。
Sci Adv. 2021 Jan 22;7(4). doi: 10.1126/sciadv.abe4208. Print 2021 Jan.
3
Periplasmic Screening for Artificial Metalloenzymes.人工金属酶的周质筛选
Methods Enzymol. 2016;580:539-56. doi: 10.1016/bs.mie.2016.05.037. Epub 2016 Jun 15.
4
PyPEF-An Integrated Framework for Data-Driven Protein Engineering.PyPEF——一个用于数据驱动的蛋白质工程的集成框架。
J Chem Inf Model. 2021 Jul 26;61(7):3463-3476. doi: 10.1021/acs.jcim.1c00099. Epub 2021 Jul 14.
5
Directed evolution of artificial metalloenzymes for in vivo metathesis.人工金属酶的定向进化用于体内复分解反应。
Nature. 2016 Sep 29;537(7622):661-665. doi: 10.1038/nature19114. Epub 2016 Aug 29.
6
Machine learning-assisted enzyme engineering.机器学习辅助酶工程。
Methods Enzymol. 2020;643:281-315. doi: 10.1016/bs.mie.2020.05.005. Epub 2020 Jun 12.
7
Automated in vivo enzyme engineering accelerates biocatalyst optimization.自动化体内酶工程加速生物催化剂优化。
Nat Commun. 2024 Apr 24;15(1):3447. doi: 10.1038/s41467-024-46574-4.
8
Data-driven enzyme engineering to identify function-enhancing enzymes.基于数据的酶工程改造以鉴定功能增强型酶。
Protein Eng Des Sel. 2023 Jan 21;36. doi: 10.1093/protein/gzac009.
9
10
Navigating the protein fitness landscape with Gaussian processes.用高斯过程探索蛋白质适应性景观。
Proc Natl Acad Sci U S A. 2013 Jan 15;110(3):E193-201. doi: 10.1073/pnas.1215251110. Epub 2012 Dec 31.

引用本文的文献

1
Biophysics-based protein language models for protein engineering.用于蛋白质工程的基于生物物理学的蛋白质语言模型。
Nat Methods. 2025 Sep 11. doi: 10.1038/s41592-025-02776-2.
2
Data-driven protease engineering by DNA-recording and epistasis-aware machine learning.通过DNA记录和上位性感知机器学习进行数据驱动的蛋白酶工程。
Nat Commun. 2025 Jul 1;16(1):5466. doi: 10.1038/s41467-025-60622-7.
3
Designing Enzymatic Reactivity with an Expanded Palette.利用扩展的调色板设计酶活性。

本文引用的文献

1
Self-driving laboratories to autonomously navigate the protein fitness landscape.自动驾驶实验室自主探索蛋白质适应性景观。
Nat Chem Eng. 2024 Jan;1(1):97-107. doi: 10.1038/s44286-023-00002-4. Epub 2024 Jan 11.
2
Binding Interactions and Inhibition Mechanisms of Gold Complexes in Thiamine Diphosphate-Dependent Enzymes.硫胺素二磷酸依赖性酶中金配合物的结合相互作用和抑制机制。
Biochemistry. 2023 Nov 21;62(22):3303-3311. doi: 10.1021/acs.biochem.3c00376. Epub 2023 Nov 6.
3
Genome-wide prediction of disease variant effects with a deep protein language model.
Chembiochem. 2025 Jun 3;26(11):e202500076. doi: 10.1002/cbic.202500076. Epub 2025 Apr 4.
4
Machine learning-driven discovery of highly selective antifungal peptides containing non-canonical β-amino acids.机器学习驱动发现含非经典β-氨基酸的高选择性抗真菌肽
Chem Sci. 2025 Feb 20;16(13):5579-5594. doi: 10.1039/d4sc06689h. eCollection 2025 Mar 26.
5
Single-Walled Carbon Nanotube Probes for Protease Characterization Directly in Cell-Free Expression Reactions.用于在无细胞表达反应中直接表征蛋白酶的单壁碳纳米管探针。
bioRxiv. 2025 Jan 14:2025.01.11.632549. doi: 10.1101/2025.01.11.632549.
6
Active learning-assisted directed evolution.主动学习辅助的定向进化
Nat Commun. 2025 Jan 16;16(1):714. doi: 10.1038/s41467-025-55987-8.
7
Biophysics-based protein language models for protein engineering.用于蛋白质工程的基于生物物理学的蛋白质语言模型。
bioRxiv. 2025 Jan 14:2024.03.15.585128. doi: 10.1101/2024.03.15.585128.
利用深度蛋白质语言模型进行全基因组疾病变异效应预测。
Nat Genet. 2023 Sep;55(9):1512-1522. doi: 10.1038/s41588-023-01465-0. Epub 2023 Aug 10.
4
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
5
Learning Strategies in Protein Directed Evolution.蛋白质定向进化中的学习策略。
Methods Mol Biol. 2022;2461:225-275. doi: 10.1007/978-1-0716-2152-3_15.
6
Machine learning to navigate fitness landscapes for protein engineering.机器学习在蛋白质工程中的应用:探索适应度景观
Curr Opin Biotechnol. 2022 Jun;75:102713. doi: 10.1016/j.copbio.2022.102713. Epub 2022 Apr 9.
7
evSeq: Cost-Effective Amplicon Sequencing of Every Variant in a Protein Library.evSeq:蛋白质文库中每个变体的经济高效扩增子测序
ACS Synth Biol. 2022 Mar 18;11(3):1313-1324. doi: 10.1021/acssynbio.1c00592. Epub 2022 Feb 17.
8
Algorithm-aided engineering of aliphatic halogenase WelO5* for the asymmetric late-stage functionalization of soraphens.基于算法的脂肪族卤酶 WelO5*工程改造用于索拉芬的不对称后期功能化。
Nat Commun. 2022 Jan 18;13(1):371. doi: 10.1038/s41467-022-27999-1.
9
Learning protein fitness models from evolutionary and assay-labeled data.从进化和实验标记数据中学习蛋白质适应性模型。
Nat Biotechnol. 2022 Jul;40(7):1114-1122. doi: 10.1038/s41587-021-01146-5. Epub 2022 Jan 17.
10
Adaptive machine learning for protein engineering.自适应机器学习在蛋白质工程中的应用。
Curr Opin Struct Biol. 2022 Feb;72:145-152. doi: 10.1016/j.sbi.2021.11.002. Epub 2021 Dec 9.