• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自适应机器学习在蛋白质工程中的应用。

Adaptive machine learning for protein engineering.

机构信息

Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA; Stanford ChEM-H, Stanford University, Stanford, CA, 94305, USA.

Microsoft Research New England, Cambridge, MA, 02142, USA.

出版信息

Curr Opin Struct Biol. 2022 Feb;72:145-152. doi: 10.1016/j.sbi.2021.11.002. Epub 2021 Dec 9.

DOI:10.1016/j.sbi.2021.11.002
PMID:34896756
Abstract

Machine-learning models that learn from data to predict how protein sequence encodes function are emerging as a useful protein engineering tool. However, when using these models to suggest new protein designs, one must deal with the vast combinatorial complexity of protein sequences. Here, we review how to use a sequence-to-function machine-learning surrogate model to select sequences for experimental measurement. First, we discuss how to select sequences through a single round of machine-learning optimization. Then, we discuss sequential optimization, where the goal is to discover optimized sequences and improve the model across multiple rounds of training, optimization, and experimental measurement.

摘要

基于数据学习以预测蛋白质序列如何编码功能的机器学习模型正在成为一种有用的蛋白质工程工具。然而,当使用这些模型来建议新的蛋白质设计时,必须处理蛋白质序列的巨大组合复杂性。在这里,我们回顾如何使用序列到功能的机器学习替代模型来选择用于实验测量的序列。首先,我们讨论如何通过单次机器学习优化来选择序列。然后,我们讨论顺序优化,其目标是在多个训练、优化和实验测量的轮次中发现优化序列并改进模型。

相似文献

1
Adaptive machine learning for protein engineering.自适应机器学习在蛋白质工程中的应用。
Curr Opin Struct Biol. 2022 Feb;72:145-152. doi: 10.1016/j.sbi.2021.11.002. Epub 2021 Dec 9.
2
Machine-learning-guided directed evolution for protein engineering.基于机器学习的定向进化蛋白质工程。
Nat Methods. 2019 Aug;16(8):687-694. doi: 10.1038/s41592-019-0496-6. Epub 2019 Jul 15.
3
Protein sequence design with deep generative models.利用深度生成模型进行蛋白质序列设计。
Curr Opin Chem Biol. 2021 Dec;65:18-27. doi: 10.1016/j.cbpa.2021.04.004. Epub 2021 May 26.
4
Machine learning to navigate fitness landscapes for protein engineering.机器学习在蛋白质工程中的应用:探索适应度景观
Curr Opin Biotechnol. 2022 Jun;75:102713. doi: 10.1016/j.copbio.2022.102713. Epub 2022 Apr 9.
5
Improving Enzyme Fitness with Machine Learning.利用机器学习提高酶的适应性。
Chimia (Aarau). 2023 Mar 29;77(3):116-121. doi: 10.2533/chimia.2023.116.
6
Accurate top protein variant discovery via low-N pick-and-validate machine learning.通过低 N 挑选-验证机器学习实现准确的顶级蛋白质变异体发现。
Cell Syst. 2024 Feb 21;15(2):193-203.e6. doi: 10.1016/j.cels.2024.01.002. Epub 2024 Feb 9.
7
Simulated Design-Build-Test-Learn Cycles for Consistent Comparison of Machine Learning Methods in Metabolic Engineering.模拟设计-构建-测试-学习循环,以在代谢工程中对机器学习方法进行一致比较。
ACS Synth Biol. 2023 Sep 15;12(9):2588-2599. doi: 10.1021/acssynbio.3c00186. Epub 2023 Aug 24.
8
Machine Learning for Biologics: Opportunities for Protein Engineering, Developability, and Formulation.机器学习在生物制药领域的应用:蛋白质工程、可开发性和配方的机遇。
Trends Pharmacol Sci. 2021 Mar;42(3):151-165. doi: 10.1016/j.tips.2020.12.004. Epub 2021 Jan 23.
9
DeCOIL: Optimization of Degenerate Codon Libraries for Machine Learning-Assisted Protein Engineering.DeCOIL:用于机器学习辅助蛋白质工程的简并密码子文库优化。
ACS Synth Biol. 2023 Aug 18;12(8):2444-2454. doi: 10.1021/acssynbio.3c00301. Epub 2023 Jul 31.
10
Develop machine learning-based regression predictive models for engineering protein solubility.开发基于机器学习的回归预测模型,用于工程蛋白质溶解度。
Bioinformatics. 2019 Nov 1;35(22):4640-4646. doi: 10.1093/bioinformatics/btz294.

引用本文的文献

1
Safe model based optimization balancing exploration and reliability for protein sequence design.基于安全模型的优化:平衡蛋白质序列设计中的探索与可靠性
Sci Rep. 2025 Jul 29;15(1):27568. doi: 10.1038/s41598-025-12568-5.
2
Accelerating cell culture media development using Bayesian optimization-based iterative experimental design.使用基于贝叶斯优化的迭代实验设计加速细胞培养基开发
Nat Commun. 2025 Jul 1;16(1):6055. doi: 10.1038/s41467-025-61113-5.
3
Designing diverse and high-performance proteins with a large language model in the loop.
利用大语言模型循环设计多样化且高性能的蛋白质。
PLoS Comput Biol. 2025 Jun 5;21(6):e1013119. doi: 10.1371/journal.pcbi.1013119. eCollection 2025 Jun.
4
Advances in Surrogate Modeling for Biological Agent-Based Simulations: Trends, Challenges, and Future Prospects.基于生物主体的模拟中代理模型的进展:趋势、挑战与未来前景
ArXiv. 2025 Apr 15:arXiv:2504.11617v1.
5
Active learning-assisted directed evolution.主动学习辅助的定向进化
Nat Commun. 2025 Jan 16;16(1):714. doi: 10.1038/s41467-025-55987-8.
6
Bacteriophage RNA polymerases: catalysts for mRNA vaccines and therapeutics.噬菌体RNA聚合酶:mRNA疫苗和治疗药物的催化剂。
Front Mol Biosci. 2024 Nov 21;11:1504876. doi: 10.3389/fmolb.2024.1504876. eCollection 2024.
7
A Study of the Methane Oxidation Mechanism and Reaction Pathways Using Reactive Molecular Simulation and Nonlinear Manifold Learning.利用反应分子模拟和非线性流形学习对甲烷氧化机理及反应途径的研究
ACS Omega. 2024 Oct 17;9(43):43894-43907. doi: 10.1021/acsomega.4c07094. eCollection 2024 Oct 29.
8
Forecasting the trend of tuberculosis incidence in Anhui Province based on machine learning optimization algorithm, 2013-2023.基于机器学习优化算法预测 2013-2023 年安徽省肺结核发病率趋势。
BMC Pulm Med. 2024 Oct 26;24(1):536. doi: 10.1186/s12890-024-03296-z.
9
Reaching New Heights in Genetic Code Manipulation with High Throughput Screening.高通量筛选助力基因密码操作技术新突破
Chem Rev. 2024 Nov 13;124(21):12145-12175. doi: 10.1021/acs.chemrev.4c00329. Epub 2024 Oct 17.
10
Machine learning-assisted amidase-catalytic enantioselectivity prediction and rational design of variants for improving enantioselectivity.机器学习辅助 amidase 催化对映选择性预测和变体的合理设计,以提高对映选择性。
Nat Commun. 2024 Oct 10;15(1):8778. doi: 10.1038/s41467-024-53048-0.