• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习集成导向的基因编码荧光钙指示剂工程

Machine Learning Ensemble Directed Engineering of Genetically Encoded Fluorescent Calcium Indicators.

作者信息

Wait Sarah J, Rappleye Michael, Lee Justin Daho, Goy Marc Exposit, Smith Netta, Berndt Andre

机构信息

Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA.

Institute of Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA.

出版信息

Res Sq. 2023 Aug 7:rs.3.rs-3146778. doi: 10.21203/rs.3.rs-3146778/v1.

DOI:10.21203/rs.3.rs-3146778/v1
PMID:37609342
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10441480/
Abstract

In this study, we focused on the transformative potential of machine learning in the engineering of genetically encoded fluorescent indicators (GEFIs), protein-based sensing tools that are critical for real-time monitoring of biological activity. GEFIs are complex proteins with multiple dynamic states, rendering optimization by trial-and-error mutagenesis a challenging problem. We applied an alternative approach using machine learning to predict the outcomes of sensor mutagenesis by analyzing established libraries that link sensor sequences to functions. Using the GCaMP calcium indicator as a scaffold, we developed an ensemble of three regression models trained on experimentally derived GCaMP mutation libraries. We used the trained ensemble to perform an in silico functional screen on 1423 novel, uncharacterized GCaMP variants. As a result, we identified the novel ensemble-derived GCaMP (eGCaMP) variants, eGCaMP and eGCaMP+, that achieve both faster kinetics and larger fluorescent responses upon stimulation than previously published fast variants. Furthermore, we identified a combinatorial mutation with extraordinary dynamic range, eGCaMP2+, that outperforms the tested 6th, 7th, and 8th generation GCaMPs. These findings demonstrate the value of machine learning as a tool to facilitate the efficient pre-screening of mutants for functional characteristics. By leveraging the learning capabilities of our ensemble, we were able to accelerate the identification of promising mutations and reduce the experimental burden associated with trial-and-error mutagenesis. Overall, these findings have significant implications for optimizing GEFIs and other protein-based tools, demonstrating the utility of machine learning as a powerful asset in protein engineering.

摘要

在本研究中,我们聚焦于机器学习在基因编码荧光指示剂(GEFIs)工程中的变革潜力,GEFIs是基于蛋白质的传感工具,对生物活性的实时监测至关重要。GEFIs是具有多种动态状态的复杂蛋白质,通过反复试验诱变进行优化是一个具有挑战性的问题。我们采用了一种替代方法,利用机器学习通过分析将传感器序列与功能联系起来的已建立文库来预测传感器诱变的结果。以GCaMP钙指示剂为支架,我们开发了一组基于实验得出的GCaMP突变文库训练的三个回归模型。我们使用训练好的模型对1423个新的、未表征的GCaMP变体进行了虚拟功能筛选。结果,我们鉴定出了新的基于模型得出的GCaMP(eGCaMP)变体,即eGCaMP和eGCaMP+,它们在受到刺激时比之前发表的快速变体具有更快的动力学和更大的荧光响应。此外,我们还鉴定出了具有非凡动态范围的组合突变eGCaMP2+,其性能优于测试的第六代、第七代和第八代GCaMP。这些发现证明了机器学习作为一种工具在促进对突变体功能特性进行高效预筛选方面的价值。通过利用我们模型的学习能力,我们能够加速识别有前景的突变,并减少与反复试验诱变相关的实验负担。总体而言,这些发现对优化GEFIs和其他基于蛋白质的工具具有重要意义,证明了机器学习作为蛋白质工程中一种强大资产的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/31e76e94fe97/nihpp-rs3146778v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/8ee3c2e3876c/nihpp-rs3146778v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/a644a76aba29/nihpp-rs3146778v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/fe99099d313b/nihpp-rs3146778v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/1caa6eeb6f7a/nihpp-rs3146778v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/31e76e94fe97/nihpp-rs3146778v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/8ee3c2e3876c/nihpp-rs3146778v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/a644a76aba29/nihpp-rs3146778v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/fe99099d313b/nihpp-rs3146778v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/1caa6eeb6f7a/nihpp-rs3146778v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/04a7/10441480/31e76e94fe97/nihpp-rs3146778v1-f0005.jpg

相似文献

1
Machine Learning Ensemble Directed Engineering of Genetically Encoded Fluorescent Calcium Indicators.基于机器学习集成导向的基因编码荧光钙指示剂工程
Res Sq. 2023 Aug 7:rs.3.rs-3146778. doi: 10.21203/rs.3.rs-3146778/v1.
2
Machine learning-guided engineering of genetically encoded fluorescent calcium indicators.基于机器学习的基因编码荧光钙指示剂的工程设计。
Nat Comput Sci. 2024 Mar;4(3):224-236. doi: 10.1038/s43588-024-00611-w. Epub 2024 Mar 21.
3
Optogenetic Microwell Array Screening System: A High-Throughput Engineering Platform for Genetically Encoded Fluorescent Indicators.光遗传微井阵列筛选系统:用于基因编码荧光指示剂的高通量工程平台。
ACS Sens. 2023 Nov 24;8(11):4233-4244. doi: 10.1021/acssensors.3c01573. Epub 2023 Nov 13.
4
Development and characterization of novel jGCaMP8f calcium sensor variants with improved kinetics and fluorescence response range.具有改进动力学和荧光响应范围的新型jGCaMP8f钙传感器变体的开发与表征
Front Cell Neurosci. 2023 May 18;17:1155406. doi: 10.3389/fncel.2023.1155406. eCollection 2023.
5
Machine learning-assisted directed protein evolution with combinatorial libraries.机器学习辅助的组合文库定向蛋白质进化。
Proc Natl Acad Sci U S A. 2019 Apr 30;116(18):8852-8858. doi: 10.1073/pnas.1901979116. Epub 2019 Apr 12.
6
Facilitating Machine Learning-Guided Protein Engineering with Smart Library Design and Massively Parallel Assays.通过智能文库设计和大规模平行分析促进机器学习引导的蛋白质工程
Adv Genet (Hoboken). 2021 Dec 7;2(4):2100038. doi: 10.1002/ggn2.202100038. eCollection 2021 Dec.
7
Fast calcium sensor proteins for monitoring neural activity.用于监测神经活动的快速钙传感器蛋白。
Neurophotonics. 2014 Oct;1(2):025008. doi: 10.1117/1.NPh.1.2.025008.
8
Ensemble Learning with Supervised Methods Based on Large-Scale Protein Language Models for Protein Mutation Effects Prediction.基于大规模蛋白质语言模型的监督方法的集成学习在蛋白质突变效应预测中的应用。
Int J Mol Sci. 2023 Nov 18;24(22):16496. doi: 10.3390/ijms242216496.
9
Functional imaging-guided cell selection for evolving genetically encoded fluorescent indicators.功能成像引导细胞选择用于不断发展的遗传编码荧光指示剂。
Cell Rep Methods. 2023 Jul 27;3(8):100544. doi: 10.1016/j.crmeth.2023.100544. eCollection 2023 Aug 28.
10
Improved calcium sensor GCaMP-X overcomes the calcium channel perturbations induced by the calmodulin in GCaMP.改进型钙传感器 GCaMP-X 克服了钙调蛋白在 GCaMP 中引起的钙通道扰动。
Nat Commun. 2018 Apr 17;9(1):1504. doi: 10.1038/s41467-018-03719-6.