• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度学习与合成生物学的整合:通过 N 端编码序列增强基因表达的协同设计方法。

Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-Terminal Coding Sequences.

机构信息

School of Computing, National University of Singapore, Singapore 117417, Singapore.

Science Center for Future Foods, Jiangnan University, Wuxi 214122, PR China.

出版信息

ACS Synth Biol. 2024 Sep 20;13(9):2960-2968. doi: 10.1021/acssynbio.4c00371. Epub 2024 Sep 4.

DOI:10.1021/acssynbio.4c00371
PMID:39229974
Abstract

N-terminal coding sequence (NCS) influences gene expression by impacting the translation initiation rate. The NCS optimization problem is to find an NCS that maximizes gene expression. The problem is important in genetic engineering. However, current methods for NCS optimization such as rational design and statistics-guided approaches are labor-intensive yield only relatively small improvements. This paper introduces a deep learning/synthetic biology codesigned few-shot training workflow for NCS optimization. Our method utilizes -nearest encoding followed by word2vec to encode the NCS, then performs feature extraction using attention mechanisms, before constructing a time-series network for predicting gene expression intensity, and finally a direct search algorithm identifies the optimal NCS with limited training data. We took green fluorescent protein (GFP) expressed by as a reporting protein of NCSs, and employed the fluorescence enhancement factor as the metric of NCS optimization. Within just six iterative experiments, our model generated an NCS (MLD) that increased average GFP expression by 5.41-fold, outperforming the state-of-the-art NCS designs. Extending our findings beyond GFP, we showed that our engineered NCS (MLD) can effectively boost the production of N-acetylneuraminic acid by enhancing the expression of the crucial rate-limiting gene, demonstrating its practical utility. We have open-sourced our NCS expression database and experimental procedures for public use.

摘要

N 端编码序列 (NCS) 通过影响翻译起始速率来影响基因表达。NCS 优化问题是找到一个能够最大限度地提高基因表达的 NCS。这个问题在基因工程中非常重要。然而,目前的 NCS 优化方法,如理性设计和统计指导方法,都很耗时,只能取得相对较小的改进。本文介绍了一种深度学习/合成生物学联合设计的少样本训练工作流程,用于 NCS 优化。我们的方法使用最近邻编码和 word2vec 对 NCS 进行编码,然后使用注意力机制进行特征提取,再构建一个时间序列网络来预测基因表达强度,最后通过直接搜索算法在有限的训练数据中确定最优 NCS。我们以绿色荧光蛋白 (GFP) 为报告蛋白,采用荧光增强因子作为 NCS 优化的指标。在仅仅六次迭代实验中,我们的模型生成了一个 NCS (MLD),使 GFP 的平均表达量提高了 5.41 倍,优于最先进的 NCS 设计。我们将我们的发现扩展到 GFP 之外,展示了我们设计的工程 NCS (MLD) 可以通过增强关键限速酶的表达来有效提高 N-乙酰神经氨酸的产量,证明了其实际应用价值。我们已经开源了我们的 NCS 表达数据库和实验程序,供公众使用。

相似文献

1
Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-Terminal Coding Sequences.深度学习与合成生物学的整合:通过 N 端编码序列增强基因表达的协同设计方法。
ACS Synth Biol. 2024 Sep 20;13(9):2960-2968. doi: 10.1021/acssynbio.4c00371. Epub 2024 Sep 4.
2
Model-driven design of synthetic N-terminal coding sequences for regulating gene expression in yeast and bacteria.基于模型的酵母和细菌中基因表达调控的合成 N 端编码序列设计。
Biotechnol J. 2022 May;17(5):e2100655. doi: 10.1002/biot.202100655. Epub 2022 Feb 1.
3
Synthetic N-terminal coding sequences for fine-tuning gene expression and metabolic engineering in Bacillus subtilis.枯草芽孢杆菌中基因表达精细调控和代谢工程的合成 N 端编码序列。
Metab Eng. 2019 Sep;55:131-141. doi: 10.1016/j.ymben.2019.07.001. Epub 2019 Jul 6.
4
Rational Design of the N-Terminal Coding Sequence for Regulating Enzyme Expression in .调控. 中酶表达的 N 端编码序列的合理设计
ACS Synth Biol. 2021 Feb 19;10(2):265-276. doi: 10.1021/acssynbio.0c00309. Epub 2021 Jan 19.
5
Promoter Screening from Bacillus subtilis in Various Conditions Hunting for Synthetic Biology and Industrial Applications.在各种条件下从枯草芽孢杆菌中筛选启动子以寻找合成生物学和工业应用
PLoS One. 2016 Jul 5;11(7):e0158447. doi: 10.1371/journal.pone.0158447. eCollection 2016.
6
A part toolbox to tune genetic expression in Bacillus subtilis.一个用于调节枯草芽孢杆菌基因表达的部件工具箱。
Nucleic Acids Res. 2016 Sep 6;44(15):7495-508. doi: 10.1093/nar/gkw624. Epub 2016 Jul 8.
7
AI-driven high-throughput droplet screening of cell-free gene expression.基于人工智能的无细胞基因表达高通量微滴筛选
Nat Commun. 2025 Mar 19;16(1):2720. doi: 10.1038/s41467-025-58139-0.
8
Modular pathway engineering of key carbon-precursor supply-pathways for improved N-acetylneuraminic acid production in Bacillus subtilis.枯草芽孢杆菌中关键碳前体供应途径的模块化途径工程改造,以提高 N-乙酰神经氨酸的产量。
Biotechnol Bioeng. 2018 Sep;115(9):2217-2231. doi: 10.1002/bit.26743. Epub 2018 Jul 12.
9
[Genetically marking of natural biocontrol bacterium Bacillus subtilis strains with green fluorescent protein gene].[利用绿色荧光蛋白基因对天然生防细菌枯草芽孢杆菌菌株进行遗传标记]
Sheng Wu Gong Cheng Xue Bao. 2003 Sep;19(5):551-5.
10
Controlling mammalian gene expression by allosteric hepatitis delta virus ribozymes.通过变构性丁型肝炎病毒核酶控制哺乳动物基因表达。
ACS Synth Biol. 2013 Dec 20;2(12):684-9. doi: 10.1021/sb400037a. Epub 2013 May 22.

引用本文的文献

1
Reconstitution of Methionine Cycle With ATP Regeneration for Whole-Cell Catalysis of Creatine Production in Engineered Escherichia coli.通过ATP再生重建甲硫氨酸循环用于工程化大肠杆菌中肌酸生产的全细胞催化
Microb Biotechnol. 2025 May;18(5):e70145. doi: 10.1111/1751-7915.70145.
2
An evolved, orthogonal ssDNA generator for targeted hypermutation of multiple genomic loci.一种经过改进的、用于多个基因组位点靶向超突变的正交单链DNA生成器。
Nucleic Acids Res. 2025 Jan 24;53(3). doi: 10.1093/nar/gkaf051.